
java - Looping through a table with Selenium WebDriver

Reposted. Author: 行者123. Updated: 2023-11-30 07:33:58

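I have a table that can be found here: Ontario Gov Employee Directory. I am trying to loop through the table to extract the data, but I am having trouble finding an XPath that will do it.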

The table doesn't have an id. When I inspect the element, I see:

<table title="results_list" border="0" width="100%" cellspacing="0" cellpadding="0">

<tbody>
<tr>
<td class="content" valign="top" align="right" width="50">1. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("32528")'>Aagaard, Lindsay</a>] [ Senior Policy Advisor ] [TREASURY BOARD SECRETARIAT]
<br>[DEPUTY PREMIER AND PRESIDENT OF THE TREASURY BOARD, Toronto]

<!-- [416-327-0948] -->



[416-327-0948] [



<a href="mailto:lindsay.aagaard@ontario.ca">
lindsay.aagaard@ontario.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">2. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("34417")'>Aalto, Margaret</a>] [ Probation Officer ] [CHILDREN AND YOUTH SERVICES]
<br>[THUNDER BAY, Thunder Bay]

<!-- [807-475-1310] -->



[807-475-1310] [



<a href="mailto:margaret.aalto@ontario.ca">
margaret.aalto@ontario.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">3. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("9187")'>Aarlaht, Andrew</a>] [ Business Analyst ] [COMMUNITY AND SOCIAL SERVICES]
<br>[HAMILTON, BUSINESS SERVICES UNIT, Hamilton]

<!-- [905-521-7335] -->



[905-521-7335] [



<a href="mailto:andrew.aarlaht@ontario.ca">
andrew.aarlaht@ontario.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">4. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("9187")'>Aarlaht, Andrew</a>] [ Business Analyst ] [CHILDREN AND YOUTH SERVICES]
<br>[HAMILTON, BUSINESS SERVICES UNIT, Hamilton]

<!-- [905-521-7335] -->



[905-521-7335] [



<a href="mailto:andrew.aarlaht@ontario.ca">
andrew.aarlaht@ontario.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">5. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("19146")'>Aarons, Drew</a>] [ Messenger ] [LEGISLATIVE OFFICES]
<br>[PARLIAMENTARY PROTOCOL, Toronto]

<!-- [416-325-7455] -->



[416-325-7455] [



<a href="mailto:daarons@ola.org">
daarons@ola.org</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">6. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("113729")'>Aaswaakshin, Neegann</a>] [ Articling Student ] [ABORIGINAL AFFAIRS]
<br>[LEGAL SERVICES, Toronto]

<!-- [416-212-2271] -->



[416-212-2271] [



<a href="mailto:Neegann.Aaswaakshin@ontario.ca">
Neegann.Aaswaakshin@ontario.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">7. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("32196")'>Abad, Lilian</a>] [ Executive Assistant ] [TRANSPORTATION]
<br>[GO TRANSIT, Toronto]

<!-- [416-202-5506] -->



[416-202-5506] [



<a href="mailto:lilian.abad@gotransit.com">
lilian.abad@gotransit.com</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">8. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("114240")'>Abadesso, Jennifer</a>] [ Employment Program Consultant (Acting) ] [TRAINING, COLLEGES AND UNIVERSITIES]
<br>[FOUNDATION SKILLS, Toronto]

<!-- [416-327-2065] -->



[416-327-2065] [



<a href="mailto:jennifer.abadesso@ontario.ca">
jennifer.abadesso@ontario.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">9. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("104293")'>Abakunzi, Louis</a>] [ Customer Service Representative (Bilingual) ] [GOVERNMENT AND CONSUMER SERVICES]
<br>[SERVICEONTARIO CONTACT CENTRE - NORTH YORK, Toronto]

<!-- [416-235-2999] -->



[416-235-2999] [



<a href="mailto:Louis.K.Abakunzi@ontario.ca">
Louis.K.Abakunzi@ontario.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

<tr>
<td class="content" valign="top" align="right" width="50">10. &nbsp;</td>
<td class="content">[<a class="results" href='javascript:showEmployeeDetail("19309")'>Aban, Edencio</a>] [ Audit Supervisor ] [ATTORNEY GENERAL]
<br>[AUDIT AND COMPLIANCE, Toronto]

<!-- [416-326-6295] -->



[416-326-6295] [



<a href="mailto:edencio.aban@agco.ca">
edencio.aban@agco.ca</a>]
</td>
</tr>
<tr>
<td>&nbsp;</td>
</tr>

</tbody>
</table>

How can I loop through the data in these rows?

Best answer

This is a table within a table, and it has fairly standard formatting. What challenge are you facing?

The table doesn't have an id, when I inspect the element I see:

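It has other attributes you can use, such as title. Use the XPath //table[@title="results_list"]/tbody/tr/td to find every data cell in the innermost table. Alternatively, drop the trailing /td from the XPath to get each row instead, then find each td element under that row and use its text.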
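The row-by-row variant can be sketched roughly as follows. This is a minimal sketch, not the answerer's code: it assumes a Selenium 4 style driver whose `find_elements` accepts the locator strategy as the plain string `"xpath"` (the value of `By.XPATH`), and the names `ROWS_XPATH` and `iter_row_texts` are hypothetical. It also skips the spacer rows that hold only `&nbsp;`.

```python
from typing import Iterator, List

# XPath from the answer with the trailing /td dropped, so it selects rows.
ROWS_XPATH = '//table[@title="results_list"]/tbody/tr'


def iter_row_texts(driver) -> Iterator[List[str]]:
    """Yield the stripped cell texts of each non-empty row of the results table.

    `driver` is assumed to be an already-started WebDriver that has the
    results page loaded.
    """
    # "xpath" is the string value of selenium's By.XPATH locator strategy.
    for row in driver.find_elements("xpath", ROWS_XPATH):
        # "./td" is relative to the current row, so only its own cells match.
        cells = [td.text.strip() for td in row.find_elements("xpath", "./td")]
        # Spacer rows contain only a non-breaking space, which strip() removes.
        if any(cells):
            yield cells
```

Each yielded list should hold the serial number first and the bracketed record second, for example `for serial, record in iter_row_texts(driver): ...`.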

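Note: in the innermost table, the first column contains the serial number and the second column contains the actual data. I suggest getting each td and then reading either its "innerHTML" attribute or elem.text. After that, use a regular expression to extract the different parts.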

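>>> from selenium.webdriver.common.by import By
>>> # `driver` is assumed to be a started WebDriver with the results page loaded
>>> all_tdata = driver.find_elements(By.XPATH, '//table[@title="results_list"]/tbody/tr/td')
>>> for td in all_tdata:
...     print(td.get_attribute('innerHTML'))  # save this in a variable and regex it
...     # or
...     data = td.text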
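The regex step is not spelled out in the answer, so here is one possible sketch of it. It works on the value of `td.text` rather than `innerHTML`, because the raw HTML also contains a commented-out copy of the phone number in brackets. The field names in `FIELDS`, the helper `parse_cell`, and the `sample` string are all assumptions inferred from the rows shown in the question; the whitespace Selenium actually returns may differ.

```python
import re

# One bracketed field: anything between "[" and "]" with no nested brackets.
BRACKETED = re.compile(r"\[([^\[\]]*)\]")

# Hypothetical field names, inferred from the sample rows in the question.
FIELDS = ("name", "title", "ministry", "unit", "phone", "email")


def parse_cell(text):
    """Split one data cell's text into named fields.

    Returns None when the cell does not contain exactly six bracketed
    values, which is the case for the serial-number and spacer cells.
    """
    # Collapse internal whitespace and newlines inside each bracketed value.
    values = [" ".join(v.split()) for v in BRACKETED.findall(text)]
    if len(values) != len(FIELDS):
        return None
    return dict(zip(FIELDS, values))


# Assumed value of td.text for the first data cell in the question.
sample = (
    "[Aagaard, Lindsay] [ Senior Policy Advisor ] [TREASURY BOARD SECRETARIAT]\n"
    "[DEPUTY PREMIER AND PRESIDENT OF THE TREASURY BOARD, Toronto] "
    "[416-327-0948] [ lindsay.aagaard@ontario.ca]"
)
record = parse_cell(sample)
print(record)
```

Inside the loop above, `parse_cell(td.text)` could replace the plain `data = td.text` assignment, skipping any cell for which it returns `None`.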

Regarding java - looping through a table with Selenium WebDriver, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/35675895/
