python - 如何使用 XPath Selenium 和 Python 从 <p> 标签获取文本-6ren

python - 如何使用 XPath Selenium 和 Python 从
标签获取文本

转载作者：行者123 更新时间：2023-12-02 16:34:34

24

4

我需要用 XPath 从 <p> 中的文本中捕获一行.我需要存储文本 Content-type: text/plain; charset=us-ascii进入 python 中的变量，但出现下一个错误:

selenium.common.exceptions.WebDriverException: Message: TypeError: Expected an element or WindowProxy, got: [object Text] {}

这是我正在尝试的代码:

import selenium.webdriver as webdriver

browser = webdriver.Firefox()
browser.get('https://www.w3.org/Protocols/rfc1341/7_1_Text.html')

foo = browser.find_element_by_xpath('/html/body/p[5]/text()')
print(foo)

<h1>7.1  The Text Content-Type</h1>
<p>
The text Content-Type is intended for sending material which
is  principally textual in form.  It is the default Content-
Type.  A "charset" parameter may be  used  to  indicate  the
character set of the body text.  The primary subtype of text
is "plain".  This indicates plain (unformatted)  text.   The
default  Content-Type  for  Internet  mail  is  "text/plain;
charset=us-ascii".
<p>
Beyond plain text, there are many formats  for  representing
what might be known as "extended text" -- text with embedded
formatting and  presentation  information.   An  interesting
characteristic of many such representations is that they are
to some extent  readable  even  without  the  software  that
interprets  them.   It is useful, then, to distinguish them,
at the highest level, from such unreadable data  as  images,
audio,  or  text  represented in an unreadable form.  In the
absence  of  appropriate  interpretation  software,  it   is
reasonable to show subtypes of text to the user, while it is
not reasonable to do so with most nontextual data.
<p>
Such formatted textual  data  should  be  represented  using
subtypes  of text.  Plausible subtypes of text are typically
given by the common name of the representation format, e.g.,
"text/richtext".
<p>
<h3>7.1.1     The charset parameter</h3>
<p>
A critical parameter that may be specified in  the  Content-
Type  field  for  text  data  is the character set.  This is
specified with a "charset" parameter, as in:
<p>
     Content-type: text/plain; charset=us-ascii
<p>
Unlike some  other  parameter  values,  the  values  of  the
charset  parameter  are  NOT  case  sensitive.   The default
character set, which must be assumed in  the  absence  of  a
charset parameter, is US-ASCII.

最佳答案

打印文本 Content-type: text/plain; charset=us-ascii 你必须诱导 WebDriverWait对于 visibility_of_element_located()，您可以使用以下任一项 Locator Strategies :

使用 XPATH 和 text 属性:

driver.get("https://www.w3.org/Protocols/rfc1341/7_1_Text.html")
print(WebDriverWait(driver, 20).until(EC.visibility_of_element_located((By.XPATH, "//h3[contains(., 'The charset parameter')]//following-sibling::p[2]"))).text)

使用 XPATH 和 get_attribute():

driver.get("https://www.w3.org/Protocols/rfc1341/7_1_Text.html")
print(WebDriverWait(driver, 20).until(EC.visibility_of_element_located((By.XPATH, "//h3[contains(., 'The charset parameter')]//following-sibling::p[2]"))).get_attribute("innerHTML"))

控制台输出:

Content-type: text/plain; charset=us-ascii

注意:您必须添加以下导入:

from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.common.by import By
from selenium.webdriver.support import expected_conditions as EC

关于python - 如何使用 XPath Selenium 和 Python 从 <p> 标签获取文本，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/62925043/

24

4

0

文章推荐： arrays - 映射减少字符串中的总和项目权重

文章推荐： python - 从图像中删除红色文本

文章推荐： java - 是否有独立于平台的方式来列出视频输入设备

xpath - [xpath]在获取 xpath 以获得相邻表格行的帮助时需要帮助
编辑:为什么这被否决了？我真的不知道...顺便说一句 ../不起作用，因为我不想要 Table 的父级但实际上想要 ../td+1 我不知道这是否可能？嗨，大家好。我手头有一个相当复杂的问题..
java - Xpath:使用同级 xpath 定位同级 xpath
我很难找到需要单击的输入(复选框)元素的 xpath。我正在尝试使用其他跨度元素来定位它。元素包含 Angular 属性，不知道这是否重要？元素的结构如下: Company name
xpath - xpath 中从不使用哪些字符？
我正在尝试构建一个包含许多 XPath 作为参数的 DSL。我是 XPath 的新手，我需要一个 XPath 语法中从未使用过的字符，这样我就可以在脚本的一行中分隔 n 个 XPath。我的问题:哪些
xpath - Xpath，用于检索父级内部的所有标签
使用xpath在父标签内找到特定标签：输入样例：
xpath - Xpath，在其他两个节点之间查找节点
我需要构造一个通用XPath来找到正确的节点，其中的标准是日期和时间。例如查找“ 5 May”，“ 12:17:44”的节点 XML具有日期和时间标签。不方便地，日期标签仅在当天的第一次出现时填充。
xpath - Xpath/xQuery的月份差异
我正在尝试获取xPath几个月内两个日期之间的差异。几天之内我就没问题了（1127） days-from-duration(xs:date('2012-06-17')-xs:date('2009-0
xpath - Xpath-如何选择一个包含文本但仅此而已的元素
我试图选择一个包含一段文本的元素，但是我不想选择包含该文本加上其他文本的元素。我通常会使用text()='abc def'，但是这个特定元素在前后都包含空格。这是一个示例片段：
xpath - XPath:通过子节点的属性值获取节点
亲爱的，您能帮我用这个XPATH吗？可以说我有以下HTML代码 text value1 value2 text 我需要构建一
xpath - 具有排除功能的复杂 xpath
我正在尝试提取带有排除项的 xpath，但无法执行此操作。 (//div[@class='row site country-names']/following-sibling::div)[1]/di
xpath - 在单个节点中获取所有包含html的文本scrapy xpath
response.xpath('//*[@id="blah"]//text()') 假设我的html是 This is a simple text foo and this is after tag.
xpath - Xpath:基于父值的查询中的异常
除了那些具有"//ul/li[not(@*)][count(*)=0]"父项的人以外，我需要全部接受。我已经尝试过，但是不幸的是它不起作用。有谁知道，我该怎么处理？提前致谢。最佳答案我认为您需
xpath - XPath:使用子字符串之后仅返回一个匹配项
我使用XPath的问题是，每当我使用“子字符串”功能时，我只会得到一个匹配项，而我想全部获得它们。另一个问题是，每当我使用“子字符串”和运算符的组合时它只是行不通（没有匹配项）。例如：http:/
xpath - XPath:获取具有位置和属性的元素
我正在尝试通过其位置和属性获取项目，但不知道如何做。我要实现的是将这一点统一起来： Xpath("//h4/a[contains(@href,'#vuln_')]") 还有这个： Xpath
xpath - Xpath |路径内的运算符
我有一个xpath如下： .//*[text()='Name:']/../child::select | .//*[text()='Name:']/../child::span 但是对我来说，它既不紧
xpath - xpath:在同一级别上组合多个过滤器
我拼命试图在xpath中组合几个过滤器。假设我的数据如下所示： DELETE 1 This is my title my sh
xpath - 如何通过 Xpath 设置元素，在那些已经使用 Xpath 设置的元素内？
我想在已经通过 xpath 设置的其他元素中使用 xpath 来指示元素的位置。下面的一个已经通过 xpath 设置(我没有改变) //Base_Code
xpath - Xpath:在括号内获取值？
是否可以使用xpath直接在括号内抓取信息？还是以后再用正则表达式过滤？ HTML如下所示： Product name (UN1QU3 C0D3) 使用以下Xpath表达式，我可以在此中获取所有内容：
xpath - XPath:根据另一个节点值选择一个节点
我试图使用一个XPath表达式来选择一个节点，该节点的子节点与文档中的另一个节点匹配。匹配将意味着该节点的所有属性都相同。因此，如果将一个节点与多个属性进行比较，则无法进行单独的属性比较。作为示例
xpath - XPath 中艾弗森括号最简单的语法是什么？
我想在 XPath 表达式中使用 Iverson 括号(即映射 true => 1，false => 0)。示例:而不是书写 someNumber+(if(elem1/elem2[@attr='12
xpath - XPath:如何通过IN条件选择节点？
是否可以以类似方式选择节点？ './tr[position() in (1, 3, 7)]' 我只找到以下解决方案： './tr[position() = 1 or position() = 3 or

首页

博学

6Ren·AI

商城

python - 如何使用 XPath Selenium 和 Python 从
标签获取文本

首页

博学

6Ren·AI

商城

python - 如何使用 XPath Selenium 和 Python 从 标签获取文本

python - 如何使用 XPath Selenium 和 Python 从
标签获取文本