python - 网页抓取谷歌财经-6ren

python - 网页抓取谷歌财经

转载作者：行者123 更新时间：2023-12-05 06:45:58

24

4

我正在尝试自学如何通过网络抓取股票数据。我是一个新手，所以请原谅我可能会问的任何愚蠢问题。

这是我用来抓取价格的代码，我也在尝试抓取市盈率。

import urllib.request
from bs4 import BeautifulSoup

start = 'http://www.google.com/finance?cid=694653'

page = urllib.request.urlopen(start)
soup = BeautifulSoup(page)
          

P = soup.find('span',{'id':'ref_694653_l'})

print(P.get_text())

                     
pe = soup.find_all('td',{'class':'val'})

print(pe[5].get_text())

pe = soup.find('td',{'data-snapfield':'pe_ratio'})

print(pe.td.next_sibling.get_text())

我可以得到价格数据，也可以得到市盈率，但不是直接的。我尝试使用 next_sibling和 next_element但它给我一个错误，说没有属性。

我无法弄清楚如何从表中抓取数据，因为它通常是按行设置的，而且数据周围的类通常很常见，例如 <td>或 <tr> .

所以只是想寻求一些帮助来降低市盈率。

谢谢大家

是的

最佳答案

这将有助于:

>>> pe = soup.find('td',{'data-snapfield':'pe_ratio'})
>>> pe
<td class="key" data-snapfield="pe_ratio">P/E
</td>
>>> print(pe.td.next_sibling.get_text())
Traceback (most recent call last):
  File "<stdin>", line 1, in <module>
AttributeError: 'NoneType' object has no attribute 'next_sibling'
>>> 
>>> 
>>> 
>>> pe
<td class="key" data-snapfield="pe_ratio">P/E
</td>
>>> pe.td
>>> pe.next_sibling
u'\n'
>>> pe.next_sibling.next_sibling
<td class="val">29.69
</td>
>>> pe.next_sibling.next_sibling.get_text()
u'29.69\n'

关于python - 网页抓取谷歌财经，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/20022891/

24

4

0

文章推荐： objective-c - 你如何量化你的 Objective-C 代码质量？

文章推荐： r - 从已知百分位数生成正态分布

文章推荐： .net - MouseEventArgs.GetPosition 在 WPF 中没有按预期工作

文章推荐： string - Powershell - $args 去除逗号

google-finance - 如何使用 Google 财经？
我想开发一个小应用程序来自动从 Google Finance 获取股票价格并将其存储在我的本地机器中以供将来分析。谁能给我一些如何开始的线索？我知道一些 C#。它适合这个目的吗？先感谢您。最佳
finance - Google 财经 API 地址已更改
直到昨天以下网址(Google Finance API) https://www.google.com/finance/getprices?q=TSLA&x=NASDAQ&i=3600&p=5d&f=
javascript - Google 财经 - 获取报价搜索框 - 列对齐
Google 如何设法正确对齐 google finance url 中“获取报价”搜索框建议下拉列表中的第二列(即代码名称) 示例:如果您输入 iii - 第二列完全对齐。它不使用固定宽度的字体
google-api - Google 财经 API 错误
我想使用 Google Finance API 转换货币。但它在我的本地机器(印度)上不起作用。当我从位于美国的一台服务器进行访问并提供正确的结果时，它工作正常。网址:https://finance
c# - Google 财经，如何获取流式传输的 JSON 数据？
我之前试图解释这一点，但显然失败了! 因此，如果您打开了一个 google finance 图表，例如: http://www.google.com/finance?q=INDEXNASDAQ:.IX
google-sheets - 如何将 Google 财经(或雅虎财经)的财务数据导入到 Google 表格中？
我正在努力将财务数据输入谷歌表格。我在这里阅读了一些答案并尝试了这个:=importhtml("https://www.google.com/finance?q="&B2&"&fstype=ii&ei

首页

博学

6Ren·AI

商城

python - 网页抓取谷歌财经