gpt4 book ai didi

python - 如何从 BeautifulSoup 类中提取值(value)

转载 作者:行者123 更新时间:2023-12-04 03:20:38 25 4
gpt4 key购买 nike

我有一个看起来像这样的网络文档:-

    <table class="table "><col width="75px"></col><col width="1px"></col><tbody><tr class="tablerow style2" prodid="143012"><td class="pricecell"><span class="WebRupee">Rs.</span>
29
<br/><font style="font-size:smaller;font-weight:normal">
3 days
</font></td><td class="spacer"></td><td class="detailcell"><span><span class="label label-default" style="background-color:#3cb521;color:#fff;border:1px solid #3cb521">FULL TT</span> 
</span><span><span class="label label-default" style="background-color:#fff;color:#0c7abc;border:1px solid #0c7abc">SMS</span> 
</span><div style="padding-top:5px">
29

Full Talktime
</div><div class="detailtext"> 5 Local A2A SMS valid for 1 day </div></td></tr><tr class="tablerow style2" prodid="127535"><td class="pricecell"><span class="WebRupee">Rs.</span>
59
<br/><font style="font-size:smaller;font-weight:normal">
7 days
</font></td><td class="spacer"></td><td class="detailcell"><span><span class="label label-default" style="background-color:#3cb521;color:#fff;border:1px solid #3cb521">FULL TT</span> 
</span><span><span class="label label-default" style="background-color:#fff;color:#0c7abc;border:1px solid #0c7abc">SMS</span> 
</span><div style="padding-top:5px">
59

Full Talktime
</div><div class="detailtext"> 10 A2A SMS valid for 2 days </div></td></tr><tr class="tablerow style2" prodid="143025"><td class="pricecell"><span class="WebRupee">Rs.</span>
99
<br/><font style="font-size:smaller;font-weight:normal">
12 days
</font></td><td class="spacer"></td><td class="detailcell"><span><span class="label label-default" style="background-color:#3cb521;color:#fff;border:1px solid #3cb521">FULL TT</span> 
</span><div style="padding-top:5px">
99

Full Talktime
</div><div class="detailtext"> 10 Local A2A SMS for 2 days only </div>

我想要值 29, 3 days,29 full talktime, 59, 7 days,59 full talktime 等

但是如果我尝试下面的脚本,我会得到整个文档。

from bs4 import BeautifulSoup
import requests

r = requests.get("http://www.ireff.in/plans/airtel/karnataka")

data = r.text

soup = BeautifulSoup(data,"html.parser")

table = soup.find('table',{'class':'table'})
print(table)

我哪里错了?我想专门获取这些值。

或者如果表格可以转换为 json 数组,那也会有所帮助。

最佳答案

您需要更深入地挖掘才能获得您想要的特定数据。例如,要获取价格,请搜索类为“pricecell”的表格单元格。然后你可以获得包含的文本并解析它。一些示例代码(未测试):

price_cells = soup.findAll('td', {'class': 'pricecell'})
for price_cell in price_cells:
print(price_cell.text)

关于python - 如何从 BeautifulSoup 类中提取值(value),我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/38546575/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com