gpt4 book ai didi

python - soup.find 找不到 div 的类

转载 作者:行者123 更新时间:2023-12-01 09:15:57 37 4
gpt4 key购买 nike

我正在尝试抓取此页面:https://1xbet.cm/en/line/Football/1536237-FIFA-World-Cup-2018/因为这是足球赔率,但是当我尝试通过 BeautifulSoup 查找相关类(class)时,我没有得到任何返回。有人可以解释为什么我没有找到任何东西吗?

enter image description here

class GetData():

def __init__(self, url):

self.url = url
r = requests.get(url)
self.soup = BeautifulSoup(r.text, "lxml")

def do_smth(self):

content = self.soup.find_all("div", class_="bets_content")
print(content)

url = 'https://1xbet.cm/en/line/Football/1536237-FIFA-World-Cup-2018/'
gd = GetData(url)
gd.do_smth()

最佳答案

我认为 BeautifulSoup 无法帮助您从该网站抓取数据,因为该网站使用 VueJS 作为消费的 JavaScript 框架网站 API/Web 服务以获得最终模板。

因此,为了获取数据,您可以直接解析 API/Web 服务并获取您需要的内容。

以下是使用 requestsre 模块的示例:

import re
import requests


class GetData:
def __init__(self):
self.main_url = 'https://1xbet.cm/en/line/Football/1536237-FIFA-World-Cup-2018/'
self.headers = {
'accept-encoding': 'gzip, deflate, br',
'accept-language': 'fr-FR,fr;q=0.9,en-US;q=0.8,en;q=0.7',
'referer': 'https://1xbet.cm/en/line/Football/1536237-FIFA-World-Cup-2018/',
'user-agent': 'Mozilla/5.0 (X11; Linux x86_64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/67.0.3396.79 Safari/537.36'
}



def read(self, url):
with requests.get(url, headers=self.headers) as response:
if response.status_code == 200:
return response.json()
else:
raise Exception('Got error: {}'.format(response.status_code))



def pretty_print(self, msg, data):
print(msg + ' :')
print(data)
print('#' * 40)



def get_teams_id(self, url):
teams_regex = re.findall(r'/(\d+)-', self.main_url)
if teams_regex:
teams_id = teams_regex[0]
return url.format(teams_id)
else:
raise ValueError("Cannot parse Teams ID")



def get_teams_info(self, pretty_print=False):
teams_url = 'https://1xbet.cm/LineFeed/GetChampTeams?id={}&lng=en'
valid_url = self.get_teams_id(teams_url)
data = self.read(valid_url)
if pretty_print:
values = data.get('Value', [])
teams = [values[k:k+2] for k in range(0, len(values), 2)]
teams_pretty = '\n'.join(' VS '.join(map(
lambda x: '{}({})'.format(x.get('N'), x.get('I')), k)
) for k in teams
)
self.pretty_print('Teams Info', teams_pretty)

return data



def get_teams_cotes(self, pretty_print=False):
cotes_url = 'https://1xbet.cm/LineFeed/Get1x2_VZip?champs={}&count=50&lng=en&tf=1500000&mode=4'
valid_url = self.get_teams_id(cotes_url)
data = self.read(valid_url)
if pretty_print:
values = data.get('Value')
for k in values:
msg = '{}\n{} VS {}\nCotes: [{}, ..., {}]'.format(
k.get('L'),
k.get('O1'),
k.get('O2'),
k.get('E')[0],
k.get('E')[-1]
)
self.pretty_print('Events & Cotes', msg)

return data



if __name__ == '__main__':
app = GetData()
_ = app.get_teams_info(pretty_print=True)
_ = app.get_teams_cotes(pretty_print=True)

如果您运行此代码,您将得到与此类似的结果:

Teams Info :
Belgium(12609) VS Croatia(12739)
England(12763) VS France(12771)
########################################
Events & Cotes :
FIFA World Cup 2018
France VS Belgium
Cotes: [{'T': 1, 'G': 1, 'C': 2.58}, ..., {'T': 181, 'G': 19, 'C': 2.125}]
########################################
Events & Cotes :
FIFA World Cup 2018
Croatia VS England
Cotes: [{'T': 1, 'G': 1, 'C': 3.64}, ..., {'T': 181, 'G': 19, 'C': 1.805}]
########################################

现在轮到您解析数据并获取您需要的内容了。请善待网站。

关于python - soup.find 找不到 div 的类,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/51256334/

37 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com