
php - Getting YouTube comments in PHP without the API

Reposted. Author: 行者123. Updated: 2023-12-03 05:23:31

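I am using PHP and trying to scrape YouTube comments without using the YouTube API.

I can fetch the first page of comments just fine, but the "show more" form, which carries a token, is giving me trouble.

Here are the live headers of the "show more" request: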

https://www.youtube.com/comment_ajax?action_load_comments=1&filter=-kWHMH2kxXs&order_by_time=false

POST /comment_ajax?action_load_comments=1&filter=-kWHMH2kxXs&order_by_time=false HTTP/1.1
Host: www.youtube.com
User-Agent: Mozilla/5.0 (Windows NT 6.1; WOW64; rv:33.0) Gecko/20100101 Firefox/33.0
Accept: text/html,application/xhtml+xml,application/xml;q=0.9,*/*;q=0.8
Accept-Language: en-US,en;q=0.5
Accept-Encoding: gzip, deflate
DNT: 1
X-YouTube-Page-CL: 78947767
X-YouTube-Page-Timestamp: Fri Oct 31 12:43:20 2014 (1414784600)
X-YouTube-Variants-Checksum: 9225d6367a37f5c51f11f11009c7ed18
Content-Type: application/x-www-form-urlencoded; charset=UTF-8
Referer: https://www.youtube.com/all_comments?v=-kWHMH2kxXs
Content-Length: 1712
Cookie: VISITOR_INFO1_LIVE=xJf1SvXMyvQ; PREF=fv=12.0.0&al=en&f5=30&f1=50000000; YSC=AQRpt5kYK6k; ACTIVITY=1414984537233; SID=DQAAANgAAAAVDrV25qIMIY1h4EnHCkB8QuHQpVPP2YhTT2PPvE2wkYbGLdHG9xQWEX_ADKYlKolQJRwza-Js_dmVlB-No68zaXnhLFm0NnbUaEV4zsaUwT5R_Kg1YvR2RvixP0OIw603Gax8sXIfXHmALqdYxWJ46Dt1qh2TmVoX06w7KlOQgvBE6_yViqu4j0b1iUSdVwJfMkhi8NVymGGsHWOVm027hdYdKKJTUC8-PJYbVKvItugatr0dJRL5_s6_l-P1ZFP2-OKrhb0H3ORmPU1EaFtsbPB3ZFjut09hxPFKCOq51w; HSID=A7izZQDyAow9noXRe; SSID=ALKL6xYLDPNuetixT; APISID=mOy8lSC4EJ0mUUs_/AjHuF16GPSuuL0gzQ; SAPISID=tmpcNtK_8ScRg3Mc/AMrBhFSc1O6ejr4HK; LOGIN_INFO=fe8cf1ebd49b7874eaf88ae7e3930925c2sAAAB7IjgiOiAxMzY1NjQ3NTc1MzUsICI0IjogIkdBSUEiLCAiNyI6IDE0MTQ5NDk2NDcsICIxIjogMSwgIjIiOiAiSWtxR3c2a2ZQdVRRQktZWHhGdWRaZz09IiwgIjMiOiAyNzc5ODQzNTk2fQ==; lwb=1; wide=1
Connection: keep-alive
Pragma: no-cache
Cache-Control: no-cache
page_token=Cg0Q8o2Ks8HdwQIgACgBErMHCAIQ6OuMlsHdwQIqpQe2m7DWq%2BSSlskBkZa45rrvrcmUAfyw%2B4WRtdrj%2FAG57%2FHkzJuNvcYB2POv74G5pZ8Mo%2Frf4Om0q%2FgS39TjhqfnzrMd6JbK9qnmj5sCueOj2YLYwozGAe7z0di9pPucHO3%2Fs42y3ejI%2BQHLhcen4eTv5x7NgNyI35nQrIcB1emWlMOJtcnlAdflt%2BbyjLnC4QGZktCSyZro5Anvhui%2BqObE%2B9kB5cahueaIkf27Ae7kkt%2Fry%2BqyOKfNh%2BCooZ%2FwcJWwt%2BP2xZmjGpO5lIqhrayYlQG4t9z%2Bo%2FDA2sEB94DOtZmc2dQG%2FavmlfWh5PkehJ6lrrLIzIW6Adr9%2BuHWkLmPFITArbbFleOkzQH8ytjyg7iuloABg6j8t%2Bj2ypG6AaeNhpf5yq6FnAGxq7u%2Bx6Sjl4EBg4vMmNDj3ulAsK7JrsSUz9%2FKAeTs4OfIwt33ygG277%2BRvaHOpE7Vqtuw6ZOy%2FR%2By7Nrixrj%2BzgWqs8avyoSK2tgBp6fHg%2BKLh4dr6NOqqLnb8JnnAYKH87SF%2B%2BXujgHDv7SArt7Z0ZUBu6Sy3YTBo%2FZ9%2Ft7QwYSLneytAaXG%2BfKblNOsVO3Bj%2BHxiqnLSJmuyeXj6ZT67wGK%2F%2FO9n9nA0uIBuIaez6%2BP1aWWAeybq8L%2F44efFuWNrO7h8JCCBPiVz8eq0sCnQcHjjaS2go3M5QH0wI3WwNa%2B9Uq1o8mZ85SZ4jiKs5O38prnsq8B5pDPvK%2FxlbFGz7%2FA%2BuSR2cffAZePkYuJ35n6Mt%2FUyO73kfqiOMGetu%2F%2Fr%2BCDkAHgkeLxh5K1ugWmoLjQis%2FZlBzfhbLI%2Ff7%2BjpkBrYHs0fvW%2B%2FvAAYCl99yYrb66hwGP6vuCparSjqYB1LbP14TY8ZJg7oyBnOGppMCgAY6u1rXq9pr9nAGn167P5Mmp1iCR%2F%2FLzxI%2Fw1njc5KjE2cGZpGSKivTwzv7HgmnkmIjsn7ra6vkB%2FI2mxMXS9ZPaAarLnYXDnumlggHvobaC%2BYuQr9kBk7i9hsKjgd1XiY7T2u%2BHys3PAb2Nw97A7Z2vrQGfof2Coeay7nSG68mDmpjcnQqWw7W157Tw9GLdq9TXpajvrbcByp2Iy8Cex6rdAfqvsIGn0uHwaMOap9bYho%2BaVLrAxbKru6C1ngGt1drm%2FsWs9yW%2Fm9722KfXxr4Bq8GDlZbnz4wPj4GPj5bd0MzOAb3ZiovalKGiL7zYhbO81fvlPJjlto%2Bh26qewgHUy%2B3ZxuXk4AYYAQ%3D%3D&session_token=QUFFLUhqa0RnaTI1Z3dnTVdTaTZEbUM2Vkp4WWpnTlJPUXxBQ3Jtc0tuN2g1WTh0c2FIa2JMU0FER1oxWU5HTzNtMzMyLXRuUEYzeldzMU5VMnhmOXdUN0U2TnEzNW9KSjFXb0FoV1Y0QUxSTG9SWXVlSXk0am50RTFBcUhnSi10QklHdXVlcE5LUGVYLXZ0YzNvajUzYTZGRUhtbWVISklFS2JSSjVwRXdRWERua04yYktDd243NGFfOGcyVUdIZXVaMmc%3D
HTTP/1.1 200 OK
Alternate-Protocol: 443:quic,p=0.01
Cache-Control: no-cache
Content-Disposition: attachment
Content-Encoding: gzip
Content-Length: 13424
Content-Type: application/json; charset=UTF-8
Date: Mon, 03 Nov 2014 04:24:24 GMT
Expires: Tue, 27 Apr 1971 19:44:06 EST
Server: gwiseguy/2.0
x-content-type-options: nosniff
X-Frame-Options: SAMEORIGIN
X-XSS-Protection: 1; mode=block
X-Firefox-Spdy: 3.1

I was able to build the cURL request with all of the URL parameters; that part was easy:
https://www.youtube.com/comment_ajax?action_load_comments=1&filter=-kWHMH2kxXs&order_by_time=false

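But how do I include the page_token?

I extracted the token from the "show more" form on the comments page, but I don't know how to add it to the request.

I have tried this: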
$headers = array(
"Cache-Control: no-cache",
"page_token=" . $dataToken
);
curl_setopt($ch, CURLOPT_HTTPHEADER, $headers);

But I always get an error back:
HTTP/1.1 403 Forbidden
Date: Mon, 03 Nov 2014 04:58:24 GMT
Server: gwiseguy/2.0
Cache-Control: no-cache
X-Frame-Options: SAMEORIGIN
X-Content-Type-Options: nosniff
Content-Type: text/html; charset=utf-8
X-XSS-Protection: 1; mode=block; report=https://www.google.com/appserve/security-bugs/log/youtube
Expires: Tue, 27 Apr 1971 19:44:06 EST
Content-Length: 0
Alternate-Protocol: 443:quic,p=0.01
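
In the captured request, page_token and session_token travel in the form-encoded POST body, not in the header list, so putting "page_token=..." into CURLOPT_HTTPHEADER sends it as a malformed header. A minimal sketch of the body-based approach follows. The function names buildLoadMoreBody and prepareLoadMoreRequest are hypothetical, the tokens are assumed to be passed in raw (not yet percent-encoded) form, and it is only an assumption that a matching cookie, session token, and referer would be enough to avoid the 403.

```php
<?php
// Hypothetical helper: builds the form-encoded body seen in the captured request.
function buildLoadMoreBody(string $pageToken, string $sessionToken): string
{
    // http_build_query percent-encodes each value ('+' => %2B, '/' => %2F, '=' => %3D).
    return http_build_query([
        'page_token'    => $pageToken,
        'session_token' => $sessionToken,
    ], '', '&');
}

// Hypothetical helper: configures the POST request but does not execute it.
// $cookieHeader is assumed to be the cookie string from the session that
// produced the tokens, e.g. "VISITOR_INFO1_LIVE=...; YSC=...".
function prepareLoadMoreRequest(string $videoId, string $body, string $cookieHeader)
{
    $url = 'https://www.youtube.com/comment_ajax?' . http_build_query([
        'action_load_comments' => 1,
        'filter'               => $videoId,
        'order_by_time'        => 'false',
    ], '', '&');

    $ch = curl_init($url);
    curl_setopt($ch, CURLOPT_POST, true);
    curl_setopt($ch, CURLOPT_POSTFIELDS, $body); // tokens go in the body, not the headers
    curl_setopt($ch, CURLOPT_HTTPHEADER, [
        'Cache-Control: no-cache',
        'Content-Type: application/x-www-form-urlencoded; charset=UTF-8',
    ]);
    curl_setopt($ch, CURLOPT_COOKIE, $cookieHeader);
    curl_setopt($ch, CURLOPT_REFERER, 'https://www.youtube.com/all_comments?v=' . rawurlencode($videoId));
    curl_setopt($ch, CURLOPT_RETURNTRANSFER, true);

    return $ch; // pass to curl_exec() to send the request
}
```

Passing the body as a string rather than an array matters here: given an array, cURL switches to multipart/form-data, which would not match the captured Content-Type.
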

I have all the data I need; I'm just not sure how to structure the cURL request.

Any help would be greatly appreciated.

Thanks.

Best answer

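Although this is currently only supported by the v2 API, you should reconsider using it. After loading the first page of comments from

https://gdata.youtube.com/feeds/api/videos/-kWHMH2kxXs/comments?orderby=published

you can simply follow the feed's next link to get the next batch of comments.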
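
Following the next link can be sketched as below, assuming the feed is returned as Atom XML and that the next page is advertised as a link element with rel="next". The function name findNextLink is hypothetical, and fetching the feed itself is left out so the parsing step stands alone.

```php
<?php
// Hypothetical helper: returns the href of the Atom <link rel="next"> element,
// or null when the feed has no further page or is not well-formed XML.
function findNextLink(string $atomXml): ?string
{
    // Collect parse errors internally instead of emitting PHP warnings.
    libxml_use_internal_errors(true);
    $feed = simplexml_load_string($atomXml);
    libxml_clear_errors();
    if ($feed === false) {
        return null;
    }

    // Atom elements live in a namespace, so select the children through it.
    foreach ($feed->children('http://www.w3.org/2005/Atom')->link as $link) {
        $attrs = $link->attributes();
        if ((string) $attrs['rel'] === 'next') {
            return (string) $attrs['href'];
        }
    }

    return null;
}
```

A caller would fetch the first URL, process its entries, then keep requesting whatever findNextLink returns until it yields null.
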

Regarding "php - Getting YouTube comments in PHP without the API", we found a similar question on Stack Overflow: https://stackoverflow.com/questions/26708414/
