
java - X509 certificate exception when crawling some URLs with StormCrawler

Reposted. Author: 太空宇宙. Updated: 2023-11-04 10:36:31

I have been using StormCrawler to crawl websites. For HTTPS, I configured StormCrawler's default https protocol implementation. However, when I crawl some websites, I get the following exception:

Caused by: sun.security.provider.certpath.SunCertPathBuilderException: unable to find valid certification path to requested target
at sun.security.provider.certpath.SunCertPathBuilder.build(SunCertPathBuilder.java:141) ~[?:1.8.0_131]
at sun.security.provider.certpath.SunCertPathBuilder.engineBuild(SunCertPathBuilder.java:126) ~[?:1.8.0_131]
at java.security.cert.CertPathBuilder.build(CertPathBuilder.java:280) ~[?:1.8.0_131]
at sun.security.validator.PKIXValidator.doBuild(PKIXValidator.java:382) ~[?:1.8.0_131]
at sun.security.validator.PKIXValidator.engineValidate(PKIXValidator.java:292) ~[?:1.8.0_131]
at sun.security.validator.Validator.validate(Validator.java:260) ~[?:1.8.0_131]
at sun.security.ssl.X509TrustManagerImpl.validate(X509TrustManagerImpl.java:324) ~[?:1.8.0_131]
at sun.security.ssl.X509TrustManagerImpl.checkTrusted(X509TrustManagerImpl.java:229) ~[?:1.8.0_131]
at sun.security.ssl.X509TrustManagerImpl.checkServerTrusted(X509TrustManagerImpl.java:124) ~[?:1.8.0_131]
at sun.security.ssl.ClientHandshaker.serverCertificate(ClientHandshaker.java:1496) ~[?:1.8.0_131]
... 20 more

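Is there a mechanism to download the certificates automatically and set them up for the crawler, and how should the crawler be configured for this?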

Best answer

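This problem is not specific to StormCrawler. This answer explains that you can import the certificate manually, which is not really an option unless you are crawling only that one site. Another option is to disable certificate validation. That requires modifying the protocol implementation, but it should be doable.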
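As a rough illustration of what "disabling certificate validation" means at the JDK level, here is a minimal sketch using only standard `javax.net.ssl` classes. The class name `TrustAllSsl` and its members are hypothetical and are not part of StormCrawler; how the resulting `SSLContext` would be wired into a given protocol implementation depends on the HTTP client it uses and is not shown. Accepting every certificate removes protection against man-in-the-middle attacks, so this is only reasonable for a crawler that reads public content.

```java
import javax.net.ssl.SSLContext;
import javax.net.ssl.TrustManager;
import javax.net.ssl.X509TrustManager;
import java.security.GeneralSecurityException;
import java.security.SecureRandom;
import java.security.cert.X509Certificate;

// Hypothetical helper, not a StormCrawler class: builds an SSLContext
// that accepts any server certificate chain without validating it.
final class TrustAllSsl {
    private TrustAllSsl() {}

    // Trust manager whose checks never throw, so every chain is accepted.
    static final X509TrustManager TRUST_ALL = new X509TrustManager() {
        @Override
        public void checkClientTrusted(X509Certificate[] chain, String authType) {
            // intentionally empty: no validation
        }

        @Override
        public void checkServerTrusted(X509Certificate[] chain, String authType) {
            // intentionally empty: no validation
        }

        @Override
        public X509Certificate[] getAcceptedIssuers() {
            return new X509Certificate[0];
        }
    };

    // Creates a TLS context backed only by the trust-all manager.
    static SSLContext newTrustAllContext() {
        try {
            SSLContext ctx = SSLContext.getInstance("TLS");
            ctx.init(null, new TrustManager[] { TRUST_ALL }, new SecureRandom());
            return ctx;
        } catch (GeneralSecurityException e) {
            throw new IllegalStateException("Could not create TLS context", e);
        }
    }
}
```

A client that accepts a custom `SSLSocketFactory` could then be given `TrustAllSsl.newTrustAllContext().getSocketFactory()`. With this in place, the `unable to find valid certification path to requested target` check in the stack trace above would no longer be reached for connections that use this context.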

Have you tried the OkHttp implementation? It may behave differently from Apache HttpClient. See the okhttp wiki.
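Switching the protocol implementation is done in the crawler configuration. The fragment below is a sketch that assumes a StormCrawler 1.x release where the OkHttp protocol class lives under the `com.digitalpebble.stormcrawler` package; the exact class name and the location of the keys inside your config file may differ between versions, so check them against the release you run.

```yaml
# Assumed keys and class name for StormCrawler 1.x; verify for your version.
config:
  http.protocol.implementation: "com.digitalpebble.stormcrawler.protocol.okhttp.HttpProtocol"
  https.protocol.implementation: "com.digitalpebble.stormcrawler.protocol.okhttp.HttpProtocol"
```

Changing the client does not by itself make an untrusted certificate trusted, so the exception may persist for sites whose certificate chain the JVM cannot validate.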

This post about "java - X509 certificate exception when crawling some URLs with StormCrawler" is based on a similar question found on Stack Overflow: https://stackoverflow.com/questions/49399561/
