python - 请求为什么在获取安全链接时失败-6ren

python - 请求为什么在获取安全链接时失败

转载作者：可可西里更新时间：2023-11-01 17:26:59

当我尝试发出安全请求时，出现了一个奇怪的错误，而且我找不到错误。我确定这是愚蠢的事情。

#!/usr/bin/env python


'''
this module was designed with web scrapers and web crawlers in mind.
I find my self writing these functions all the time. I Wrote this model
to save time.
'''

import requests
import urlparse
import urllib2
import urllib
import re
import os
import json
from fake_useragent import UserAgent

class InvalidURL(Exception):
    pass

class URL(object):
    '''Common routines for dealing with URLS.
    '''
    def __init__(self, url):
        '''Setup the initial state
        '''
        self.raw_url = url
        self.url = urlparse.urlparse(url)
        self.scheme = self.url.scheme
        self.domain = self.url.netloc
        self.path = self.url.path
        self.params = self.url.params
        self.query = self.url.query
        self.fragment = self.url.fragment


    def __str__(self):
        ''' This os called when somthing
        asks for a string representation of the
        url
        '''
        return self.raw_url


    def valid(self):
        """Validate the url.

        returns True if url is valid
        and False if it is not
        """
        regex = re.compile(
            r'^(?:http|ftp)s?://' # http:// or https://
            r'(?:(?:[A-Z0-9](?:[A-Z0-9-]{0,61}[A-Z0-9])?\.)+(?:[A-Z]{2,6}\.?|[A-Z0-9-]{2,}\.?)|'
            r'localhost|' #localhost...
            r'\d{1,3}\.\d{1,3}\.\d{1,3}\.\d{1,3})' # ...or ip
            r'(?::\d+)?' # optional port
            r'(?:/?|[/?]\S+)$', re.IGNORECASE)
        match = regex.match(self.raw_url)
        if match:
            return True


    def unquote(self):
        """unquote('abc%20def') -> 'abc def'."""

        return urllib2.unquote(self.raw_url)


    def quote(self):
        """quote('abc def') -> 'abc%20def'

        Each part of a URL, e.g. the path info, the query, etc., has a
        different set of reserved characters that must be quoted.

        RFC 2396 Uniform Resource Identifiers (URI): Generic Syntax lists
        the following reserved characters.

        reserved    = ";" | "/" | "?" | ":" | "@" | "&" | "=" | "+" |
                      "$" | ","

        Each of these characters is reserved in some component of a URL,
        but not necessarily in all of them.

        By default, the quote function is intended for quoting the path
        section of a URL.  Thus, it will not encode '/'.  This character
        is reserved, but in typical usage the quote function is being
        called on a path where the existing slash characters are used as
        reserved characters.
        """
        return urllib2.quote(self.raw_url)


    def parameters(self):
        """
        parse the parameters of the url
        and return them as a dict.
        """
        return urlparse.parse_qs(self.params)


    def secure(self):
        """ Checks if the url uses ssl. """
        if self.scheme == 'https':
            return True


    def extention(self):
        """ return the file extention """
        return os.path.splitext(self.path)[1]


    def absolute(self):
        """ Checks if the URL is absolute. """
        return bool(self.domain)


    def relitive(self):
        """ Checks if the url is relitive. """
        return bool(self.scheme) is False


    def encode(self, mapping):
        """Encode a sequence of two-element tuples or dictionary into a URL query string.

        If any values in the query arg are sequences and doseq is true, each
        sequence element is converted to a separate parameter.

        If the query arg is a sequence of two-element tuples, the order of the
        parameters in the output will match the order of parameters in the
        input.
        """
        query = urllib.urlencode(mapping)
        return urlparse.urljoin(self.raw_url, query)


class Request(object):


    allow_redirects = True
    timeout = 5
    ramdom_useragent = 0
    verify = False
    session = requests.Session()
    stream = True
    proxies = {}

    def __init__(self, url):
        """ Set the inital state """
        self.agentHeaders = {}
        self.url = URL(url)
        if not self.url.valid():
            raise InvalidURL("{} is invalid".format(url))

    def stream(self, answer):
        self.stream = bool(answer)

    def randomUserAgent(self):
        """ Set a random User-Agent """
        self.setUserAgent(UserAgent().random)


    def allowRedirects(self, answer):
        """ Choose whether or not to follow redirects."""
        self.allow_redirects = bool(answer)


    def setUserAgent(self, agent):
        """ Set the User-Agent """
        self.setHeaders('User-Agent', agent)


    def setHeaders(self, key, value):
        """ Set custom headers """
        self.agentHeaders[key] = value


    def verify(self, answer):
        """ Set whether or not to verify SSL certs"""
        self.verify = bool(answer)


    def get(self):
        """Sends a GET request"""
        return self.session.get(
            url=self.url,
            headers=self.agentHeaders,
            allow_redirects=self.allow_redirects,
            timeout=self.timeout,
            verify=self.verify,
            stream=self.stream,
            proxies=self.proxies
            )


    def head(self):
        """ Send a head request and return the headers """
        return self.session.head(
            self.url,
            headers=self.agentHeaders,
            allow_redirects=self.allow_redirects,
            timeout=self.timeout,
            verify=self.verify,
            proxies=self.proxies
            ).headers


    def options(self):
        """ Send a options request and return the options """
        return self.session.options(
            self.url,
            headers=self.agentHeaders,
            allow_redirects=self.allow_redirects,
            timeout=self.timeout,
            verify=self.verify,
            proxies=self.proxies
            ).headers['allow']


    def json(self):
        """
        Deserialize json data (a ``str`` or ``unicode`` instance
        containing a JSON document) to a Python object.
        """
        return json.loads(self.text)


    def headerValue(self, value):
        """ Get a value from the headers. """
        return self.headers().get(value)



request = Request('https://www.google.com')
req =  request.get()
print req.text
print request.head()
print 
print req.headers.get('link')
print request.options()

request = Request('https://www.google.com')
req =  request.get()

Sat Jul 29 HttpClient python UserAgent.py 
Traceback (most recent call last):
  File "UserAgent.py", line 234, in <module>
    req =  request.get()
  File "UserAgent.py", line 192, in get
    proxies=self.proxies
  File "/home/ricky/.local/lib/python2.7/site-packages/requests/sessions.py", line 515, in get
    return self.request('GET', url, **kwargs)
  File "/home/ricky/.local/lib/python2.7/site-packages/requests/sessions.py", line 502, in request
    resp = self.send(prep, **send_kwargs)
  File "/home/ricky/.local/lib/python2.7/site-packages/requests/sessions.py", line 612, in send
    r = adapter.send(request, **kwargs)
  File "/home/ricky/.local/lib/python2.7/site-packages/requests/adapters.py", line 407, in send
    self.cert_verify(conn, request.url, verify, cert)
  File "/home/ricky/.local/lib/python2.7/site-packages/requests/adapters.py", line 224, in cert_verify
    if not cert_loc or not os.path.exists(cert_loc):
  File "/usr/lib/python2.7/genericpath.py", line 26, in exists
    os.stat(path)
TypeError: coercing to Unicode: need string or buffer, instancemethod found

最佳答案

看看你的Request.verify方法:

def verify(self, answer):
    """ Set whether or not to verify SSL certs"""
    self.verify = bool(answer)

它与 Request.verify 冲突属性。

因此，当您调用 Request.get() ，你正在传递你的 verify verify 的实例方法requests.session.get(..., verify=<your method>) 中的参数, 而不是字符串( should point to a certificate bundle )或 bool .

线索在您的堆栈跟踪中:TypeError: coercing to Unicode: need string or buffer, instancemethod found .

解决方案:重命名您的 verify类似于 setVerify 的方法(与其他方法保持一致)。

与此问题无关，我建议您实现 Request通过扩展 requests.Session 类类(class)。这样你就可以定义更少的方法(比如 get 、 head 、 json 等)

关于python - 请求为什么在获取安全链接时失败，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/45392983/

文章推荐： json - Angular 2+ http get() JSON 数据。无法过滤数据

文章推荐： java - 多个线程读取和导入目录中的文件

文章推荐： angular - 如何使用我的 http 请求配置代理？

文章推荐： json - 在 Node-Red 中处理 HTTP 请求中的 MQTT 通信

python - Python 请求(AJAX 请求)数据丢失
我正在尝试从该网站抓取历史天气数据: http://www.hko.gov.hk/cis/dailyExtract_uc.htm?y=2016&m=1 在阅读了 AJAX 调用后，我发现请求数据的正确
rest - 链接 postman 请求 - 从另一个请求调用 postman 请求？
我有两个 postman 请求 x,y，它们命中了两个不同的休息 api X,Y 中的端点。 x 会给我一个身份验证 token ，这是发出 y 请求所必需的。如何在请求 y 中发出请求 x ？也就是
javascript - Node.js 请求 - 处理多个 POST 请求
我使用请求库通过 API 与其他服务器进行通信。但现在我需要同时发送多个(10 个或更多)POST 请求，并且只有在所有响应都正确的情况下才能进一步前进。通常语法看起来有点像这样: var optio
javascript - 如果提交了新的 AJAX 请求，则取消 AJAX 请求
背景:当用户单击按钮时，其类会在class1和class2之间切换，并且此数据是通过 AJAX 提交。为了确认此数据已保存，服务器使用 js 进行响应(更新按钮 HTML)。问题:如果用户点击按钮的
Node.js 请求 - 打印帖子的整个 http 请求(原始)
我正在将 Node.js 中的请求库用于 Google 的文本转语音 API。我想打印出正在发送的请求，如 python example . 这是我的代码: const request = requi
python - 请求、请求 2 和请求 3 之间有什么区别？
我经常使用requests。最近我发现还有一个 requests2 和即将到来的 requests3 虽然有一个 page其中简要提到了 requests3 中的内容，我一直无法确定 requests
python - 在 POST 请求(python 请求)后获取响应/返回值
我正在尝试将图像发送到我的 API，然后从中获取结果。例如，我使用发送一个 bmp 图像文件 file = {"img": open("img.bmp)} r = requests.post(url,
azure - Azure 中两个虚拟机之间的内部 HTTP 请求 - 默认情况下安全还是需要发送 HTTPS 请求？
我发现 Google Cloud 确保移出其物理环境的任何请求都经过强制加密，请参阅(虚拟机到虚拟机标题下的第 6 页)this link Azure(和 AWS)是否遵循类似的程序？如果有人能给我指
javascript - jQuery:执行同步 AJAX 请求，然后执行一系列其他 ajax 请求
我有一个 ASP.NET MVC 应用程序，我正在尝试在 javascript 函数中使用 jQuery 来创建一系列操作。该函数由三部分组成。我想做的是:如果满足某些条件，那么我想执行同步 jQu
javascript - Http 请求 - 外部 url 请求 ember js
我找不到如何执行 get http 请求，所以我希望你们能帮助我。这个想法是从外部url(例如 https://api.twitter.com/1.1/search/tweets.json?q=tw
android - 请求 READ_SMS 请求 "send and view SMS messages"
我的应用只需要使用“READ_SMS”权限。我的问题是，在 Android 6.0 上，当我需要使用新的权限系统时，它会要求用户“发送和查看短信”。这是我的代码: ActivityCompat.re
node.js - 为什么即使我的前端代码只是发出 POST 请求，浏览器也会发送 OPTIONS 请求？
我的前端代码: { this.searchInput = input; }}/> 搜索 // search method: const baseUrl = 'http://localho
c# - 将 HTTP 请求 header 添加到 WCF 请求
我有一个由 AJAX 和 C# 应用程序使用的 WCF 服务，我需要通过 HTTP 请求 header 发送一个参数。在我的 AJAX 上，我添加了以下内容并且它有效: $.ajax({
javascript - node.js + 请求 => node.js + bluebird + 请求
我正在尝试了解如何使用 promises 编写代码。请检查我的代码。这样对吗？ Node.js + 请求: request(url, function (error, response, body)
gwt - 如果失败，如何重新发送 GWT RPC 请求(或如何创建持久的 RPC 请求)？
如果失败(除 HTTP 200 之外的任何响应代码)，我需要重试发送 GWT RPC 请求。原因很复杂，所以我不会详细说明。到目前为止，我在同一个地方处理所有请求响应，如下所示: // We
php - 发起 POST 请求，执行操作，然后完成 POST 请求 - 如何？
当用户单击提交按钮时，我希望提交表单。然而，就在这种情况发生之前，我希望弹出一个窗口并让他们填写一些数据。一旦他们执行此操作并关闭该子窗口，我希望发出 POST 请求。这可能吗？如果可能的话如何？我
javascript - 什么更好？更多 HTTP 请求 = 更少的数据传输或更少的 HTTP 请求 = 更多的数据传输？
像 Facebook 这样的网站使用“延迟”加载 js。当你必须考虑到我有一台服务器，流量很大时。我很感兴趣 - 哪一个更好？当我一次执行更多 HTTP 请求时 - 页面加载速度较慢(由于限制(一
java - Servlet 容器创建 Servlet 请求/响应对象还是 HttpServlet 请求/响应对象？
Servlet 容器是否创建 ServletRequest 和 Response 对象或 Http 对象？如果是ServletRequest，谁在调用服务方法之前将其转换为HttpServletReq
php - HTTP 请求 URL 不是 HTTP 请求 header 的一部分吗？
这是维基百科文章的摘录: In contrast to the GET request method where only a URL and headers are sent to the serv
node.js - 首先完成一个 HTTP post 请求，然后再循环执行下一个 HTTP post 请求
我有一个循环，每次循环时都会发出 HTTP post 请求。 for(let i = 1; i console.log("succes at " + i), error => con

可可西里

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

python - 请求为什么在获取安全链接时失败