python - 为什么 urlparse.urlenparse 工作不一致？-6ren

python - 为什么 urlparse.urlenparse 工作不一致？

转载作者：行者123 更新时间：2023-11-28 16:44:11

25

4

当netloc为空时urlparse.urlunparse不一致:

>>> urlparse.urlunparse(('http','','test_path', None, None, None))
'http:///test_path'
>>> urlparse.urlunparse(('ftp','','test_path', None, None, None))
'ftp:///test_path'
>>> urlparse.urlunparse(('ssh','','test_path', None, None, None))
'ssh:test_path'

这是错误还是功能？我希望 urlunparse 始终如第一个示例所示，即使无法识别方案也是如此。

最佳答案

您传递给 urlunparse 的 data 元组具有以下组件:

scheme, netloc, url, query, fragment = data

当没有netloc，并且scheme不在uses_netloc中时，url为

    url = scheme + ':' + url

这就是 urlunparse(调用 urlunsplit)的方式 is defined :

def urlunsplit(data):
    ...
    scheme, netloc, url, query, fragment = data
    if netloc or (scheme and scheme in uses_netloc and url[:2] != '//'):
        if url and url[:1] != '/': url = '/' + url
        url = '//' + (netloc or '') + url
    if scheme:
        url = scheme + ':' + url

请注意 'ssh' 不在 uses_netloc 中:

uses_netloc = ['ftp', 'http', 'gopher', 'nntp', 'telnet',
               'imap', 'wais', 'file', 'mms', 'https', 'shttp',
               'snews', 'prospero', 'rtsp', 'rtspu', 'rsync', '',
               'svn', 'svn+ssh', 'sftp','nfs','git', 'git+ssh']

如果您提供 netloc，您会得到一个以 ssh:// 开头的 url:

In [140]: urlparse.urlunparse(('ssh','netloc','test_path', None, None, None))
Out[140]: 'ssh://netloc/test_path'

关于python - 为什么 urlparse.urlenparse 工作不一致？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/15514306/

25

4

0

文章推荐： python - 忽略 python 地址中的序号

文章推荐： javascript - 全页导航幻灯片淡入淡出问题

文章推荐： html - 更改页面宽度颜色背景

文章推荐： python - 属性错误 : 'QuerySet' object has no attribute

python， "urlparse.urlparse(url).hostname"返回无值
登录网站后我想收集它的链接。这是我用这个函数做的(使用 mechanize 和 urlparse 库): br = mechanize.Browser() . . #logging in on web
python - 当我使用 import urlparse 时，没有名为 'urlparse' 的模块
我无法运行此代码，有人可以帮助我吗？ import sys, urllib, re, urlparse from urllib import urlretrieve from BeautifulSou
python-3.x - 没有名为 'urlparse' 的模块，但我没有使用 urlparse
我试图弄清楚为什么我会看到错误 ModuleNotFoundError: No module named 'urlparse'但我从不在我的代码中调用 urlparse。当我尝试使用 pip 安装 u
python - urlparse.urlparse 在方案后返回 3 '/' 而不是 2
如果缺少“http”方案名称，我想在给定的 url 字符串前面添加它。否则，不要管 url，所以我认为 urlparse 是执行此操作的正确方法。但是每当没有方案并且我使用 get url 时，我会在
python - 当我使用 import urlparse 时，Heroku 日志显示 "No module named ' urlparse'"
我在使用 https://devcenter.heroku.com/articles/heroku-postgresql#connecting-in-python 中的以下代码行时遇到问题 impor
python - urlparse() 查询字符串丢失
我有两个系统: 第一个按预期工作: >>> urlparse.urlparse('foo://bar/?blu=1') ParseResult(scheme='foo', netloc='bar',
Python urlparse 给出错误的结果
我正在尝试用 python 分隔 url 的不同部分s urlparse，但我似乎在结果中得到了错误的值。 baseline = runSql(conn,"Select url from malwar
python - urlparse 的奇怪行为
我想知道是否有已知的解决方法来解决我在 python 的 urlparse 中看到的一些奇怪行为。以下是 python 解释器中几行的一些结果: >>> import urlparse >>> ur
Python urlparse，正确还是不正确？
Python 的 urlparse 函数将 url 解析为六个部分(方案、netloc、路径和其他东西) 现在我发现解析“example.com/path/file.ext”不返回 netloc，而是
Python urlparse.unparse_qsl？
在Python的urlparse中，可以使用urlparse解析URL，然后parse_qsl解析query。我想删除一个查询(名称、值)对，然后重建 URL。有urlunparse方法，没有un
Python urlparse——提取没有子域的域名
需要一种使用 Python urlparse 从 url 中提取不带子域的域名的方法。例如，我想从 "http://www.google.com" 之类的完整网址中提取 "google.com"。
基于python(urlparse)模板的使用方法总结
1、简介 urlparse模块用户将url解析为6个组件，并以元组形式返回，返回的6个部分，分别是：scheme(协议)、netloc(网络位置)、path(路径)、params(路径段参数)、
python - 如何在 urlparse 中用新值替换查询？
所以我有一个数据如下: item = '//s780.scene7.com/is/image/forever/301596014_001?hei=98&wid=98' 使用 urlparse 模块。
Python urlparse : small issue
我正在制作一个解析 html 并从中获取图像的应用程序。使用 Beautiful Soup 可以轻松解析并下载 html，图像也可以使用 urllib2。我确实在 urlparse 中遇到问题，无法
python - urlparse.urljoin() 不处理无效的父目录
在从相对目录构造绝对 URL 时，是否有一种方法可以解决“无效”父目录的问题，还是我应该只使用 .replace()？ >>> from urlparse import urljoin >>> url
python - urlparse 失败并显示简单的 url
这段简单的代码使 urlparse 变得疯狂，它没有正确获取主机名，而是将其设置为 None: from urllib.parse import urlparse parsed = urlparse(
python - 我应该使用哪个 : urlparse or urlsplit?
其中URL parsing function pair我应该使用，为什么？ urlparse和 urlunparse , 或 urlsplit和 urlunsplit ? 最佳答案直接来自 the
python - 为什么 urlparse.urlenparse 工作不一致？
当netloc为空时urlparse.urlunparse不一致: >>> urlparse.urlunparse(('http','','test_path', None, None, None))
python - urlparse 不会为无效的 url 引发异常
我有以下代码，用于检查输入的 url 是否有效: #!/usr/bin/env python3 import sys import urllib.parse # ... def checkValidU
Python urlparse.parse_qs unicode url
urlparse.parse_qs 对于解析 url 参数很有用，它可以很好地处理简单的 ASCII url，用 str 表示。所以我可以解析一个查询，然后使用 urllib.urlencode 从解

首页

博学

6Ren·AI

商城

python - 为什么 urlparse.urlenparse 工作不一致？