python-3.x - requests.exceptions.MissingSchema: Invalid URL 'None': No schema supplied while trying to find broken links through Selenium and Python

Reposted by: 行者123  Updated: 2023-12-04 13:02:04

I want to find broken links on my web page using Selenium + Python. I tried the code below, but it raises the following error:

requests.exceptions.MissingSchema: Invalid URL 'None': No schema supplied. Perhaps you meant http://None?

Code trial:

for link in links:
    r = requests.head(link.get_attribute('href'))
    print(link.get_attribute('href'), r.status_code)

Full code:

def test_lsearch(self):
    driver=self.driver
    driver.get("http://www.google.com")
    driver.set_page_load_timeout(10)
    driver.find_element_by_name("q").send_keys("selenium")

    driver.set_page_load_timeout(10)
    el=driver.find_element_by_name("btnK")
    el.click()
    time.sleep(5)

    links=driver.find_elements_by_css_selector("a")
    for link in links:
        r=requests.head(link.get_attribute('href'))
        print(link.get_attribute('href'),r.status_code)

Best answer

This error message...

    raise MissingSchema(error)
requests.exceptions.MissingSchema: Invalid URL 'None': No schema supplied. Perhaps you meant http://None?

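...implies that get_attribute('href') returned None for at least one of the collected <a> elements, i.e. that element has no href attribute. requests converts the value to the string 'None', finds no scheme (such as http or https) in it, and raises MissingSchema. The check sits under the "Support for unicode domain names and paths" comment in the requests source.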
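To see why the value 'None' fails a scheme check, here is a minimal sketch using only the standard library. It is a stand-in, not the requests implementation: requests uses its own parse_url, while this sketch uses urllib.parse.urlsplit, and the helper name has_scheme is hypothetical.

```python
from urllib.parse import urlsplit


def has_scheme(url):
    """Return True if str(url) carries a URL scheme such as http or https.

    Hypothetical helper for illustration; str() mirrors the fact that the
    error message shows the text 'None' rather than the None object.
    """
    return bool(urlsplit(str(url)).scheme)


# An <a> element without an href yields None from get_attribute('href').
print(has_scheme(None))
# A normal absolute link carries a scheme.
print(has_scheme("https://www.seleniumhq.org/"))
```

Any href for which this kind of check fails (None, an empty string, a relative path) would need to be skipped or resolved before it is handed to requests.head().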

This error is raised in requests' models.py as follows:

    # Support for unicode domain names and paths.
    scheme, auth, host, port, path, query, fragment = parse_url(url)
    if not scheme:
        raise MissingSchema("Invalid URL {0!r}: No schema supplied. "
                            "Perhaps you meant http://{0}?".format(url))

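Solution

Possibly you are trying to look for broken links once the search results for the keyword Selenium are available from the Google Home Page Search Box. To achieve that, you can use the following solution:

Code Block: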

import requests
from selenium import webdriver
from selenium.webdriver.common.by import By
from selenium.webdriver.support.ui import WebDriverWait
from selenium.webdriver.support import expected_conditions as EC
from selenium.webdriver.common.keys import Keys

# Note: chrome_options= and executable_path= are Selenium 3 keyword arguments;
# newer Selenium 4 releases expect options= and a Service object instead.
options = webdriver.ChromeOptions()
options.add_argument("start-maximized")
options.add_argument('disable-infobars')
driver = webdriver.Chrome(chrome_options=options, executable_path=r'C:\Utility\BrowserDrivers\chromedriver.exe')
driver.get('https://google.co.in/')
# find_element(By.NAME, ...) is available in both Selenium 3 and Selenium 4
search = driver.find_element(By.NAME, 'q')
search.send_keys("selenium")
search.send_keys(Keys.RETURN)
# Wait for the result links. The XPath matches only anchors that wrap a result
# title, and it depends on Google's result-page markup at the time of the answer.
links = WebDriverWait(driver, 10).until(EC.visibility_of_any_elements_located((By.XPATH, "//div[@class='rc']//h3//ancestor::a[1]")))
print("Number of links : %s" % len(links))
for link in links:
    # Read the href once and reuse it for the request and the printout
    href = link.get_attribute('href')
    r = requests.head(href)
    print(href, r.status_code)

Console Output:

Number of links : 9
https://www.seleniumhq.org/ 200
https://www.seleniumhq.org/download/ 200
https://www.seleniumhq.org/docs/01_introducing_selenium.jsp 200
https://www.guru99.com/selenium-tutorial.html 200
https://en.wikipedia.org/wiki/Selenium_(software) 200
https://github.com/SeleniumHQ 200
https://www.edureka.co/blog/what-is-selenium/ 200
https://seleniumhq.github.io/selenium/docs/api/py/ 200
https://seleniumhq.github.io/docs/ 200
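If you prefer to keep collecting every <a> element, another option is to skip unusable href values before calling requests. The following is a rough sketch under stated assumptions, not part of the original answer: find_broken_links is a hypothetical helper, the head argument is assumed to behave like requests.head, and treating any status of 400 or above as broken is an assumption you may want to adjust.

```python
from urllib.parse import urlsplit


def find_broken_links(hrefs, head, timeout=10):
    """Return (url, problem) pairs for links that look broken.

    hrefs: iterable of values from get_attribute('href'); may contain None.
    head:  callable shaped like requests.head, i.e.
           head(url, allow_redirects=..., timeout=...) -> object with .status_code
    """
    broken = []
    for href in hrefs:
        # Skip None, empty strings and non-HTTP links (javascript:, mailto:, ...)
        if not href or urlsplit(href).scheme not in ("http", "https"):
            continue
        try:
            status = head(href, allow_redirects=True, timeout=timeout).status_code
        except Exception as exc:
            # Connection errors, timeouts, etc. are reported instead of aborting the loop
            broken.append((href, repr(exc)))
            continue
        if status >= 400:
            broken.append((href, status))
    return broken


# Possible usage with the objects from the code block above (assumes requests
# is installed and `links` holds Selenium WebElements):
#
#   import requests
#   hrefs = [link.get_attribute('href') for link in links]
#   for url, problem in find_broken_links(hrefs, requests.head):
#       print(url, problem)
```

Passing the request function in as a parameter keeps the filtering logic testable without network access. Some servers do not answer HEAD requests properly, so a link reported here may still be worth re-checking with a GET request.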

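Update

As per your counter question, it is hard to give a canonical answer, from Selenium's perspective, on why the XPath worked while the tag name did not. One visible difference is that the a selector matches every anchor on the page, including any without an href, whereas the XPath above matches only the anchors that wrap result titles. You may want to dig deeper into these discussions: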

Bug 1323614 - Cannot authenticate: requests.exceptions.MissingSchema: Invalid URL 'stage/auth/token/obtain/': No schema supplied.

Invalid URL 'None': No schema supplied. Perhaps you meant http://None?

Regarding python-3.x - requests.exceptions.MissingSchema: Invalid URL 'None': No schema supplied while trying to find broken links through Selenium and Python, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/54325867/
