Python 类型错误 : 'NoneType' object is not iterable-6ren

Python 类型错误 : 'NoneType' object is not iterable

转载作者：行者123 更新时间：2023-11-28 21:53:16

我已经看了其他帖子，好像和我有同样的问题，但我的问题还没有解决......

我正在尝试提取域名列表的谷歌网页排名，在本例中为“domain_list”。下面是我正在使用的代码。我一直收到此错误，但无法真正弄清其根本原因。

import struct
import sys
import urllib
import urllib2
import httplib
import re
import xml.etree.ElementTree


domain_list = open('/data/personal/samaneh/test.txt','r')

class RankProvider(object):
    """Abstract class for obtaining the page rank (popularity)
        from a provider such as Google or Alexa.
        """
    def __init__(self, host, proxy=None, timeout=30):
        """Keyword arguments:
            host -- toolbar host address
            proxy -- address of proxy server. Default: None
            timeout -- how long to wait for a response from the server.
            Default: 30 (seconds)
            """
        self._opener = urllib2.build_opener()
        if proxy:
            self._opener.add_handler(urllib2.ProxyHandler({"http": proxy}))

        self._host = host
        self._timeout = timeout

    def get_rank(self, url):
        """Get the page rank for the specified URL
            Keyword arguments:
            url -- get page rank for url
            """
        raise NotImplementedError("You must override get_rank()")


class AlexaTrafficRank(RankProvider):
    """ Get the Alexa Traffic Rank for a URL
        """
    def __init__(self, host="xml.alexa.com", proxy=None, timeout=30):
        """Keyword arguments:
            host -- toolbar host address: Default: joolbarqueries.google.com
            proxy -- address of proxy server (if required). Default: None
            timeout -- how long to wait for a response from the server.
            Default: 30 (seconds)
            """
        super(AlexaTrafficRank, self).__init__(host, proxy, timeout)

    def get_rank(self, url):
        """Get the page rank for the specified URL
            Keyword arguments:
            url -- get page rank for url
            """
        query = "http://%s/data?%s" % (self._host, urllib.urlencode((
                                                                     ("cli", 10),
                                                                     ("dat", "nsa"),
                                                                     ("ver", "quirk-searchstatus"),
                                                                     ("uid", "20120730094100"),
                                                                     ("userip", "192.168.0.1"),
                                                                     ("url", url))))

        response = self._opener.open(query, timeout=self._timeout)
        if response.getcode() == httplib.OK:
            data = response.read()

            element = xml.etree.ElementTree.fromstring(data)
            for e in element.find("SD"):
                popularity = e.find("POPULARITY")
                if popularity is not None:
                    return int(popularity.get("TEXT"))


class GooglePageRank(RankProvider):
    """ Get the google page rank figure using the toolbar API.
        Credits to the author of the WWW::Google::PageRank CPAN package
        as I ported that code to Python.
        """
    def __init__(self, host="toolbarqueries.google.com", proxy=None, timeout=30):
        """Keyword arguments:
            host -- toolbar host address: Default: toolbarqueries.google.com
            proxy -- address of proxy server (if required). Default: None
            timeout -- how long to wait for a response from the server.
            Default: 30 (seconds)
            """
        super(GooglePageRank, self).__init__(host, proxy, timeout)
        self._opener.addheaders = [("User-agent", "Mozilla/4.0 (compatible; \
                                    GoogleToolbar 2.0.111-big; Windows XP 5.1)")]

    def get_rank(self, url):
        # calculate the hash which is required as part of the get
        # request sent to the toolbarqueries url.
        ch = '6' + str(self._compute_ch_new("info:%s" % (url)))

        query = "http://%s/tbr?%s" % (self._host, urllib.urlencode((
                                                                    ("client", "navclient-auto"),
                                                                    ("ch", ch),
                                                                    ("ie", "UTF-8"),
                                                                    ("oe", "UTF-8"),
                                                                    ("features", "Rank"),
                                                                    ("q", "info:%s" % (url)))))

        response = self._opener.open(query, timeout=self._timeout)
        if response.getcode() == httplib.OK:
            data = response.read()
            match = re.match("Rank_\d+:\d+:(\d+)", data)
            if match:
                rank = match.group(1)
                return int(rank)

    @classmethod
    def _compute_ch_new(cls, url):
        ch = cls._compute_ch(url)
        ch = ((ch % 0x0d) & 7) | ((ch / 7) << 2);

        return cls._compute_ch(struct.pack("<20L", *(cls._wsub(ch, i * 9) for i in range(20))))

    @classmethod
    def _compute_ch(cls, url):
        url = struct.unpack("%dB" % (len(url)), url)
        a = 0x9e3779b9
        b = 0x9e3779b9
        c = 0xe6359a60
        k = 0

        length = len(url)

        while length >= 12:
            a = cls._wadd(a, url[k+0] | (url[k+1] << 8) | (url[k+2] << 16) | (url[k+3] << 24));
            b = cls._wadd(b, url[k+4] | (url[k+5] << 8) | (url[k+6] << 16) | (url[k+7] << 24));
            c = cls._wadd(c, url[k+8] | (url[k+9] << 8) | (url[k+10] << 16) | (url[k+11] << 24));

            a, b, c = cls._mix(a, b, c)

            k += 12
            length -= 12

        c = cls._wadd(c, len(url));

        if length > 10: c = cls._wadd(c, url[k+10] << 24)
        if length > 9: c = cls._wadd(c, url[k+9] << 16)
        if length > 8: c = cls._wadd(c, url[k+8] << 8)
        if length > 7: b = cls._wadd(b, url[k+7] << 24)
        if length > 6: b = cls._wadd(b, url[k+6] << 16)
        if length > 5: b = cls._wadd(b, url[k+5] << 8)
        if length > 4: b = cls._wadd(b, url[k+4])
        if length > 3: a = cls._wadd(a, url[k+3] << 24)
        if length > 2: a = cls._wadd(a, url[k+2] << 16)
        if length > 1: a = cls._wadd(a, url[k+1] << 8)
        if length > 0: a = cls._wadd(a, url[k])

        a, b, c = cls._mix(a, b, c);

        # integer is always positive
        return c

    @classmethod
    def _mix(cls, a, b, c):
        a = cls._wsub(a, b); a = cls._wsub(a, c); a ^= c >> 13;
        b = cls._wsub(b, c); b = cls._wsub(b, a); b ^= (a << 8) % 4294967296;
        c = cls._wsub(c, a); c = cls._wsub(c, b); c ^= b >>13;
        a = cls._wsub(a, b); a = cls._wsub(a, c); a ^= c >> 12;
        b = cls._wsub(b, c); b = cls._wsub(b, a); b ^= (a << 16) % 4294967296;
        c = cls._wsub(c, a); c = cls._wsub(c, b); c ^= b >> 5;
        a = cls._wsub(a, b); a = cls._wsub(a, c); a ^= c >> 3;
        b = cls._wsub(b, c); b = cls._wsub(b, a); b ^= (a << 10) % 4294967296;
        c = cls._wsub(c, a); c = cls._wsub(c, b); c ^= b >> 15;

        return a, b, c

    @staticmethod
    def _wadd(a, b):
        return (a + b) % 4294967296

    @staticmethod
    def _wsub(a, b):
        return (a - b) % 4294967296


if __name__ == "__main__":
    for url in domain_list:
#    url = "http://www.archlinux.org"
        providers = (AlexaTrafficRank(), GooglePageRank(),)
        print("Traffic stats for: %s" % (url))
        for p in providers:
            print("%s:%s" % (p.__class__.__name__, p.get_rank(url)))

这是我得到的完整错误:

Traceback (most recent call last):
  File "test-alexa-ranking.py", line 187, in <module>
    print("%s:%s" % (p.__class__.__name__, p.get_rank(url)))
  File "test-alexa-ranking.py", line 69, in get_rank
    for e in element.find("SD"):
TypeError: 'NoneType' object is not iterable

有人可以帮我吗？

最佳答案

element.find() 仅返回 one 匹配项(如果存在)，否则返回 None。

您要么想使用 element.find_all()，它将始终返回一个匹配项列表(可以为空)，要么首先测试是否存在匹配项:

match = element.find('SD')
if not match:
    return None

鉴于您确实在寻找包含的元素，您可以使用 XPath expression这里:

# find the first POPULARITY tag directly under an SD tag with a TEXT attribute
pop = element.find('.//SD/POPULARITY[@TEXT]')
if pop is not None:
    return int(pop.attrib['TEXT'])

这看起来确实适用于 test XML document。 .

关于Python 类型错误 : 'NoneType' object is not iterable，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/26935783/

文章推荐： python - Python 中的基本转换器

文章推荐： ios - 从类方法中的 block 返回值

文章推荐： Python hangman，替换字母？

python - * : 'NoneType' and 'NoneType' (Python) 不支持的操作数类型
我目前正在尝试定义函数，但遇到了这个错误。我只是想做一个简单的函数，用户输入 2 个数字，然后将它们相乘。也请尽可能简单地解释我做错了什么。 (我是菜鸟) def userinput(): w
Python-类型错误 : unorderable types: NoneType() < NoneType()
使用IPtools python 包我试图查看 IP 地址是否在特定范围内。这是我的代码: for line in g: org= line.split("|")[0] ranges
Python:无法摆脱 TypeError:+ 不支持的操作数类型: 'NoneType' 和 'NoneType'
输入 [['1','2','3'],['a','b','c'],['6','7','8'],['e','f','g']] 输出应该是: 1, 2, 3a, b, c6, 7, 8e, f, g Cod
Python:使用 lambda 时，& 不支持的操作数类型: 'NoneType' 和 'NoneType'
我目前正在使用 lambda 使 tkinter 按钮依次执行两件事: def classManip(): cManip = tk.Toplevel() cManip.title
python - 我不断收到 "TypeError: unsupported operand type(s) for -: ' NoneType' 和 'NoneType' ”
我正在学习Python，作为练习，我编写了一些代码来查找用户定义函数的导数。代码如下。 def fx(value, function): x = value return eval(f
python - 类型错误 : 'NoneType' object is not subscriptable followed by AttributeError: 'NoneType' object has no attribute 'split'
使用 Django。我有以下模型: class Postagem(models.Model): id = models.AutoField(primary_key=True, editable=Fal
python-2.7 - feature_importances_ 在 ExtraTreesClassifier :TypeError: 'NoneType' object is not iterable 中显示为 NoneType
我正在尝试为给定的数据集选择重要的特征(或者至少了解哪些特征解释更多的变异性)。为此，我使用 ExtraTreesClassifier 和 GradientBoostingRegressor - 然后
Diffusers SDXL "TypeError: argument of type 'NoneType' is not iterable"(Differs SDXL“TypeError：‘NoneType’类型的参数不可迭代”)
刚刚获得了SDXL模型的访问权限，希望为即将发布的版本进行测试...不幸的是，我们当前用于我们服务的代码似乎不能与稳定ai/稳定-扩散-xl-base-0.9一起工作，我不完全确定SDXL有什么不同，
What causes `None` results from BeautifulSoup functions? How can I avoid "AttributeError: 'NoneType' object has no attribute..." with BeautifulSoup?(是什么原因导致了BeautifulSoup函数的`None‘结果？如何避免“AttributeError：‘NoneType’对象没有属性...”配上美人汤？)
通常，当我尝试使用BeautifulSoup解析网页时，BeautifulSoup函数会得到NONE结果，否则就会引发AttributeError。。以下是一些独立的(即，由于数据是硬编码的，不需要访
What causes `None` results from BeautifulSoup functions? How can I avoid "AttributeError: 'NoneType' object has no attribute..." with BeautifulSoup?(是什么原因导致了BeautifulSoup函数的`None‘结果？如何避免“AttributeError：‘NoneType’对象没有属性...”配上美人汤？)
通常，当我尝试使用BeautifulSoup解析网页时，BeautifulSoup函数会得到NONE结果，否则就会引发AttributeError。。以下是一些独立的(即，由于数据是硬编码的，不需要访
python - 检查可迭代列表中的 NoneTypes
我想遍历可迭代列表，但要求某些元素的类型可以是 None。这看起来像这样: none_list = [None, [0, 1]] for x, y in none_list: print("
Python: 'NoneType' 对象不可在非空查询结果上订阅
我得到object is not subscriptable在非空查询结果上。当我打印时 c.fetchone()它打印了正确的结果，但是当我检查类型时它显示 import sqlite3 conn
python - NoneType 即使定义了属性也会出错？
我在第 15 行收到此错误，但我不明白为什么。有任何想法吗？看来属性已经明确定义了，所以我完全不知所措。任何帮助将非常感激。AttributeError:“NoneType”对象没有属性“Sheets
python - 子类化对象导致 NoneType
我尝试对 Chrome WebDriver 进行子类化以包含一些初始化和清理代码，但随后 Python 提示创建的对象设置为 None: import glob import selenium imp
Python: 'NoneType' 对象不可下标'错误
这个问题已经有答案了: Why do I get AttributeError: 'NoneType' object has no attribute 'something'? (10 个回答) 已关
python - 为什么返回 Nonetype？
这个问题已经有答案了: Why does the print function return None? (1 个回答) 已关闭 6 年前。我对 Python 还很陌生。我正在制作一个生成器，可以为
Python 在解析表时处理 NoneType
我正在尝试比较两个表( table_a 和 table_b )并减去 table_a 的最后一列从table_b的最后一列开始。但是，table_a 包含一个额外的行，导致我得到 NoneType错误
python NoneType 对象不可迭代
当“文件名”是一个存在的文件时，这段代码运行良好……但是当它不存在时……我不断收到同样的错误:TypeError: 'NoneType' 对象不可迭代 (Errno 2) 尽管我从不迭代任何东西，除非
python - NoneType 对象不可迭代
我在下面的代码中收到“NoneType”对象不可迭代的 TypeError。下面的代码用于使用 pyautogui 滚动 digits 文件夹中的 10 张图像(命名为 0 到 9，以图像中的 # 命
Python 将模块名称视为 'NoneType'
我有一段代码表现得很奇怪。一开始，我导入了一个模块，它是 C 库的 python 绑定(bind)。 try: import pyccn except: print "ERROR:

行者123

个人简介

我是一名优秀的程序员,十分优秀！

作者热门文章

滴滴打车优惠券免费领取

全站热门文章

首页

博学

6Ren·AI

商城

Python 类型错误 : 'NoneType' object is not iterable