
python - "TypeError: expected string or buffer" when filtering a list with a regular expression

Author: 太空宇宙 · Updated: 2023-11-03 17:06:01

I am currently trying to apply a regular expression in order to filter out certain links from a list of links.

I have tried several approaches by now, but I always get this error:

Traceback (most recent call last):
File "/Users/User/Documents/pyp/pushbullet_updater/DoDa/test.py", line 20, in <module>
print(get_chapter_links(links))
File "/Users/User/Documents/pyp/pushbullet_updater/DoDa/test.py", line 15, in get_chapter_links
match = re.findall(r"https://bluesilvertranslations\.wordpress\.com/\d{4}/\d{2}/\d{2}/douluo-dalu-\d{1,3}-\s*/", link)
File "/Library/Frameworks/Python.framework/Versions/3.4/lib/python3.4/re.py", line 210, in findall
return _compile(pattern, flags).findall(string)
TypeError: expected string or buffer

What am I doing wrong?

Here is the code:

import requests
from bs4 import BeautifulSoup
import re

#Gets chapter links
def get_chapter_links(index_url):
    r = requests.get(index_url)
    soup = BeautifulSoup(r.content, 'lxml')
    links = soup.find_all('a')
    url_list = []
    for url in links:
        url_list.append((url.get('href')))

    for link in url_list: # Iterates through every line and looks for a match:
        match = re.findall(r"https://bluesilvertranslations\.wordpress\.com/\d{4}/\d{2}/\d{2}/douluo-dalu-\d{1,3}-\s*/", link)
    return match

links = 'https://bluesilvertranslations.wordpress.com/chapter-list/'

print(get_chapter_links(links))

Best Answer

From the re documentation:

re.findall(pattern, string, flags=0)
Return all non-overlapping matches of pattern in string, as a list of strings. The string is scanned left-to-right, and matches are returned in the order found. If one or more groups are present in the pattern, return a list of groups; this will be a list of tuples if the pattern has more than one group. Empty matches are included in the result unless they touch the beginning of another match.

New in version 1.5.2.

Changed in version 2.4: Added the optional flags argument.
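
A minimal illustration of the quoted behaviour (the pattern and sample text below are made up for demonstration only):

import re

text = "douluo-dalu-1-chapter-1 and douluo-dalu-2-chapter-5"

# Without groups, findall returns a list of the full matches (strings).
print(re.findall(r"douluo-dalu-\d+", text))
# ['douluo-dalu-1', 'douluo-dalu-2']

# With more than one group, findall returns a list of tuples instead.
print(re.findall(r"douluo-dalu-(\d+)-chapter-(\d+)", text))
# [('1', '1'), ('2', '5')]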

Note:

  • The first argument should be the pattern and the second should be the string (see the sketch below).
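
In your code the arguments are already in that order, but the second argument is sometimes not a string at all: soup.find_all('a') also returns <a> tags that have no href attribute, for which url.get('href') yields None, and that is what re.findall raises the TypeError on. A small sketch of the failure mode (the exact error wording varies between Python versions):

import re

pattern = r"douluo-dalu-\d{1,3}"

print(re.findall(pattern, "douluo-dalu-42"))  # works: ['douluo-dalu-42']

link = None  # what url.get('href') returns for an <a> tag with no href
re.findall(pattern, link)  # TypeError: expected string or buffer
                           # (newer Pythons say "expected string or bytes-like object")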

Modified code:

import requests
from bs4 import BeautifulSoup
import re

# Gets chapter links
def get_chapter_links(index_url):
    r = requests.get(index_url)
    soup = BeautifulSoup(r.content, 'lxml')
    links = soup.find_all('a')
    url_list = []
    for url in links:
        url_list.append(url.get('href'))

    match = []  # Collect the matched links in a list
    for link in url_list:  # Iterate through every href and look for a match
        if link:  # Skip None: <a> tags without an href yield None from .get('href')
            # The regex was changed slightly (-.*/ instead of -\s*/) because the original pattern did not match
            match += re.findall(r"https://bluesilvertranslations\.wordpress\.com/\d{4}/\d{2}/\d{2}/douluo-dalu-\d{1,3}-.*/", link)
    return match

links = 'https://bluesilvertranslations.wordpress.com/chapter-list/'

print(get_chapter_links(links))
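
As a side note, BeautifulSoup can also do the filtering itself: passing a compiled regex as the href argument to find_all only returns <a> tags whose href exists and matches the pattern, so no None values are collected in the first place. This is just a sketch under the assumption that the page structure is the same as above:

import re
import requests
from bs4 import BeautifulSoup

def get_chapter_links(index_url):
    r = requests.get(index_url)
    soup = BeautifulSoup(r.content, 'lxml')
    # href=<compiled regex> keeps only <a> tags whose href attribute matches the pattern
    pattern = re.compile(r"https://bluesilvertranslations\.wordpress\.com/\d{4}/\d{2}/\d{2}/douluo-dalu-\d{1,3}-.*/")
    return [a['href'] for a in soup.find_all('a', href=pattern)]

print(get_chapter_links('https://bluesilvertranslations.wordpress.com/chapter-list/'))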

Regarding python - "TypeError: expected string or buffer" when filtering a list with a regular expression, we found a similar question on Stack Overflow: https://stackoverflow.com/questions/34585286/
