python - PRAW/Tweepy 过滤关键字-6ren

python - PRAW/Tweepy 过滤关键字

转载作者：太空宇宙更新时间：2023-11-03 17:03:36

24

4

所以我在过滤我的虾的结果时遇到了一些问题。我想在结果中排除诸如([request]、[off topic] 或 [nsfw])之类的关键字。我不想在 tweepy 上发布类似 praw 结果中的帖子。我正在寻找文档，但在 PRAW 网站上找不到任何内容。

这是我的代码:

def poster():
conn = sqlite3.connect('jb_id.db')
c = conn.cursor()
toTweet = []
for submission in reddit.subreddit(SUB).hot(limit=POST_LIMIT):
    if not submission.stickied and len(submission.title) < 255:    
        url = submission.shortlink
        title = submission.title
        udate = time.strftime("%Y-%m-%d %X",time.gmtime(submission.created_utc))

        try:
            # This keeps a record of the posts in a the database
            c.execute("INSERT INTO posts (id, title, udate) VALUES (?, ?, ?)",
            (url, title, udate))
            conn.commit()


            message = title + " " + url
            print(message)
            toTweet.append(message)

        except sqlite3.IntegrityError:
            # This means the post was already tweeted and is ignored
            print("Duplicate", url)

c.close()
conn.close()
tweeter(toTweet)

如您所见，我排除了超过 255 个字符的标签和标题。我想知道是否有一种方法可以用我上面提到的关于 praw 的结果的关键字来过滤 reddit 上的帖子。谢谢!

最佳答案

列出不应出现在提交标题中的关键字

bad_keywords = "[request]", "[off topic]", "[nsfw]"

如果提交标题包含列表中的项目，则跳过循环

title_lowercase = submission.title.lower()
if any(x in title_lowercase for x in bad_keywords):
    continue

我会将其与您的其他排除项结合使用以减少缩进并使其更具可读性

bad_title = any(x in title_lowercase for x in bad_keywords)
skip_submission = submission.stickied and len(submission.title) > 255 and bad_title
if skip_submission:
    continue

完整的解决方案

def poster():
conn = sqlite3.connect('jb_id.db')
c = conn.cursor()
toTweet = []

bad_keywords = "[request]", "[off topic]", "[nsfw]"

for submission in reddit.subreddit(SUB).hot(limit=POST_LIMIT):
    title = submission.title
    title_lowercase = title.lower()

    bad_title = any(x in title_lowercase for x in bad_keywords)
    skip_submission = submission.stickied and len(submission.title) > 255 and bad_title

    if skip_submission:
        continue

    url = submission.shortlink
    udate = time.strftime("%Y-%m-%d %X",time.gmtime(submission.created_utc))

    try:
        # This keeps a record of the posts in a the database
        c.execute("INSERT INTO posts (id, title, udate) VALUES (?, ?, ?)",
        (url, title, udate))
        conn.commit()


        message = title + " " + url
        print(message)
        toTweet.append(message)

    except sqlite3.IntegrityError:
        # This means the post was already tweeted and is ignored
        print("Duplicate", url)

c.close()
conn.close()
tweeter(toTweet)

关于python - PRAW/Tweepy 过滤关键字，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/58799355/

24

4

0

文章推荐： c# - 如何让我的 C# PictureBox 传播鼠标事件？ (提供截图)

文章推荐： python - 在 numpy 中按间隔分割数组的简单方法

文章推荐： Ruby 捕获两个冒号之间的单词

C++ 对象创建时没有使用 new 关键字，但在构造函数中使用了 new 关键字
如果我创建一个对象时没有使用 new 关键字，例如“Object s(someval)”，但该对象的构造函数使用了 new，当该对象超出范围时，是否会调用析构函数为其分配新的空间？我感觉好像是，但我不
Sql ONLY 关键字
在 SQL 语法中，我发现奇怪的规则表明 select * from ONLY (t1)是有效的 SQL。我的问题是:什么是 ONLY在这种情况下是什么意思？它在规范的“7.6 table ref
jQuery $(this) 关键字
为什么使用 $(this) 而不是重新选择类很重要？我在代码中使用了大量的动画和 CSS 编辑，并且我知道可以使用 $(this) 来简化它。最佳答案当您通过 jQuery 执行 DOM 查询(
Mysql IN 关键字
我正在尝试使用 IN 关键字编写查询。表A 属性标识、属性名称表B key 、属性标识、属性值根据提供的 key ，我想返回所有 attrName、attrVal 组合。结果将包含两个表中的列。
MySQL AS 关键字
这个问题在这里已经有了答案: Why would you use "AS" when aliasing a SQL table? (8 个答案) 关闭 9 年前。我不擅长写查询，但是从我开始使用
java this 关键字
我读过，在 Java 中，您不必将 this 关键字显式绑定(bind)到对象，它由解释器完成。它与 Javascript 相反，在 Javascript 中你总是必须知道 this 的值。但是 Ja
Swift "with"关键字
Swift 中“with”关键字的用途是什么？到目前为止，我发现如果您需要覆盖现有的全局函数，例如 toDebugString，可以使用该关键字。 // without "with" you
C# where 关键字
这个问题在这里已经有了答案: What does the keyword "where" in a class declaration do? (7 个答案) 关闭 9 年前。在下面的一段代码中(
Swift "where"关键字
免责声明:swift 菜鸟您好，我刚刚开始学习 Swift，正在学习 Swift 编程语言(Apple 在 WWDC 期间发布的书籍)，并且想知道“where”关键字是什么。它用于 let vege
去 "this"-关键字
深入研究文档后，我找不到以下问题的答案: 是否有任何理由反对使用 this 来引用当前对象，如下例所示？ type MyStruct struct { someField string } fun
PHP面向对象学习之parent::关键字
前言最近在做THINKPHP开发项目中，用到了 parent:: 关键字，实际上 parent::关键字是PHP中常要用到的一个功能，这不仅仅是在 THINKPHP 项目开发中，即使是一个小型
详谈signed 关键字
我们都知道且经常用到 unsigned 关键字，但有没有想过，与此对应的 signed 关键字有啥用？复制代码代码如下: int i = 0; signed
彻底理解Java中this 关键字
this关键字再java里面是一个我认为非常不好理解的概念，：）也许是太笨的原因 this 关键字的含义：可为以调用了其方法的那个对象生成相应的句柄。怎么理解这段话呢？ thinking i
初识 synchronized 关键字
一什么是 synchronized synchronized 关键字提供了一种锁机制，能够确保共享变量互斥访问，从而防止数据不一致问题的出现。 synchronized 关键字包括 monitor
深入解析 synchronized 关键字
最近看了几篇 synchronized 关键字的相关文章，收获很大，想着总结一下该关键字的相关内容。 1、synchronized 的作用原子性：所谓原子性就是指一个操作或者多个操作，要么全部执行并
JavaScript 方法和 this 关键字
在本教程中，您将借助示例了解 JavaScript 对象方法和 this 关键字。在 JavaScript 中，对象也可以包含函数。例如， // object containing meth
PHP "with"关键字 - "with"有什么作用？
有人可以解释一下 PHP“with”的作用吗？示例开始: 假设我有一个类: \App\fa_batch 这句话有什么区别: $w = (with (new \App\fa_batch))
typescript - 显式类型注释与 "as"关键字
这个问题在这里已经有了答案: What is the difference between using the colon and as syntax for declaring type? (2
tsql - IN 关键字与 OR 关键字
如果我在 WHERE 子句中使用以下任一项，是否会有很大不同: WHERE [Process Code] = 1 AND ([Material ID] = 'PLT' OR [Material ID]
sql - 关键字 'PROCEDURE'附近的语法不正确
This question is unlikely to help any future visitors; it is only relevant to a small geographic are

首页

博学

6Ren·AI

商城

python - PRAW/Tweepy 过滤关键字