gpt4 book ai didi

python - 如何获取具有给定主题标签位置的推文

转载 作者:太空宇宙 更新时间:2023-11-04 04:07:43 25 4
gpt4 key购买 nike

我尝试只获取带有#not 标签的推文,但前提是标签位于推文末尾且不在文本中。我正在使用 tweepy.Cursor

此代码已经有效。它为我提供带有#not 的推文,但不关心#not 的位置。

import tweepy
consumer_key = 'consumer key'
consumer_secret = 'consumer secret'
access_token = 'access token'
access_token_secret = 'access token secret'

auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_token, access_token_secret)
api = tweepy.API(auth,wait_on_rate_limit=True)

for tweet in tweepy.Cursor(api.search,q="#not",count=5,
lang="en",
since="2017-04-03").items():
print (tweet.created_at, tweet.text)

最佳答案

编辑:您可以使用正则表达式来检查您的主题标签是否位于一组尾随的主题标签中:

import tweepy
import re

consumer_key = 'consumer key'
consumer_secret = 'consumer secret'
access_token = 'access token'
access_token_secret = 'access token secret'

auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_token, access_token_secret)
api = tweepy.API(auth,wait_on_rate_limit=True)

# Regular expression to check if tweet ends with our hashtag and maybe more hashtags
rgx = re.compile(r"#not(\s+#\w+)*$", re.IGNORECASE)
for tweet in tweepy.Cursor(api.search,q="#not",count=5,
lang="en",
since="2017-04-03").items():
# Keep only tweets with the hashtag at the end
if rgx.search(tweet.text):
print (tweet.created_at, tweet.text)

您可以过滤推文以仅保留符合您要求的推文:

import tweepy
consumer_key = 'consumer key'
consumer_secret = 'consumer secret'
access_token = 'access token'
access_token_secret = 'access token secret'

auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_token, access_token_secret)
api = tweepy.API(auth,wait_on_rate_limit=True)

for tweet in tweepy.Cursor(api.search,q="#not",count=5,
lang="en",
since="2017-04-03").items():
# Keep only tweets with the hashtag at the end
if tweet.text.lower().endswith('#not'):
print (tweet.created_at, tweet.text)

关于python - 如何获取具有给定主题标签位置的推文,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/56935148/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com