gpt4 book ai didi

python - 获取 time_zone 和 utc_offset 的空值

转载 作者:行者123 更新时间:2023-12-01 09:20:28 28 4
gpt4 key购买 nike

output file of the code. 我在输出文本文件中将 time_zone 和 UTC_offset 的值设置为 null。我需要为每条推文提供一个非 null 且不同的值,即如果印度某人发布一条推文,则该推文的 UTC_offset 和 time_zone 将是与在美国发布的推文不同,我需要那些非空值。此外,如果我使用 dataframe.to_json 转换,我会得到 tweet.created_at 的错误格式,但如果我使用 dataframe.to_csv 转换,则情况并非如此。有人可以吗请解释?P.S-Python 和 tweepy 初学者

import pandas as pd
from datetime import datetime, date, time, timedelta
import json
from dateutil.tz import tzoffset

# Variables that contains the user credentials to access Twitter API
consumer_key = 'mine'
consumer_secret = 'mine'
access_token = 'mine'
access_token_secret = 'mine`enter code here`'

auth = tweepy.OAuthHandler(consumer_key, consumer_secret)
auth.set_access_token(access_token, access_token_secret)


api = tweepy.API(auth)
# Open/Create a file to append data
#csvFile = open('tweet.txt', 'a')
#Use csv Writer
#csvWriter = csv.writer(csvFile,delimiter=',')
results=[]

for tweet in tweepy.Cursor(api.search, q="Depression" or "Anxiety",lang="en").items(30):
if (not tweet.retweeted) and ('RT @' not in tweet.text):
results.append(tweet)


def tweets_df(results):
id_list = [tweet.id for tweet in results]
data_set = pd.DataFrame(id_list, columns=["id"])
data_set["text"] = [tweet.text for tweet in results]
data_set["source"] = [tweet.source for tweet in results]
data_set["screen_name"] = [tweet.user.screen_name for tweet in results]

#localtime_tz = tzoffset(user.time_zone, user.utc_offset)
data_set["created_at"] = [tweet.user.created_at for tweet in results]
# data_set["place"]=[tweet.place for tweet in results]
data_set["location"] = [tweet.user.location for tweet in results]
data_set["UTC_Offset"] = [tweet.user.utc_offset for tweet in results]
data_set["timezone"] = [tweet.user.time_zone for tweet in results]
# data_set["year"] = [tweet.created_at.year for tweet in results]
# data_set["month"] = [tweet.created_at.month for tweet in results]
#data_set["day"] = [tweet.created_at.day for tweet in results]
#data_set["hour"] = [tweet.created_at.hour for tweet in results]
return data_set


data_set = tweets_df(results)
#data_set.to_csv("/home/rajneeshkaushal/Documents/Pycharm/hived/tweet_data.txt",header=None)
out = data_set.to_json(orient='records')[1:-1].replace('},{', '} {')
with open('test.txt', 'w') as f:
f.write(out)

最佳答案

我想这个链接部分回答了您的问题:https://twittercommunity.com/t/upcoming-changes-to-the-developer-platform/104603

根据新的欧盟隐私法,Twitter 用户对象中的时区值将在 5 月 23 日之后成为私有(private)字段。

关于python - 获取 time_zone 和 utc_offset 的空值,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/50830126/

28 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com