R情感分析； 'lexicon' 未找到； 'sentiments' 已损坏？-6ren

R情感分析； 'lexicon' 未找到； 'sentiments' 已损坏？

转载作者：行者123 更新时间：2023-12-01 16:54:23

25

4

我正在尝试关注this情感分析在线教程。代码:

new_sentiments <- sentiments %>% #From the tidytext package
  filter(lexicon != "loughran") %>% #Remove the finance lexicon
  mutate( sentiment = ifelse(lexicon == "AFINN" & score >= 0, "positive",
                         ifelse(lexicon == "AFINN" & score < 0,
                                "negative", sentiment))) %>%
  group_by(lexicon) %>%
  mutate(words_in_lexicon = n_distinct(word)) %>%
  ungroup()

产生错误:

>Error in filter_impl(.data, quo) : 
>Evaluation error: object 'lexicon' not found.

相关的，也许是在我看来，“情绪”表表现得很奇怪(已损坏？)。这是“情绪”的要点:

> head(sentiments,3)
>  element_id sentence_id word_count sentiment                                  
> chapter
> 1          1           1          7         0 The First Book of Moses:  
> Called Genesis
> 2          2           1         NA         0 The First Book of Moses:  
> Called Genesis
> 3          3           1         NA         0 The First Book of Moses:  > 
> Called Genesis
>                                  category
> 1 The First Book of Moses:  Called Genesis
> 2 The First Book of Moses:  Called Genesis
> 3 The First Book of Moses:  Called Genesis

如果我对 bing、AFINN 或 NRC 使用 Get_Sentiments，我会得到看起来合适的响应:

>  get_sentiments("bing")
> # A tibble: 6,788 x 2
>   word        sentiment
>   <chr>       <chr>    >   1 2-faced     negative 
> 2 2-faces     negative 
> 3 a+          positive 
> 4 abnormal    negative

我尝试删除(remove.packages)并重新安装 tidytext；行为没有改变。我正在运行 R 3.5

即使我完全误解了这个问题，我也会很感激任何人能给我的任何见解。

最佳答案

以下说明将修复 new_sentiments 数据集，如 Data Camp tutorial 中所示。 .

bing <- get_sentiments("bing") %>% 
     mutate(lexicon = "bing", 
            words_in_lexicon = n_distinct(word))    

nrc <- get_sentiments("nrc") %>% 
     mutate(lexicon = "nrc", 
            words_in_lexicon = n_distinct(word))

afinn <- get_sentiments("afinn") %>% 
     mutate(lexicon = "afinn", 
            words_in_lexicon = n_distinct(word))

new_sentiments <- bind_rows(bing, nrc, afinn)
names(new_sentiments)[names(new_sentiments) == 'value'] <- 'score'
new_sentiments %>% 
     group_by(lexicon, sentiment, words_in_lexicon) %>% 
     summarise(distinct_words = n_distinct(word)) %>% 
     ungroup() %>% 
     spread(sentiment, distinct_words) %>% 
     mutate(lexicon = color_tile("lightblue", "lightblue")(lexicon), 
            words_in_lexicon = color_bar("lightpink")(words_in_lexicon)) %>% 
     my_kable_styling(caption = "Word Counts per Lexicon")

后续图表也将起作用!

关于R情感分析； 'lexicon' 未找到； 'sentiments' 已损坏？，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/51127671/

25

4

0

文章推荐： java - 了解 Maven 存储库和 Artifactory(如 Nexus)

文章推荐： java - 检测信标的检测顺序？

文章推荐： java - 在 PageFactory 中获取 NPE NullPointerException (Selenium+Java)

文章推荐： java - 如何在字符串中插入斜杠？

sentiment-analysis - Sentiment Analysis 中文 - 字典
关闭。这个问题是off-topic .它目前不接受答案。想改进这个问题吗？ Update the question所以它是on-topic用于堆栈溢出。关闭 9 年前。 Improve this
sentiment-analysis - 情绪分析
在进行情感分析时，如何让机器理解我指的是苹果(iphone)，而不是苹果(水果)？谢谢你的建议! 最佳答案嗯，有几种方法，我会从检查大写字母开始，通常，当提到一个名字时，第一个字母是大写的。在
sentiment-analysis - 现有的情感分析算法有哪些？
我和一群人正在开发一种情绪分析算法。我想知道哪些是现有的，因为我想比较它们。有没有文章有这方面的主要算法？提前致谢蒂亚戈最佳答案一些关于情感分析的论文可能对你有帮助—— Bo Pang, Li
sentiment-analysis - 语义分析开源工具——需要建议
很难说出这里要问什么。这个问题模棱两可、含糊不清、不完整、过于宽泛或夸夸其谈，无法以目前的形式得到合理的回答。如需帮助澄清此问题以便重新打开，visit the help center . 关闭 1
sentiment-analysis - 情感分析——评分和比较有什么作用？
我正在对一篇文章进行情感分析。我不知道如何使用情感分析来检查文章是正面的、负面的还是中立的。得分18，比较7.7% 最佳答案在您的文章中被检测为“正面”或“负面”的每个词都有一个分数(高于 0 表
sentiment-analysis - SentiWordNet 中的意义数是什么意思？
我想在我的项目中使用 SentiWordNet，但我无法弄清楚意义数字有什么作用？这是 SentiWordNet 单词列表的一部分； POS ID PosScore NegScore SynsetTe
sentiment-analysis - Textblob 情感算法
有谁知道 textblob 情绪是如何运作的？我知道它基于 Pattern 工作，但我找不到任何文章或文档解释模式如何为句子分配极性值。最佳答案下面是 textblog 情感模块的代码: http
java - 情绪分析 : more than 3 sentiments
我的应用需要情绪分析功能。我发现有很多服务和图书馆可以帮助完成这项任务。但它们中的大多数都有“三维”输出:文本可能被归类为“正面”、“负面”或“中性”。但如果我需要更多种类的选项怎么办？例如:“自信
Python - 使用 sentiment vader 从字符串中提取正面词
是否可以遍历一串词，使用情绪维达将它们分类为正面、负面或中性，然后如果它们是正面的，则将这些正面的词附加到列表中？下面的 for 循环是我想要完成的非工作代码。我是 Python 的初学者，所以如果有
sentiment-analysis - 在相关的不同主题的情感分析中处理(分数)分散的正确方法是什么？
我正在分析社交网络上的情绪。基于不同相关话题作为输入。我们如何处理个别主题分数的分散？例如:我们正在尝试对包含不同关键字的事件的主题进行情绪评分，假设主题是具有以下主题(关键字或同义词)的创新周
sentiment-analysis - 使用 tensorflow 进行情感分析
我正在探索tensorflow，并希望使用可用的选项进行情感分析。我看了下面的教程http://www.tensorflow.org/tutorials/recurrent/index.html#la
sentiment-analysis - 究竟什么是 n Gram？
我在 SO 上发现了上一个问题:N-grams: Explanation + 2 applications . OP给出了这个例子并询问它是否正确: Sentence: "I live in NY."
R情感分析； 'lexicon' 未找到； 'sentiments' 已损坏？
我正在尝试关注this情感分析在线教程。代码: new_sentiments % #From the tidytext package filter(lexicon != "loughran")
hadoop - pig :Twitter Sentiment Analysis
我正在尝试实现 Twitter 情绪分析。我需要获取所有正面推文和负面推文并将它们存储在特定的文本文件中。示例.json {"id": 252479809098223616, "created_at
sentiment-analysis - 一般来说，TF-IDF 什么时候会降低准确率？
我正在使用朴素贝叶斯模型将包含 200000 条评论的语料库训练成正面评论和负面评论，我注意到执行 TF-IDF 实际上将准确度降低了大约 2%(在对 50000 条评论的测试集进行测试时) .所以我
sentiment-analysis - Theano 分类任务总是给出 50% 的验证错误和测试错误？
我正在使用 Theano 的 DBN(深度信念网络)和 SDA(堆叠降噪自动编码器)示例进行文本分类实验。我已经生成了一个特征/标签数据集，就像生成 Theano 的 MINST 数据集一样，并更改了
stanford-nlp - 如何获取 CoreNLP Sentiment 的分数分布值？
我在我的 ubuntu 实例上设置了 CoreNLP 服务器，它工作正常。我对 Sentiment 模块更感兴趣，目前我得到的是 { sentimentValue: "2", sentiment: "
python - 有没有办法提高 nltk.sentiment.vader 情感分析的性能？
我的文字来源于一个社交网络，所以你可以想象它的本质，我认为文字是我想象中的干净和最小的；执行以下 sanitizer 后: 没有网址，没有用户名没有标点符号，没有重音符号没有数字没有停用词(我想
python - 在 Python Vader Sentiment 中添加特例习语
我一直在使用 Vader Sentiment 进行一些文本情感分析，我注意到我的数据中有很多“有待改进”的短语被错误地归类为中性: In[11]: sentiment('way to go John'
python - Python 中 nltk.sentiment.vader 的错误消息
我是 Python 的初学者，正在尝试使用 nltk.sentiment.vader，但尽管多次尝试修复它，但仍收到反复出现的错误消息。我之前安装了大部分 NTLK(3 个模块已过时，因此无法安装)。

首页

博学

6Ren·AI

商城

R情感分析； 'lexicon' 未找到； 'sentiments' 已损坏？