python - nltk中similar()和concordance的区别-6ren

python - nltk中similar()和concordance的区别

转载作者：太空狗更新时间：2023-10-30 01:50:18

24

4

我已经阅读了 this 中的 text1.similar("monstrous") 和 text1.concordance("monstrous")| .

对于自然语言的 text1.concordance('monstrous') 和 text1.similar('monstrous') 之间的区别，我无法得到满意的答案python 中的处理工具包。

请您举例详细说明一下好吗？

最佳答案

使用 concordance(token) 为您提供围绕参数 token 的上下文。它会向您显示 token 出现的句子。

使用 similar(token) 返回出现在与 token 相同上下文中的单词列表。在这种情况下，上下文只是直接位于 token 两侧的单词。

因此，查看 Moby Dick 文本 (text1)。我们可以检查 'monstrous'

的一致性

text1.concordance('monstrous')

# returns:
Displaying 11 of 11 matches:
ong the former , one was of a most monstrous size . ... This came towards us ,
ON OF THE PSALMS . " Touching that monstrous bulk of the whale or ork we have r
ll over with a heathenish array of monstrous clubs and spears . Some were thick
d as you gazed , and wondered what monstrous cannibal and savage could ever hav
that has survived the flood ; most monstrous and most mountainous ! That Himmal
they might scout at Moby Dick as a monstrous fable , or still worse and more de
th of Radney .'" CHAPTER 55 Of the Monstrous Pictures of Whales . I shall ere l
ing Scenes . In connexion with the monstrous pictures of whales , I am strongly
ere to enter upon those still more monstrous stories of them which are to be fo
ght have been rummaged out of this monstrous cabinet there is no telling . But
of Whale - Bones ; for Whales of a monstrous size are oftentimes cast up dead u

然后我们可以获得出现在与'monstrous' 相似上下文中的单词列表。第一个返回行的上下文是 'most _____ size'。

text1.similar('monstrous')

# returns:
determined maddens contemptible modifies abundant tyrannical puzzled
trustworthy impalpable gamesome curious mean pitiable untoward
christian subtly passing domineering uncommon true

如果我们使用单词 'true' 并检查它与 text.concordance('true') 的一致性，我们将得到 87 次使用中的前 25 次单词“true”。这不是很有用，但 NLTK 确实提供了一个称为 common_contexts 的附加方法，它显示何时使用单词列表共享相同的周围单词。

text1.common_contexts(['monstrous', 'true'])

# returns:
the_pictures

这个结果告诉我们 "the monstrous pictures" 和 "the true pictures" 都出现在 Moby Dick 中。

关于python - nltk中similar()和concordance的区别，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/43438008/

24

4

0

文章推荐： python - 从 Pandas apply 获取两个返回值

文章推荐： SQLPlus AUTO_INCREMENT 错误

文章推荐： database - 我可以对 PostgreSQL HStore 值使用聚合函数吗？

nodes - 如何使用 Concorde 解决 TSP？
我有 12 个节点和每对节点之间的距离(以米为单位)。节点指的是城市中的不同街道。我需要获得 TSP 的精确解(不是启发式的)所以我想用 Concorde 程序解决 TSP 问题，但我无法引入数据。
python - count() 和 concordance() 给出不同的计数
我正在尝试 python nltk。同时 >>> from nltk.book import * >>> text1.concordance("monstrous") Displaying 11 o
command-line - 使用 concordance=TRUE 从命令行扫描
我有使用 LaTeX Beamer 模板的文档，并在实际文档的开头传递此代码 \usepackage{Sweave} \begin{document} \SweaveOpts{concordance=
algorithm - 如何提高 `concorde` TSP 求解器的质量？我在滥用它吗？
我正在尝试使用以下格式在文件中使用 concorde TSP 求解器: NAME : p5 COMMENT : Nada TYPE : TSP DIMENSION : 20 EDGE_WEIGHT_T
python - nltk 中的 Text.concordance() 是否可作为分布式方法用于 pyspark
我正在使用 Spark 上的 NLTK 进行自然语言处理。尝试实现 http://www.nltk.org/book/ch03.html 中的3.1 从网络和磁盘访问文本部分。为此，我遵循了如何进行自
python - nltk.concordance 给出最多 25 行，无论我如何更改该参数
所以我开始通过 nltk 书学习 NLP，似乎我立即遇到了一个以前没有人提到的问题。让我们像书上说的那样从nltk.book导入数据: from nltk.book import * 现在我想继续书
java - 如何从 JNI 的现有 c 项目 (concorde) 创建 .dylib
我正在尝试使用 concorde在 Java 应用程序中但我想不通。我已经创建了 java 类如下: package jconcorde; public class TSP { static {

首页

博学

6Ren·AI

商城

python - nltk中similar()和concordance的区别