python - 如何更改我的代码以使字符串不更改为 float-6ren

python - 如何更改我的代码以使字符串不更改为 float

转载作者：行者123 更新时间：2023-12-01 06:44:55

25

4

我正在尝试编写一个检测假新闻的代码。不幸的是，我不断收到相同的错误消息。请有人解释一下我哪里出了问题？我从 https://data-flair.training/blogs/advanced-python-project-detecting-fake-news/ 得到了一些代码行以及 https://www.datacamp.com/community/tutorials/text-analytics-beginners-nltk 中的一些代码行。当我尝试组合两个不同的代码(通过删除重复的代码)时，我收到一条错误消息。

代码

%matplotlib inline
import pandas as pd
from pandas import DataFrame
import matplotlib.pyplot as plt
import matplotlib.patches as mpatches
import itertools
import json
import csv
from sklearn.model_selection import train_test_split
from sklearn.feature_extraction.text import TfidfVectorizer
from sklearn.linear_model import PassiveAggressiveClassifier  
from sklearn.metrics import accuracy_score, confusion_matrix
from sklearn.preprocessing import StandardScaler
from sklearn.decomposition import PCA
from sklearn.linear_model import LogisticRegression
from sklearn.naive_bayes import MultinomialNB
from sklearn import metrics

df = pd.read_csv(r"C:\Users\johnrambo\Downloads\fake_news(1).csv", sep=',', header=0, engine='python', escapechar='\\')

X_train, X_test, y_train, y_test = train_test_split(df['headline'], is_sarcastic_1, test_size = 0.2, random_state = 7)

clf = MultinomialNB().fit(X_train, y_train)

predicted = clf.predict(X_test)

print("MultinomialNB Accuracy:", metrics.accuracy_score(y_test, predicted))

<小时/>

错误

ValueError                                Traceback (most recent call last)
<ipython-input-8-e1f11a702626> in <module>
     21 X_train, X_test, y_train, y_test = train_test_split(df['headline'], is_sarcastic_1, test_size = 0.2, random_state = 7)
     22 
---> 23 clf = MultinomialNB().fit(X_train, y_train)
     24 
     25 predicted = clf.predict(X_test)

~\Anaconda\lib\site-packages\sklearn\naive_bayes.py in fit(self, X, y, sample_weight)
    586         self : object
    587         """
--> 588         X, y = check_X_y(X, y, 'csr')
    589         _, n_features = X.shape
    590 

~\Anaconda\lib\site-packages\sklearn\utils\validation.py in check_X_y(X, y, accept_sparse, accept_large_sparse, dtype, order, copy, force_all_finite, ensure_2d, allow_nd, multi_output, ensure_min_samples, ensure_min_features, y_numeric, warn_on_dtype, estimator)
    717                     ensure_min_features=ensure_min_features,
    718                     warn_on_dtype=warn_on_dtype,
--> 719                     estimator=estimator)
    720     if multi_output:
    721         y = check_array(y, 'csr', force_all_finite=True, ensure_2d=False,

~\Anaconda\lib\site-packages\sklearn\utils\validation.py in check_array(array, accept_sparse, accept_large_sparse, dtype, order, copy, force_all_finite, ensure_2d, allow_nd, ensure_min_samples, ensure_min_features, warn_on_dtype, estimator)
    494             try:
    495                 warnings.simplefilter('error', ComplexWarning)
--> 496                 array = np.asarray(array, dtype=dtype, order=order)
    497             except ComplexWarning:
    498                 raise ValueError("Complex data not supported\n"

~\Anaconda\lib\site-packages\numpy\core\numeric.py in asarray(a, dtype, order)
    536 
    537     """
--> 538     return array(a, dtype, copy=False, order=order)
    539 
    540 

~\Anaconda\lib\site-packages\pandas\core\series.py in __array__(self, dtype)
    946             warnings.warn(msg, FutureWarning, stacklevel=3)
    947             dtype = "M8[ns]"
--> 948         return np.asarray(self.array, dtype)
    949 
    950     # ----------------------------------------------------------------------

~\Anaconda\lib\site-packages\numpy\core\numeric.py in asarray(a, dtype, order)
    536 
    537     """
--> 538     return array(a, dtype, copy=False, order=order)
    539 
    540 

~\Anaconda\lib\site-packages\pandas\core\arrays\numpy_.py in __array__(self, dtype)
    164 
    165     def __array__(self, dtype=None):
--> 166         return np.asarray(self._ndarray, dtype=dtype)
    167 
    168     _HANDLED_TYPES = (np.ndarray, numbers.Number)

~\Anaconda\lib\site-packages\numpy\core\numeric.py in asarray(a, dtype, order)
    536 
    537     """
--> 538     return array(a, dtype, copy=False, order=order)
    539 
    540 

ValueError: could not convert string to float: 'experts caution new car loses 90% of value as soon as you drive it off cliff'

<小时/>

前几行数据

Excel file: fake news

这是我输入 df.head().to_dict() 时得到的结果:

{'is_sarcastic': {0: 1, 1: 0, 2: 0, 3: 1, 4: 1}, 'headline': {0: '三十多岁的科学家揭开了脱发的末日时钟', 1:他们的代表。完全阐明了为什么国会在性别、种族平等方面未能做到这一点”， 2:“吃你的蔬菜:9种美味不同的食谱”， 3:“恶劣的天气使骗子无法上类”， 4:“妈妈已经非常接近正确使用‘流媒体’这个词了”}， 'article_link': {0: ' https://www.theonion.com/thirtysomething-scientists-unveil-doomsday-clock-of-hai-1819586205 ', 1:'https://www.huffingtonpost.com/entry/donna-edwards-inequality_us_57455f7fe4b055bb1170b207 ', 2:'https://www.huffingtonpost.com/entry/eat-your-veggies-9-delici_b_8899742.html ', 3:'https://local.theonion.com/inclement-weather-prevents-liar-from-getting-to-work-1819576031 ', 4:'https://www.theonion.com/mother-comes-pretty-close-to-using-word-streaming-cor-1819575546 '}}

最佳答案

我想您的 df['headline'] 列中有文本数据，您需要执行几个步骤，首先将文本数据转换为基于数字的格式，然后将其传递给机器学习模型处理。

您可能需要引用 sklearn 的 CountVectorizer 和 TfidfTransformer here

关于python - 如何更改我的代码以使字符串不更改为 float ，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/59273860/

25

4

0

文章推荐： java - Swing:单击第一个 JPanel 时启动第二个 JPanel

文章推荐： java - 通过强制转换调用 Animal 类的 methodS

文章推荐： python - 组织 Python 类(class)的最佳方式是什么？

详解dedecms后台编辑器将回车改为的方法
CFSDN坚持开源创造价值，我们致力于搭建一个资源共享平台，让每一个IT人在这里找到属于你的精彩世界. 这篇CFSDN的博客文章详解dedecms后台编辑器将回车改为的方法由作者收集整理，如果你对
javascript - 如何将改为
？
关闭。此题需要details or clarity 。目前不接受答案。想要改进这个问题吗？通过 editing this post 添加详细信息并澄清问题. 已关闭 6 年前。 Improve th
javascript - 改为 JS 文件中的谷歌分析？
不是将代码放在正文的头部或末尾(我把它放在正文的末尾)，如果我将代码放在 JS 文件中而不是在 html 中它自己的脚本标记，是否可以？ (我假设它像任何其他代码一样工作正常，但我问以防万一) 最佳答
postgresql - PSQL 命令行结束从 => 改为 (>
我尝试执行从\e 命令编写的查询，但现在我无法执行任何查询，但可以在 PSQL 中执行命令。现在我注意到这一点，我输入的命令现在在\e 中。当我关闭\e(尝试运行它)时问题开始了。最佳答案 ps
Java字符串替换(去掉换行符，将$改为\$)
我有一个这样的字符串($ 字符总是被其他字符包围): a$b c$d e$f 我希望我的字符串方法在 $ 前面放置一个 \ 并删除换行符: a\$bc\$de\$f 我试过了，但它没有放入 \ 字符:
java - 如何防止Java将XML文件中的 "&"改为 "&"
我需要使用 Java 构建一个 XML 文件。问题是我必须使用一些特殊字符，例如“ć”，然后在我的移动应用程序中读取它。如果我手动更改 ć 就可以正常工作至 ć在我的 XML 文件中的记事
javascript - 将改为 useMutation
我有一个removeUser 页面，我在其中使用，然后使用submitForm() 函数进行错误处理。这段代码运行得非常好: export default function RemoveUserPag
mysql - SQL查询将前面两个数字从 "2048"改为 "1948"
我在数据库 “2048-05-21” 中有一个看起来像这样的日期我只想得到年份，在这一年我只想得到两个后面的数字并将两个前面的数字更改为19 example: data : 2048-05-21 1
JAVA- 将 111 改为 123
public class Venus1 { public static void main(String args[]) { int[]x={1,2,3};
javascript - 改为 JavaScript 的 array_count_values
我有以下 PHP 脚本，现在我需要在 JavaScript 中做同样的事情。 JavaScript 中是否有类似于 PHP 函数的函数，我已经搜索了好几天但找不到类似的东西？我想做的是计算某个单词在数
c++ - #define 改为 constexpr auto
这个问题在这里已经有了答案: Is it bad practice to specify an array size using a variable instead of `#define` in
android - 如何将列表中的所有复选框 'check' 改为 'check' 单个复选框？
我陷入了一种情况，我必须通过“选中”工具栏中的复选框来“选中”列表中存在的所有复选框。这是创建复选框列表的代码:- itemTpl: 'checked="checked" /> {groupName
python - Python 改为 UTF-8 后出现奇怪的字符
我正在使用Python3。在分析一些网站时，我遇到了一些奇怪的字符并寻找解决方案。我找到了一个，但在找到解决方案之前，我尝试了一些方法，并且知道我无法重置它。当我使用 Jupyter 笔记本将列表 l
unity-game-engine - UnityWebRequest 改为 https
我在 http 下有 unity android app 和 site api 的工作基础设施。最近换了服务器，申请了ssl证书。现在我的 api 在 https 下。在 unity 应用程序中，
unity-game-engine - UnityWebRequest 改为 https
我在 http 下有 unity android app 和 site api 的工作基础设施。最近换了服务器，申请了ssl证书。现在我的 api 在 https 下。在 unity 应用程序中，
ios - objective-C 。 NSString 改为 NSCFNumber
我在 Objective-C 中有一些代码。我想，我收到了 NSString 类型，但是当我尝试将它保存在核心数据中时，我得到了一个 user.clientID = clientID; 错误，例如:
php - 将 Varchar 改为 Time 类型
在表中我有一个名为 CallTime 的字段 (Varchar)。包括晚上8:00、晚上8:40、上午10:00等时间我想将字段类型更改为“时间”并更新时间格式。该怎么做？谢谢最佳答案 UPD
c# - 为什么将 int 改为 long 会加快执行速度？
这个问题在这里已经有了答案: C# - for Loop Freezes at strange intervals (3 个答案) 关闭 6 年前。我试图解决 problem #14 from P
list - 预期为 'collections.Iterable' ，改为 Optional[list]
我今天在 Pycharm 社区版 5.0.3 中收到了这个错误，想知道这是否只是我做错了/没有意识到，或者是 PyCharm lint 问题。重现错误的代码是 mylist = list() # fi
java - 如何在页面上显示实体？改为 [Entity of type ... with id : 1]
我的目标是将数据库中的随机文本显示到网页上。首先，我不知道为什么我的数据没有保存，为什么我得到的是[Entity of type sec.helloweb.HelloMessage with id:

首页

博学

6Ren·AI

商城

python - 如何更改我的代码以使字符串不更改为 float