python - stat : path should be string, 字节，os.PathLike 或整数，不是 NoneType

python - stat : path should be string, 字节，os.PathLike 或整数，不是 NoneType - refextract

转载作者：行者123 更新时间：2023-12-05 05:01:46

27

4

在我的 Python 项目中，我尝试使用 refextract从 pdf 文件解析一些数据，但我无法使用它的 extract_references_from_file 功能。

我正在使用网站上提到的示例代码:

from refextract import extract_references_from_file
references = extract_references_from_file('C02-1025.pdf')
print(references[0])

出现这个错误；

TypeError: stat: path should be string, bytes, os.PathLike or integer, not NoneType

我已经尝试过不同的方式来像这样传递文件路径；

references = extract_references_from_file(r"F:\project\python\C02-1025.pdf")

references = extract_references_from_file("F:\\project\\python\\C02-1025.pdf")

但没有任何效果。

我使用的是 Python 3.7.2，它是 64 位的。

这是错误的完整回溯:

Traceback (most recent call last):
  File "refext.py", line 16, in <module>
    references = extract_references_from_file(r"F:\project\python\C02-1025.pdf")
  File "C:\Users\Username\AppData\Local\Programs\Python\Python37\lib\site-packages\refextract\references\api.py", line 128, in extract_references_from_file
    docbody = get_plaintext_document_body(path)
  File "C:\Users\Username\AppData\Local\Programs\Python\Python37\lib\site-packages\refextract\references\engine.py", line 1412, in get_plaintext_document_body
    textbody = convert_PDF_to_plaintext(fpath, keep_layout)
  File "C:\Users\Username\AppData\Local\Programs\Python\Python37\lib\site-packages\refextract\documents\pdf.py", line 457, in convert_PDF_to_plaintext
    if not os.path.isfile(CFG_PATH_PDFTOTEXT):
  File "C:\Users\Username\AppData\Local\Programs\Python\Python37\lib\genericpath.py", line 30, in isfile
    st = os.stat(path)
TypeError: stat: path should be string, bytes, os.PathLike or integer, not NoneType

refextract 库依赖于 pdftotext命令行实用程序。但是当我尝试安装它时

pip install pdftotext

它给了我这个错误

 ERROR: Command errored out with exit status 1:
 command: 'c:\users\usernamem\appdata\local\programs\python\python37\python.exe' -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\USER~1\\AppData\\Local\\Temp\\pip-install-l_9a5zt6\\pdftotext\\setup.py'"'"'; __file__='"'"'C:\\Users\\USER~1\\AppData\\Local\\Temp\\pip-install-l_9a5zt6\\pdftotext\\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' install --record 'C:\Users\USER~1\AppData\Local\Temp\pip-record-gpha3woc\install-record.txt' --single-version-externally-managed --compile --install-headers 'c:\users\username\appdata\local\programs\python\python37\Include\pdftotext'
     cwd: C:\Users\USER~1\AppData\Local\Temp\pip-install-l_9a5zt6\pdftotext\
Complete output (11 lines):
WARNING: pkg-config not found--guessing at poppler version.
         If the build fails, install pkg-config and try again.
running install
running build
running build_ext
building 'pdftotext' extension
creating build
creating build\temp.win-amd64-3.7
creating build\temp.win-amd64-3.7\Release
cl.exe /c /nologo /Ox /W3 /GL /DNDEBUG /MT -DPOPPLER_CPP_AT_LEAST_0_30_0=1 "-Ic:\users\username\appdata\local\programs\python\python37\include" "-Ic:\users\username\appdata\local\programs\python\python37\include" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.17134.0\shared" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.17134.0\um" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.17134.0\winrt" "-IC:\Program Files (x86)\Windows Kits\10\include\10.0.17134.0\ucrt" "-IC:\Program Files (x86)\Windows Kits\NETFXSDK\4.6.1\include\um" /EHsc /Tppdftotext.cpp /Fobuild\temp.win-amd64-3.7\Release\pdftotext.obj -Wall
error: command 'cl.exe' failed: No such file or directory
----------------------------------------
ERROR: Command errored out with exit status 1: 'c:\users\username\appdata\local\programs\python\python37\python.exe' -u -c 'import sys, setuptools, tokenize; sys.argv[0] = '"'"'C:\\Users\\USER~1\\AppData\\Local\\Temp\\pip-install-l_9a5zt6\\pdftotext\\setup.py'"'"'; __file__='"'"'C:\\Users\\USER~1\\AppData\\Local\\Temp\\pip-install-l_9a5zt6\\pdftotext\\setup.py'"'"';f=getattr(tokenize, '"'"'open'"'"', open)(__file__);code=f.read().replace('"'"'\r\n'"'"', '"'"'\n'"'"');f.close();exec(compile(code, __file__, '"'"'exec'"'"'))' install --record 'C:\Users\USER~1\AppData\Local\Temp\pip-record-gpha3woc\install-record.txt' --single-version-externally-managed --compile --install-headers 'c:\users\username\appdata\local\programs\python\python37\Include\pdftotext' Check the logs for full command output.

最佳答案

您正在使用的 refextract 库依赖于 pdftotext 命令行实用程序。目前在您的系统上找不到该程序，这会导致您描述的错误。可能错误如此含糊不清。有一些代码试图提供更好的错误消息，但在这种情况下不起作用。

在 Linux 上，pdftotext 通常由您的发行版提供。在 Windows 上，您通常需要自己安装它。它来自 Xpdf tools package .您需要将可执行文件安装在系统的 PATH 中的某处，或者您需要通过设置环境变量 CFG_PATH_PDFTOTEXT 将 refextract 指向程序的位置.

关于python - stat : path should be string, 字节，os.PathLike 或整数，不是 NoneType - refextract，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/62603602/

27

4

0

文章推荐： java - Maven 解析的版本与传递依赖中的版本不匹配

文章推荐： scheduler - 每个月第一个工作日的系统计时器

文章推荐： azure-devops - Azure 管道中部署作业中的条件批准门

python - * : 'NoneType' and 'NoneType' (Python) 不支持的操作数类型
我目前正在尝试定义函数，但遇到了这个错误。我只是想做一个简单的函数，用户输入 2 个数字，然后将它们相乘。也请尽可能简单地解释我做错了什么。 (我是菜鸟) def userinput(): w
Python-类型错误 : unorderable types: NoneType() < NoneType()
使用IPtools python 包我试图查看 IP 地址是否在特定范围内。这是我的代码: for line in g: org= line.split("|")[0] ranges
Python:无法摆脱 TypeError:+ 不支持的操作数类型: 'NoneType' 和 'NoneType'
输入 [['1','2','3'],['a','b','c'],['6','7','8'],['e','f','g']] 输出应该是: 1, 2, 3a, b, c6, 7, 8e, f, g Cod
Python:使用 lambda 时，& 不支持的操作数类型: 'NoneType' 和 'NoneType'
我目前正在使用 lambda 使 tkinter 按钮依次执行两件事: def classManip(): cManip = tk.Toplevel() cManip.title
python - 我不断收到 "TypeError: unsupported operand type(s) for -: ' NoneType' 和 'NoneType' ”
我正在学习Python，作为练习，我编写了一些代码来查找用户定义函数的导数。代码如下。 def fx(value, function): x = value return eval(f
python - 类型错误 : 'NoneType' object is not subscriptable followed by AttributeError: 'NoneType' object has no attribute 'split'
使用 Django。我有以下模型: class Postagem(models.Model): id = models.AutoField(primary_key=True, editable=Fal
python-2.7 - feature_importances_ 在 ExtraTreesClassifier :TypeError: 'NoneType' object is not iterable 中显示为 NoneType
我正在尝试为给定的数据集选择重要的特征(或者至少了解哪些特征解释更多的变异性)。为此，我使用 ExtraTreesClassifier 和 GradientBoostingRegressor - 然后
Diffusers SDXL "TypeError: argument of type 'NoneType' is not iterable"(Differs SDXL“TypeError：‘NoneType’类型的参数不可迭代”)
刚刚获得了SDXL模型的访问权限，希望为即将发布的版本进行测试...不幸的是，我们当前用于我们服务的代码似乎不能与稳定ai/稳定-扩散-xl-base-0.9一起工作，我不完全确定SDXL有什么不同，
What causes `None` results from BeautifulSoup functions? How can I avoid "AttributeError: 'NoneType' object has no attribute..." with BeautifulSoup?(是什么原因导致了BeautifulSoup函数的`None‘结果？如何避免“AttributeError：‘NoneType’对象没有属性...”配上美人汤？)
通常，当我尝试使用BeautifulSoup解析网页时，BeautifulSoup函数会得到NONE结果，否则就会引发AttributeError。。以下是一些独立的(即，由于数据是硬编码的，不需要访
What causes `None` results from BeautifulSoup functions? How can I avoid "AttributeError: 'NoneType' object has no attribute..." with BeautifulSoup?(是什么原因导致了BeautifulSoup函数的`None‘结果？如何避免“AttributeError：‘NoneType’对象没有属性...”配上美人汤？)
通常，当我尝试使用BeautifulSoup解析网页时，BeautifulSoup函数会得到NONE结果，否则就会引发AttributeError。。以下是一些独立的(即，由于数据是硬编码的，不需要访
python - 检查可迭代列表中的 NoneTypes
我想遍历可迭代列表，但要求某些元素的类型可以是 None。这看起来像这样: none_list = [None, [0, 1]] for x, y in none_list: print("
Python: 'NoneType' 对象不可在非空查询结果上订阅
我得到object is not subscriptable在非空查询结果上。当我打印时 c.fetchone()它打印了正确的结果，但是当我检查类型时它显示 import sqlite3 conn
python - NoneType 即使定义了属性也会出错？
我在第 15 行收到此错误，但我不明白为什么。有任何想法吗？看来属性已经明确定义了，所以我完全不知所措。任何帮助将非常感激。AttributeError:“NoneType”对象没有属性“Sheets
python - 子类化对象导致 NoneType
我尝试对 Chrome WebDriver 进行子类化以包含一些初始化和清理代码，但随后 Python 提示创建的对象设置为 None: import glob import selenium imp
Python: 'NoneType' 对象不可下标'错误
这个问题已经有答案了: Why do I get AttributeError: 'NoneType' object has no attribute 'something'? (10 个回答) 已关
python - 为什么返回 Nonetype？
这个问题已经有答案了: Why does the print function return None? (1 个回答) 已关闭 6 年前。我对 Python 还很陌生。我正在制作一个生成器，可以为
Python 在解析表时处理 NoneType
我正在尝试比较两个表( table_a 和 table_b )并减去 table_a 的最后一列从table_b的最后一列开始。但是，table_a 包含一个额外的行，导致我得到 NoneType错误
python NoneType 对象不可迭代
当“文件名”是一个存在的文件时，这段代码运行良好……但是当它不存在时……我不断收到同样的错误:TypeError: 'NoneType' 对象不可迭代 (Errno 2) 尽管我从不迭代任何东西，除非
python - NoneType 对象不可迭代
我在下面的代码中收到“NoneType”对象不可迭代的 TypeError。下面的代码用于使用 pyautogui 滚动 digits 文件夹中的 10 张图像(命名为 0 到 9，以图像中的 # 命
Python 将模块名称视为 'NoneType'
我有一段代码表现得很奇怪。一开始，我导入了一个模块，它是 C 库的 python 绑定(bind)。 try: import pyccn except: print "ERROR:

首页

博学

6Ren·AI

商城

python - stat : path should be string, 字节，os.PathLike 或整数，不是 NoneType - refextract