python-2.7 - 无法在 Mac 上安装 Tesseract-OCR-6ren

python-2.7 - 无法在 Mac 上安装 Tesseract-OCR

转载作者：行者123 更新时间：2023-12-05 04:07:11

29

4

我正在尝试使用 pytesseract 在 python 2.7.14 中制作一个 OCR 程序。当我运行我的代码时:

from PIL import Image
import pytesseract

print(pytesseract.image_to_string(Image.open('test.png')))

我得到了错误:

IOError: [Errno 2] No such file or directory: 'test.png'

找了很多地方，好像需要安装tesseract-ocr。我跑了:

pip install tesseract-ocr

但是我得到了错误:

收集tesseract-ocr 使用缓存的 tesseract-ocr-0.0.1.tar.gz要求已经满足:/Library/Frameworks/Python.framework/Versions/2.7/lib/python2.7/site-packages 中的 cython(来自 tesseract-ocr)安装收集的包:tesseract-ocr 为 tesseract-ocr 运行 setup.py install ... 错误命令/Library/Frameworks/Python.framework/Versions/2.7/Resources/Python.app/Contents/MacOS/Python -u -c "import setuptools, tokenize;file='/private 的完整输出/var/folders/rd/lf95py7d7s3dkzft38jh3m8h0000gn/T/pip-build-DTR_fL/tesseract-ocr/setup.py';f=getattr(tokenize, 'open', open)(文件);代码=f.read().replace('\r\n', '\n');f.close();exec(compile(code, file, 'exec'))"安装--record/var/folders/rd/lf95py7d7s3dkzft38jh3m8h0000gn/T/pip-U3OoHi-record/install-record.txt --single-version-externally-managed --compile: 运行安装运行构建运行 build_py 找不到文件 tesseract_ocr.py(用于模块 tesseract_ocr) 找不到文件 tesseract_ocr.py(用于模块 tesseract_ocr) 运行 build_ext 构建“tesseract_ocr”扩展创建构建创建 build/temp.macosx-10.6-intel-2.7 /usr/bin/clang -fno-strict-aliasing -fno-common -dynamic -arch i386 -arch x86_64 -g -DNDEBUG -g -fwrapv -O3 -Wall -Wstrict-prototypes -I/Library/Frameworks/Python.framework/Versions/2.7/include/python2.7 -c tesseract_ocr.cpp -o build/temp.macosx-10.6-intel-2.7/tesseract_ocr.o tesseract_ocr.cpp:558:10: fatal error :找不到“leptonica/allheaders.h”文件 #include "leptonica/allheaders.h" ^ 产生 1 个错误。错误:命令“/usr/bin/clang”失败，退出状态为 1

命令 "/Library/Frameworks/Python.framework/Versions/2.7/Resources/Python.app/Contents/MacOS/Python -u -c "import setuptools, tokenize;file='/private/var/folders/rd/lf95py7d7s3dkzft38jh3m8h0000gn/T/pip-build-DTR_fL/tesseract-ocr/setup.py';f=getattr(tokenize, 'open', open)(文件); code=f.read().replace('\r\n', '\n');f.close();exec(compile(code, file, 'exec'))"install --record/var/folders/rd/lf95py7d7s3dkzft38jh3m8h0000gn/T/pip-U3OoHi-record/install-record.txt --single-version-externally-managed --compile"失败，错误代码为 1 in/private/var/文件夹/rd/lf95py7d7s3dkzft38jh3m8h0000gn/T/pip-build-DTR_fL/tesseract-ocr/

无论如何我可以解决这个问题吗？(当我尝试其他安装方式时出现同样的错误，比如 sudo)

最佳答案

您需要安装 tesseract 本身。

正如文档中所说 https://github.com/tesseract-ocr/tesseract/wiki#homebrew

    brew install tesseract

此外，我还建议通过 brew 安装 python。这样你就不会污染系统 python 安装。

最好也使用 virtualenv。

关于python-2.7 - 无法在 Mac 上安装 Tesseract-OCR，我们在Stack Overflow上找到一个类似的问题： https://stackoverflow.com/questions/48875597/

29

4

0

文章推荐： webassembly - 需要一个 WebAssembly 的 "br_table"指令的例子

文章推荐： paypal - PayPal API 中日期的正确格式

文章推荐： vim - 如何让 vim-plug 识别本地项目？

文章推荐： php - Laravel - 干预图像 - 图像源不可读

tesseract - tesseract 的最小图像大小
我正在尝试使用 tesseract 来识别图像中的字符。这个图像是通过从另一个图像中抓取字符并将它们放在新图像中的一行来创建的。问题是，当我将这张图片提供给 tesseract 时，它在输出中没有显示
tesseract - 为新字体训练 Tesseract
使用创建聚类数据时 mftraining -F font_properties -U unicharset -O lan.unicharset *.tr 我收到以下消息 C:\Users\ \AppD
tesseract - 遇到问题训练 tesseract
我是 tesseract 的新手，对 github 页面中的不同目录有点困惑。 tesserac-ocr 代码库是我安装的。在/usr/local/share/tessdata/中安装了一个 tess
tesseract - Tesseract OCR 执行哪些预处理操作？
我找不到详细的文档，也没有浏览源代码的感觉。例如，如果 Tesseract 引擎已经完成，我不想重做精明的边缘检测。最佳答案本文档提供了引擎的概述:https://github.com/tesse
tesseract - 使用 Tesseract 检测黑色背景上的白色字符
我对 Tesseract OCR 完全陌生。这个问题可能很简单，但我似乎无法使用 Google 找到答案。基本上，我有一个包含两部分的图像:第一部分位于图像的顶部，黑色背景和白色文本；第二部分位于图
tesseract - Tesseract API 文档在哪里？
我浏览了整个 Google 代码网站，但没有找到任何从 API 角度解释如何使用 Tesseract 的内容。有人知道在哪里可以找到这个吗？最佳答案最新文档现已发布 here 和 here 。关
tesseract - 如何使用命令行获得对 tesseract 的字符明智的信心？
我可以通过命令行使用 tesseract 4.0 获得单词级别的置信度分数。有兴趣知道是否也有办法让角色自信。对于单词级别的置信度，使用以下命令: tesseract [Image name] ou
tesseract - 有没有办法将 tesseract 用于单个数字？
TL;DR It appears that tesseract cannot recognize images consisting of a single digit. Is there a wor
python - 没有这样的文件或目录 : 'tesseract' : 'tesseract' even though where to find tesseract is specified in pytesseract. py
所以我已经研究这个问题一段时间了，虽然其他人也有类似的问题，但对我来说没有任何作用: 我正在尝试将 pytesseract 用于项目，并将其安装在 User/Environments/testEnv/
tesseract - 无法使用 Tesseract 3.0.2.02 存档为 WIndows 编译 Tesseract API 示例
我正在考虑使用 Tesseract 来处理 PDF 文件，因此我想使用该库而不是外部可执行文件。我首先下载完整的 Tesseract 源代码并考虑构建它。遗憾的是，标准源没有任何方法可以在非 Lin
tesseract - 使用 Tesseract-OCR 获取识别字符的字体
是否可以使用 Tesseract-OCR 获取已识别字符的字体，即它们是 Arial 还是 Times New Roman，无论是从命令行还是使用 API。我正在扫描可能具有不同字体的不同部分的文档
tesseract - 强制 Tesseract 匹配模式(连续四位数字)
我试图让 Tesseract(使用 Tess4J 包装器)仅匹配特定模式。该模式是连续四位数字，我认为是\d\d\d\d。这是我正在提供 tesseract 的图像的一个非常小子集(平面图受到限制
tesseract - Tess4j Tesseract 与 Tesseract1
我不是 100% 确定 Java api 包 Tess4J 中 Tesseract 和 Tesseract1 对象的区别，任何人都可以解释一下吗？我知道 Tesseract 使用接口(interfa
tesseract - Hololens可以进行物体检测吗？或者如何在Hololens中使用YOLO/tensorflow/tesseract
我正在测试 Hololens 中的一些功能。想知道是否可以在 Hololens 中使用任何对象检测/文本识别功能？最佳答案 Hololens 1 本身不支持对象检测，您需要使用第三方代码。 Open
python-tesseract - 训练 Tesseract 来标记图标
我正在尝试为 Tesseract 4.0 创建训练数据来识别屏幕截图中的图标(例如，评论，分享，保存)。这是示例屏幕截图: 我想微调 Tesseract 以实现如下输出: 喜欢147 评论 29 已保
tesseract - 如何查找 Tesseract OCR 配置文件中支持的参数
我想知道Tesseract OCR使用的配置文件接受哪些参数，如何编写配置文件等我在 their site 上找不到任何有关此内容的文档。如何确定支持哪些参数及其含义？最佳答案 Tesseract
python-tesseract - 训练 Tesseract 来标记图标
我正在尝试为 Tesseract 4.0 创建训练数据来识别屏幕截图中的图标(例如，评论，分享，保存)。这是示例屏幕截图: 我想微调 Tesseract 以实现如下输出: 喜欢147 评论 29 已保
tesseract - 两个pip3包的区别: pytesseract vs tesseract
关闭。这个问题需要更多focused .它目前不接受答案。想改进这个问题吗？更新问题，使其只关注一个问题 editing this post . 关闭 6 年前。此帖于11个月前编辑提交审核，
tesseract - 如何在 python-tesseract 中设置 tessedit_write_images？
我正在尝试设置 tessedit_write_images但似乎做不到，我在任何地方都看不到 tessinput.tif 我正在做: import tesseract api = tesseract.
tesseract - `tesseract-ocr` 可以将结果写入 STDOUT 吗？
使用tesseract-ocr#3.02.02。 tesseract的基本用法是 tesseract sourc.png result 生成和result.txt。要获取结果文本，我必须 cat 这

首页

博学

6Ren·AI

商城

python-2.7 - 无法在 Mac 上安装 Tesseract-OCR