Python: detecting characters with Tesseract OCR and drawing blobs (boxes) around text with pytesseract

Reposted by: 太空宇宙 · Updated: 2023-11-03 21:18:58

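I am working with Tesseract and have created a box file for the characters in an image. Running tesseract on the command line, it detected the characters, and the box file lists the position of each detected character.

Here is the command-line output.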

/Desktop $ tesseract spa.arial.first_page.tif spa.arial.box nobatch box.train .stderr
read_params_file: Can't open .stderr
Tesseract Open Source OCR Engine v4.0.0-146-gc39a with Leptonica
Page 1
Detected 74 diacritics
row xheight=2, but median xheight = 17.4815
row xheight=2.5, but median xheight = 17.4815
row xheight=91, but median xheight = 17.4815
row xheight=2.5, but median xheight = 17.4815
row xheight=3, but median xheight = 17.4815
row xheight=61.875, but median xheight = 17.4815
row xheight=23, but median xheight = 17.4815
row xheight=3, but median xheight = 17.4815
row xheight=3, but median xheight = 17.4815
row xheight=12.8333, but median xheight = 17.4815
row xheight=15.1282, but median xheight = 17.4815
row xheight=3.5, but median xheight = 17.4815
row xheight=3.5, but median xheight = 17.4815
row xheight=3.5, but median xheight = 17.4815
row xheight=628, but median xheight = 17.4815
row xheight=415.5, but median xheight = 17.4815
row xheight=4, but median xheight = 17.4815
row xheight=630, but median xheight = 17.4815
FAIL!
APPLY_BOXES: boxfile line 7/A ((286,1979),(325,2002)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 11/U ((199,1943),(239,1967)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 14/R ((298,1943),(323,1967)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 16/M ((325,1943),(360,1967)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 1611/a ((849,451),(875,480)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 1617/5 ((947,457),(973,480)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 1622/. ((1038,457),(1042,460)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 1839/a ((679,280),(705,303)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1860/u ((1030,274),(1063,304)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 1865/p ((1113,274),(1133,304)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 1876/a ((1303,275),(1329,302)): FAILURE! Couldn't find a matching blob
FAIL!
APPLY_BOXES: boxfile line 1879/, ((1362,275),(1365,282)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1886/c ((1467,278),(1494,301)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1889/d ((1542,277),(1551,300)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1892/h ((1569,277),(1595,300)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1895/c ((619,245),(645,268)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1910/n ((888,245),(920,262)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1911/l ((941,245),(949,267)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1913/e ((981,239),(997,267)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: Unlabelled word at :Bounding box=(133,887)->(1631,893)
APPLY_BOXES: Unlabelled word at :Bounding box=(132,569)->(1631,575)
APPLY_BOXES: Unlabelled word at :Bounding box=(132,484)->(1631,491)
APPLY_BOXES: Unlabelled word at :Bounding box=(1408,418)->(1470,479)
APPLY_BOXES: Unlabelled word at :Bounding box=(132,413)->(1630,420)
APPLY_BOXES: Unlabelled word at :Bounding box=(1238,346)->(1415,400)
APPLY_BOXES: Unlabelled word at :Bounding box=(1408,359)->(1476,425)
APPLY_BOXES: Unlabelled word at :Bounding box=(133,341)->(1628,348)
APPLY_BOXES: Unlabelled word at :Bounding box=(133,205)->(137,1461)
APPLY_BOXES: Unlabelled word at :Bounding box=(598,203)->(602,1034)
APPLY_BOXES: Unlabelled word at :Bounding box=(133,200)->(1629,208)
APPLY_BOXES: Unlabelled word at :Bounding box=(1628,200)->(1633,1460)
   Found 1698 good blobs.
   Leaving 59 unlabelled blobs in 0 words.
   21 remaining unlabelled words deleted.
Generated training data for 353 words
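
The boxes that failed to match a blob can be pulled out of a log like the one above, so they can later be drawn in a different color from the good ones. The following is a minimal sketch, not part of the original question: the function name `parse_failed_boxes` and the `sample_log` string are made up for illustration, and it assumes the `APPLY_BOXES ... FAILURE!` lines always have the layout shown in this log.

```python
import re

# Matches lines such as:
# APPLY_BOXES: boxfile line 7/A ((286,1979),(325,2002)): FAILURE! Couldn't find a matching blob
# Assumption: the layout is exactly the one seen in the log above.
FAILURE_RE = re.compile(
    r"APPLY_BOXES: boxfile line (\d+)/(\S+) "
    r"\(\((\d+),(\d+)\),\((\d+),(\d+)\)\): FAILURE!"
)


def parse_failed_boxes(log_text):
    """Return (line_no, char, x1, y1, x2, y2) for every failed box in the log."""
    failures = []
    for match in FAILURE_RE.finditer(log_text):
        line_no, char, x1, y1, x2, y2 = match.groups()
        failures.append((int(line_no), char, int(x1), int(y1), int(x2), int(y2)))
    return failures


# Hypothetical excerpt built from lines of the log above.
sample_log = """\
FAIL!
APPLY_BOXES: boxfile line 7/A ((286,1979),(325,2002)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: boxfile line 1622/. ((1038,457),(1042,460)): FAILURE! Couldn't find a matching blob
APPLY_BOXES: Unlabelled word at :Bounding box=(133,887)->(1631,893)
"""

for failure in parse_failed_boxes(sample_log):
    print(failure)
```

The "Unlabelled word" lines are deliberately ignored here; they use a different layout and would need their own pattern.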

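I want to draw a blob (box) around each detected character. I have searched but could not find a reference. Can anyone help me draw the blobs on the image that the box file was created from?

I tried the Python code below, which uses pytesseract to draw blobs around the text: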

import cv2
import pytesseract

file = '/home/Desktop/second_page.png'

img = cv2.imread(file)
h, w, _ = img.shape

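# One line per character: "<char> <x1> <y1> <x2> <y2> ..."
boxes = pytesseract.image_to_boxes(img)

for b in boxes.splitlines():
    b = b.split(' ')
    # Tesseract measures y from the bottom of the image, OpenCV from the top
    img = cv2.rectangle(img, (int(b[1]), h - int(b[2])), (int(b[3]), h - int(b[4])), (0, 255, 0), 2)

# The original passed an undefined name (filename) as the window title
cv2.imshow(file, img)
cv2.waitKey(0)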

The output I got:

Best answer

For a full understanding, see https://www.pyimagesearch.com/2017/07/10/using-tesseract-ocr-python/

import csv
import cv2
from pytesseract import pytesseract as pt

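# Note: the boxes= keyword belongs to older pytesseract releases;
# newer releases provide image_to_boxes() for the same purpose.
pt.run_tesseract('bw.png', 'output', lang=None, boxes=True, config="hocr")

# To read the coordinates
boxes = []
# Text mode is required for csv.reader on Python 3 ('rb' only worked on Python 2)
with open('output.box', 'r', newline='') as f:
    reader = csv.reader(f, delimiter=' ')
    for row in reader:
        # A box line has 6 fields: char, x1, y1, x2, y2, page
        if len(row) == 6:
            boxes.append(row)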

# Draw the bounding box
img = cv2.imread('bw.png')
h, w, _ = img.shape
for b in boxes:
    img = cv2.rectangle(img,(int(b[1]),h-int(b[2])),(int(b[3]),h-int(b[4])),(255,0,0),2)

cv2.imshow('output',img)
cv2.waitKey(0)  # keep the window open until a key is pressed

For the rest, I suggest going through the link above for a better understanding.
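
For reference, the read-then-draw steps of the answer can also be written without `csv`, in a form that runs on Python 3. This is a minimal sketch under stated assumptions, not the answerer's code: the names `parse_box_lines` and `draw_box_file` are hypothetical, each box line is assumed to have the six fields `char left bottom right top page`, and OpenCV is imported only inside the drawing function so the parser can be tried on its own.

```python
def parse_box_lines(lines):
    """Parse Tesseract box lines of the form '<char> <left> <bottom> <right> <top> <page>'."""
    boxes = []
    for line in lines:
        # Split from the right so the character field stays intact.
        parts = line.rstrip("\n").rsplit(" ", 5)
        if len(parts) != 6:
            continue  # skip blank or malformed lines
        char = parts[0]
        try:
            left, bottom, right, top, page = (int(v) for v in parts[1:])
        except ValueError:
            continue  # skip lines whose coordinates are not integers
        boxes.append((char, left, bottom, right, top, page))
    return boxes


def draw_box_file(image_path, box_path, out_path):
    """Draw every box from box_path onto the image and save the result to out_path."""
    import cv2  # imported lazily; only needed for drawing

    img = cv2.imread(image_path)
    if img is None:
        raise FileNotFoundError(image_path)
    height = img.shape[0]
    with open(box_path, encoding="utf-8") as f:
        boxes = parse_box_lines(f)
    for _char, left, bottom, right, top, _page in boxes:
        # Box files measure y from the bottom edge; OpenCV measures from the top.
        cv2.rectangle(img, (left, height - top), (right, height - bottom), (255, 0, 0), 2)
    cv2.imwrite(out_path, img)
    return len(boxes)


# Parser demo on a made-up line; no image or OpenCV needed for this part.
print(parse_box_lines(["A 286 1979 325 2002 0\n"]))
```

Writing the result with `cv2.imwrite` instead of `cv2.imshow` is a design choice for this sketch: it also works on machines without a display.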

Use pytesseract's image_to_boxes method to get the characters in an image and their positions:

ret = pt.image_to_boxes(pil_img)

The result is a string in which each line holds one character followed by its position (x1, y1, x2, y2), separated by spaces:

a 30 10 45 20
b 49 12 54 23
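
Because Tesseract box coordinates are measured from the bottom-left corner of the image while OpenCV draws from the top-left, each line has to be flipped vertically before it can be passed to `cv2.rectangle`. A minimal sketch of that conversion follows; the helper name `boxes_to_rectangles` and the image height of 100 are assumptions for illustration only. Any extra trailing field, such as a page number, is ignored.

```python
def boxes_to_rectangles(boxes_text, image_height):
    """Convert image_to_boxes-style text into (char, top_left, bottom_right) tuples.

    The returned corners use a top-left origin, as expected by cv2.rectangle.
    """
    rects = []
    for line in boxes_text.splitlines():
        fields = line.split(" ")
        if len(fields) < 5:
            continue  # not a complete box line
        char = fields[0]
        x1, y1, x2, y2 = (int(v) for v in fields[1:5])
        # Flip the y axis: the box's top edge (y2) becomes the smaller y value.
        rects.append((char, (x1, image_height - y2), (x2, image_height - y1)))
    return rects


# The two sample lines from above, with a made-up image height of 100 pixels.
sample = "a 30 10 45 20\nb 49 12 54 23"
for char, top_left, bottom_right in boxes_to_rectangles(sample, image_height=100):
    print(char, top_left, bottom_right)
```

Each returned pair can be passed directly as the two corner arguments of `cv2.rectangle`.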

On the topic of detecting characters with Tesseract OCR in Python and drawing blobs around text with pytesseract, a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/54455372/
