Tesseract OCR having trouble detecting numbers

Reposted by: bug小助手 | Updated: 2023-10-25 17:16:30

I am trying to detect some numbers with Tesseract in Python. Below you will find my starting image and the result I can reduce it to. Here is the code I used to get there.

import pytesseract
import cv2
import numpy as np
pytesseract.pytesseract.tesseract_cmd = "C:\\Users\\choll\\AppData\\Local\\Programs\\Tesseract-OCR\\tesseract.exe"

image = cv2.imread(r'64normalwart.png')
lower = np.array([254, 254, 254])
upper = np.array([255, 255, 255])
image = cv2.inRange(image, lower, upper)
image = cv2.bitwise_not(image)
# Uses a language ('mc') that should work with Minecraft text; I have tried with and without it, no luck
text = pytesseract.image_to_string(image, lang='mc')
print(text)
cv2.imwrite("Wartthreshnew.jpg", image)
cv2.imshow("Image", image)
cv2.waitKey(0)

I end up with black numbers on a white background, which seems pretty good, but Tesseract still cannot detect the numbers. I also noticed that the numbers are quite jagged, but I don't know how to fix that. Does anyone have recommendations for how I could get Tesseract to recognize these numbers?

Starting Image

What I end up with

More replies

You could try cv2.blur() to smooth the rough edges of the numbers. It will make the image fuzzier overall but tesseract might have an easier time recognizing digits.

Thanks for the suggestion. The image might be too small, but Tesseract still can't read it.

Try to add config psm 6 or 7 like this: pytesseract.image_to_string(img, config='--psm 6')

Good idea. The solution I found was to use --psm 8, which treats the image as a single word, and to limit recognition to digits. stackoverflow.com/questions/44619077/… was a useful resource, for anyone who sees this in the future.
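The approach described in this comment (single-word mode plus a digits-only restriction) could look roughly like the sketch below. The helper names digits_only_config and read_digits are hypothetical and not from the thread, and the sketch assumes a Tesseract build that honours tessedit_char_whitelist (the LSTM engine in Tesseract 4.0 ignored it; 3.x and 4.1+ support it).

```python
def digits_only_config(psm=8):
    """Build a Tesseract config string: the given PSM plus a digits-only whitelist."""
    return f"--psm {psm} -c tessedit_char_whitelist=0123456789"


def read_digits(image, psm=8, ocr=None):
    """OCR `image` as digits only.

    `ocr` is any callable with the shape of pytesseract.image_to_string;
    it defaults to pytesseract itself and exists mainly to ease testing.
    """
    if ocr is None:
        import pytesseract  # lazy import: requires the Tesseract binary to be installed
        ocr = pytesseract.image_to_string
    return ocr(image, config=digits_only_config(psm)).strip()
```

With the thresholded image from the question, the call would be read_digits(image); whether it returns the expected digits still depends on the image quality.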

Recommended answers

Your problem is with the page segmentation mode (PSM). Tesseract segments an image differently depending on the mode. When you don't choose a PSM, it uses mode 3, which is fully automatic and might not be suitable for your case. I've just tried your image and it works perfectly with PSM 6.

df = pytesseract.image_to_string(np.array(image),lang='eng', config='--psm 6')

These are all the PSMs available at the moment:

  0    Orientation and script detection (OSD) only.
  1    Automatic page segmentation with OSD.
  2    Automatic page segmentation, but no OSD, or OCR.
  3    Fully automatic page segmentation, but no OSD. (Default)
  4    Assume a single column of text of variable sizes.
  5    Assume a single uniform block of vertically aligned text.
  6    Assume a single uniform block of text.
  7    Treat the image as a single text line.
  8    Treat the image as a single word.
  9    Treat the image as a single word in a circle.
 10    Treat the image as a single character.
 11    Sparse text. Find as much text as possible in no particular order.
 12    Sparse text with OSD.
 13    Raw line. Treat the image as a single text line,
            bypassing hacks that are Tesseract-specific.

Use pytesseract.image_to_string(img, config='--psm 8'), or try different configs to see whether the image gets recognized. A useful link: "Pytesseract OCR multiple config options".
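Trying different configs by hand gets tedious, so one option is to loop over a few page segmentation modes and compare the results. The sketch below is only an illustration: try_psm_modes is a hypothetical helper, and the default list of modes is an assumption about which ones are plausible for a short run of digits.

```python
def try_psm_modes(image, modes=(6, 7, 8, 10, 13), ocr=None):
    """Run OCR once per page segmentation mode and return {psm: stripped text}.

    `ocr` is any callable with the shape of pytesseract.image_to_string;
    it defaults to pytesseract itself.
    """
    if ocr is None:
        import pytesseract  # lazy import: requires the Tesseract binary to be installed
        ocr = pytesseract.image_to_string
    results = {}
    for psm in modes:
        # Each mode makes a different assumption about the page layout.
        results[psm] = ocr(image, config=f"--psm {psm}").strip()
    return results
```

Printing the returned dictionary for the thresholded image shows at a glance which modes, if any, produce a usable result.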

I think Tesseract blacklists digits by default, so I tried tessedit_char_whitelist to whitelist the characters I want, but it didn't work. I then un-blacklisted the digits with the config option tessedit_char_unblacklist=0123456789:

pytesseract.image_to_string(img, lang='eng', config='--psm 6 --oem 3 -c tessedit_char_unblacklist=0123456789')

More replies

Remember that Stack Overflow isn't just intended to solve the immediate problem, but also to help future readers find solutions to similar problems, which requires understanding the underlying code. This is especially important for members of our community who are beginners, and not familiar with the syntax. Given that, can you edit your answer to include an explanation of what you're doing and why you believe it is the best approach?
