gpt4 book ai didi

matlab - 如何将图像转换为字符段?

转载 作者:太空宇宙 更新时间:2023-11-03 19:09:36 24 4
gpt4 key购买 nike

往往在OCR的过程中,一个图像文件本质上是被切割成片段,每个字符被识别为一个片段。例如,Unsegmented text as image

必须转换成类似的东西 Image in which text has been segmented and is ready for OCR

此外,是否有针对泰卢固语等亚洲语言的算法可用于此目的?如果不是,英语是怎么做到的?

最佳答案

使用 OpenCV 可以轻松完成。下面是一个示例代码:

import cv2
import numpy as np

# Load the image
img = cv2.imread('sof.png')

# convert to grayscale
gray = cv2.cvtColor(img,cv2.COLOR_BGR2GRAY)

# smooth the image to avoid noises
gray = cv2.medianBlur(gray,5)

# Apply adaptive threshold
thresh = cv2.adaptiveThreshold(gray,255,1,1,11,2)
thresh_color = cv2.cvtColor(thresh,cv2.COLOR_GRAY2BGR)

# apply some dilation and erosion to join the gaps
thresh = cv2.dilate(thresh,None,iterations = 3)
thresh = cv2.erode(thresh,None,iterations = 2)

# Find the contours
contours,hierarchy = cv2.findContours(thresh,cv2.RETR_LIST,cv2.CHAIN_APPROX_SIMPLE)

# For each contour, find the bounding rectangle and draw it
for cnt in contours:
x,y,w,h = cv2.boundingRect(cnt)
cv2.rectangle(img,(x,y),(x+w,y+h),(0,255,0),2)
cv2.rectangle(thresh_color,(x,y),(x+w,y+h),(0,255,0),2)

# Finally show the image
cv2.imshow('img',img)
cv2.imshow('res',thresh_color)
cv2.waitKey(0)
cv2.destroyAllWindows()

输出如下所示:

enter image description here enter image description here

关于matlab - 如何将图像转换为字符段?,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/10964226/

24 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com