image-processing - How to detect text regions in a document image?

Reposted by: 行者123. Last updated: 2023-11-30 09:01:49

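I have a document image, which might come from a newspaper or magazine, for example a scanned newspaper page. I want to remove all or most of the text and keep the images in the document. Does anyone know how to detect the text regions in the document? An example is below. Thanks in advance!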

Example image: https://www.mathworks.com/matlabcentral/answers/uploaded_files/21044/6ce011abjw1elr8moiof7j20jg0w9jyt.jpg

Best answer

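The usual object-recognition pattern works here: threshold, detect regions, filter the regions, then do whatever you need with the regions that remain.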

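Thresholding is easy here. The background is pure white (or can be filtered to pure white), so anything above 0 in the inverted grayscale image is either text or a picture. Regions can then be detected in this thresholded binary image.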
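As a minimal illustration of the inversion step, the sketch below builds a tiny synthetic "page" (made-up values, not the example scan) and shows that every non-background pixel ends up above 0 after inversion. On a real scan the background is rarely exactly 255, which is presumably why the implementation further down uses an automatic threshold from `graythresh` instead of a fixed 0.

```matlab
% Synthetic page: white background, one dark 'letter' pixel and a larger dark 'picture'
page = 255*ones(6, 8, 'uint8');
page(2, 2) = 30;        % single-pixel 'letter'
page(4:5, 4:7) = 80;    % 2-by-4 'picture'

inv = 255 - page;       % invert: the white background becomes 0
mask = inv > 0;         % true wherever there is text or picture

disp(nnz(mask))         % number of foreground pixels: 1 + 8
```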

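To filter the regions, we only need to identify what distinguishes text from pictures. Text regions will be small, because each letter forms its own region. Pictures, by comparison, are large regions. Filtering by area with a suitable threshold keeps all the pictures and removes all the text, assuming no picture anywhere on the page is about the size of a single letter. If one is, other filtering criteria (saturation, hue variance, ...) can be used.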
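The implementation further down filters on area only. As a rough sketch of how a saturation criterion could be added, the example below passes the saturation channel to `regionprops` as the intensity image and keeps a region if it is either large or colorful. The synthetic image and the `MinSat` threshold are assumptions made for illustration; they are not part of the original answer and would need tuning on real scans.

```matlab
% Synthetic RGB page: white background, a small black 'letter' and a small red 'picture'
img = 255*ones(20, 20, 3, 'uint8');
img(3:4, 3:4, :) = 0;          % black blob: saturation 0
img(10:11, 10:11, 2:3) = 0;    % red blob: R stays 255, G and B set to 0

MinArea = 50;   % area threshold in pixels (illustrative value)
MinSat = 0.3;   % hypothetical mean-saturation threshold, in the range 0..1

mask = any(img < 255, 3);      % foreground = anything that is not pure white
hsvImg = rgb2hsv(img);
sat = hsvImg(:, :, 2);         % saturation channel

% Measure area and mean saturation of every connected region
regs = regionprops(mask, sat, 'Area', 'MeanIntensity');

% Keep a region if it is big enough OR colorful enough
keep = [regs.Area] > MinArea | [regs.MeanIntensity] > MinSat;
disp(keep)
```

Both blobs are letter-sized, so the area test alone would discard them; only the red one passes the saturation test. This only helps for color scans, since a grayscale picture has zero saturation everywhere.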

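After filtering the regions by area (and, where needed, saturation), a new image can be built by copying the pixels of the original image that fall inside the bounding boxes of the kept regions into a blank image.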

MATLAB implementation:

%%%%%%%%%%%%
% Set these values depending on your input image

img = imread('https://www.mathworks.com/matlabcentral/answers/uploaded_files/21044/6ce011abjw1elr8moiof7j20jg0w9jyt.jpg');

MinArea = 2000; % Minimum area to consider, in pixels
%%%%%%%%%
% End User inputs

gsImg = 255 - rgb2gray(img); % convert to grayscale (and invert 'cause that's how I think)
threshImg = gsImg > graythresh(gsImg)*max(gsImg(:)); % Threshold automatically

% Detect connected regions and measure each one's bounding box and area
regs = regionprops(threshImg, 'BoundingBox', 'Area');

% Keep only the regions whose area exceeds the MinArea threshold
regKeep = false(length(regs), 1);
for k = 1:length(regs)

    regKeep(k) = (regs(k).Area > MinArea);

end

regs(~regKeep) = []; % Delete those regions that don't pass qualifications for image

% Make a new blank image to hold the passed regions
newImg = 255*ones(size(img), 'uint8');

for k = 1:length(regs)

    boxHere = regs(k).BoundingBox; % [x y width height]; x and y lie half a pixel outside the region
    rowStart = ceil(boxHere(2)); % First row of the region (floor would give 0 at the image edge)
    colStart = ceil(boxHere(1)); % First column of the region
    rows = rowStart:(rowStart + boxHere(4) - 1); % Rows covered by the bounding box
    cols = colStart:(colStart + boxHere(3) - 1); % Columns covered by the bounding box
    % Insert pixels within bounding box from original image into the new
    % image
    newImg(rows, cols, :) = img(rows, cols, :);

end

% Display
figure()
image(newImg);

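As you can see in the image linked below, this does what is needed: everything except the pictures and the masthead is removed. As a bonus, it works just as well for color and grayscale images, in case you are working with newspaper pages other than the front page.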

Result:

http://imgur.com/vEmpavY,dd172fr#1

Regarding "image-processing - How to detect text regions in a document image?", a similar question can be found on Stack Overflow: https://stackoverflow.com/questions/26955513/
