gpt4 book ai didi

azure - 人工智能形式识别器与认知服务计算机视觉

转载 作者:行者123 更新时间:2023-12-03 07:07:58 25 4
gpt4 key购买 nike

当前使用 @azure/ai-form-recognizer 3.2.0 对图像和 PDF 进行 OCR,例如:

const poller = await MsClient.beginRecognizeInvoices(stream, 
{
onProgress: (state) => {}
});
const [ocrResult] = await poller.pollUntilDone();

@azure/cognitiveservices-computervision 的差异或关系是什么?我只对 OCR 感兴趣。

最佳答案

两者之间有几个关键区别。表单识别器的主要目标是构建表单和其他数字化文档中的数据以进行进一步处理。这里的关键是,表单识别器提供的功能可以帮助更好地将从所述文档中读取的信息置于上下文中,而不仅仅是独立的光学字符识别。来自 Form Recognizer documentation (强调我的):

Azure Form Recognizer is a cloud-based Azure Applied AI Service that uses machine-learning models to extract and analyze form fields, text, and tables from your documents. Form Recognizer analyzes your forms and documents, extracts text and data, maps field relationships as key-value pairs, and returns a structured JSON output. You quickly get accurate results that are tailored to your specific content without excessive manual intervention or extensive data science expertise. Use Form Recognizer to automate your data processing in applications and workflows, enhance data-driven strategies, and enrich document search capabilities.

另一方面,Azure 计算机视觉提供了三个不同的功能。虽然下面的 OCR 原则描述了与表单识别器类似的内容,但它的用途更为通用,因为它不像表单识别器那样提供健壮的键/值对上下文化。该服务还提供了更高级别的人工智能功能,用于处理图像和视频,以识别人物/名人、地标和其中的常见物体(等等)。来自 Computer Vision documentation :

Service Description
Optical Character Recognition (OCR) The Optical Character Recognition (OCR) service extracts text from images. You can use the new Read API to extract printed and handwritten text from photos and documents. It uses deep-learning-based models and works with text on a variety of surfaces and backgrounds. These include business documents, invoices, receipts, posters, business cards, letters, and whiteboards. The OCR APIs support extracting printed text in several languages...
Image Analysis The Image Analysis service extracts many visual features from images, such as objects, faces, adult content, and auto-generated text descriptions. Follow the Image Analysis quickstart to get started.
Spatial Analysis The Spatial Analysis service analyzes the presence and movement of people on a video feed and produces events that other systems can respond to. Install the Spatial Analysis container to get started.

乍一看,两者之间存在一些重叠,但经过进一步检查,两者的主要用例有了明确的划分。

关于azure - 人工智能形式识别器与认知服务计算机视觉,我们在Stack Overflow上找到一个类似的问题: https://stackoverflow.com/questions/71071309/

25 4 0
Copyright 2021 - 2024 cfsdn All Rights Reserved 蜀ICP备2022000587号
广告合作:1813099741@qq.com 6ren.com