

What is Optical character recognition?
source link: https://docs.microsoft.com/en-us/azure/cognitive-services/computer-vision/overview-ocr
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.

What is Optical character recognition?
- 06/21/2021
- 2 minutes to read
In this article
Optical character recognition (OCR) allows you to extract printed or handwritten text from images, such as photos of street signs and products, as well as from documents—invoices, bills, financial reports, articles, and more. Microsoft's OCR technologies support extracting printed text in several languages. Follow a quickstart to get started.
This documentation contains the following types of articles:
- The quickstarts are step-by-step instructions that let you make calls to the service and get results in a short period of time.
- The how-to guides contain instructions for using the service in more specific or customized ways.
Read API
The Computer Vision Read API is Azure's latest OCR technology (learn what's new) that extracts printed text (in several languages), handwritten text (English only), digits, and currency symbols from images and multi-page PDF documents. It's optimized to extract text from text-heavy images and multi-page PDF documents with mixed languages. It supports detecting both printed and handwritten text in the same image or document.
Input requirements
The Read call takes images and documents as its input. They have the following requirements:
- Supported file formats: JPEG, PNG, BMP, PDF, and TIFF
- For PDF and TIFF files, up to 2000 pages (only first two pages for the free tier) are processed.
- The file size must be less than 50 MB (6 MB for the free tier) and dimensions at least 50 x 50 pixels and at most 10000 x 10000 pixels.
Supported languages
The Read API supports a total of 73 languages for print style text. Refer to the full list of OCR-supported languages. Handwritten-style OCR is supported exclusively for English.
Key features
The Read API includes the following features.
- Print text extraction in 73 languages
- Handwritten text extraction in English
- Text lines and words with location and confidence scores
- No language identification required
- Support for mixed languages, mixed mode (print and handwritten)
- Select pages and page ranges from large, multi-page documents
- Natural reading order for text lines
- Handwriting classification for text lines
- Available as Distroless Docker container for on-premise deployment
Learn how to use the OCR features.
Use the cloud API or deploy on-premise
The Read 3.x cloud APIs are the preferred option for most customers because of ease of integration and fast productivity out of the box. Azure and the Computer Vision service handle scale, performance, data security, and compliance needs while you focus on meeting your customers' needs.
For on-premise deployment, the Read Docker container (preview) enables you to deploy the new OCR capabilities in your own local environment. Containers are great for specific security and data governance requirements.
Warning
The Computer Vision 2.0 RecognizeText operations are in the process of being deprecated in favor of the new Read API covered in this article. Existing customers should transition to using Read operations.
Data privacy and security
As with all of the Cognitive Services, developers using the Computer Vision service should be aware of Microsoft's policies on customer data. See the Cognitive Services page on the Microsoft Trust Center to learn more.
Next steps
Is this page helpful?
Recommended content
-
Quickstart: Read client library or REST API - Azure Cognitive Services
Learn how to use Optical character recognition (OCR) in your application through a native client library in the language of your choice.
-
How to call the Read API - Azure Cognitive Services
Learn how to call the Read API and configure its behavior in detail.
-
Receipts - Form Recognizer - Azure Applied AI Services
Learn concepts related to receipt analysis with the Form Recognizer API - usage and limits.
-
Language support - Computer Vision - Azure Cognitive Services
This article provides a list of natural languages supported by Computer Vision features; OCR, Image analysis.
Recommend
-
6
Optical character recognition From Wikipedia, the free encyclopedia Jump to navigation
-
11
Optical Character Recognition: How the Hell Does It Work?September 23rd 2021 new story
-
10
Optical character recognition with TensorFlow Lite Optical character recognition with TensorFlow Lite: A new example app September 27, 2021 Posted by Wei Wei, TensorFlow Developer Advocate
-
10
Laravel 8 Optical Character Recognition using Google Cloud Vision 20 views 2 days ago Laravel Google clou...
-
8
Reading Handwritten Pages With Azure and Optical Character Recognition An introduction to optical character recognition (OCR) in the cloud and how to use it from your applications. Learn about image-to...
-
8
Using the Google Optical Character Recognition APIJune 17th 2022 new story4
-
7
AI Inference Software Fundamentals: Getting Started with Optical Character Recognition ...
-
4
Welcome to DeskTranslate!
-
8
Dig Security adds optical character recognition to help find sensitive information in image files
-
11
Exploring Optical Character Recognition (OCR): An Experiment with OpenCV and PyTesseract
About Joyk
Aggregate valuable and interesting links.
Joyk means Joy of geeK