Comparison of optical character recognition software
This comparison of optical character recognition software includes:
- OCR engines, which perform the actual character identification
- Layout analysis software, which divides scanned documents into zones suitable for OCR
- Graphical interfaces to one or more OCR engines
- Software development kits that are used to add OCR capabilities to other software (e.g. forms processing applications, document imaging management systems, e-discovery systems, records management solutions)
| Name | Founded year | Latest stable version | Latest release year | License | Online | Windows | Mac OS X | Linux | BSD | Android | iOS | Programming language | SDK? | Languages | Fonts | Output formats | Notes |
|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|---|
| ABBYY FineReader | 1989 | 16 | 2023 | Proprietary | Yes | Yes | Yes | No | Yes | Yes | Yes | C/C++ | Yes | 198 [1] | All fonts | DOC, DOCX, XLS, XLSX, PPTX, RTF, PDF, HTML, CSV, TXT, ODT, DjVu, EPUB, FB2 [2] | ABBYY also supplies SDKs for embedded and mobile devices. Professional, Corporate and Site License Editions for Windows, Express Edition for Mac. [3] |
| AIDA | 2016 | 13.0 | 2024 | Proprietary | Yes | Yes | Yes | Yes | Yes | Yes | Yes | | No | All languages using the Latin alphabet | Machine-printed and handprinted text, Latin alphabet | DOCX, XLSX, PPTX, TXT, CSV, PDF, JSON, XML | AIDA is able to learn how to extract any value from any document, with a single click on a single document. [4] |
| AnyDoc Software | 1989 | ? | ? | Proprietary | No | Yes | No | No | No | ? | ? | VBScript | ? | ? | ? | | Works with structured, semi-structured, and unstructured documents. |
| Asprise OCR SDK | 1998 | 15 | 2015 | Proprietary | Yes | Yes | Yes | Yes | Yes | ? | ? | Java, C#, VB.NET, C/C++/Delphi | Yes | 20+ [5] | ? | Plain text, searchable PDF, XML [6] | Java, C#, VB.NET, C/C++/Delphi SDKs for OCR and barcode recognition on Windows, Linux, Mac OS X and Unix. [7] |
| CuneiForm | 1996 | 1.1 | 2011 | BSD variant | No | Yes | Yes | Yes | Yes | ? | ? | C/C++ | Yes | 28 | Any printed font | HTML, hOCR, native, RTF, TeX, TXT [8] | Enterprise-class system; preserves text formatting and recognizes complex tables of any structure. |
| E-aksharayan | 2010 | Yes | No | Yes | No | ? | ? | 14 | RTF, TXT, BRL | ||||||||
| GOCR | 2000 | 0.52 [9] | 2018 | GPL | Yes [10] | Yes | Yes | Yes | Yes | ? | ? | C | ? | 20+ | ? | | |
| Google Drive OCR or Google Cloud Vision | 2015 | | | Proprietary | Yes | Browser | Browser | Browser | Unknown | ? | ? | Unknown | Yes | 200+ | All fonts | Text | Google blog post [11] [12] |
| Microsoft Office Document Imaging | ? | Office 2007 | 2007 | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | | Uses OmniPage [citation needed] |
| Microsoft Office OneNote 2007 | 2011 | ? | 2007 | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | ||
| OCRFeeder | 2009-03 | 0.8.5 | 2022 | GPL | No | No | No | Yes | No | ? | ? | Python | ? | ? | ? | | Features a full graphical user interface and a command-line tool for automated operation. Has its own segmentation algorithm but relies on system-wide OCR engines such as Tesseract or Ocrad. |
| Ocrad | ? | 0.29 [13] | 2024 | GPL | Yes | No | Yes | Yes | Yes | ? | ? | C++ | Yes | Latin alphabet | ? | | Command line |
| OCRopus | 2007 | 1.3.3 | 2017 | Apache | No | No | Yes | Yes | Yes | ? | ? | Python | ? | All languages using Latin script (other languages can be trained) | Normal Latin script and Fraktur (other scripts can be trained) | TXT, hOCR [14], PDF [15] | Pluggable framework under active development, used for Google Books |
| OmniPage | 1970s | 19.2 | 2015 | Proprietary | Yes | Yes | Yes | Yes | No | ? | ? | C/C++, C# [16] | Yes | 125 [17] | Machine-printed and handprinted fonts | DOC/DOCX, XLS/XLSX, PPTX, RTF, PDF, PDF/A, searchable PDF, HTML, text, XML, ePUB, MP3 | Product of Nuance Communications |
| Puma.NET | ? | ? | 2009 | BSD | No | Yes | No | No | No | ? | ? | C# | Yes | 28 | Any printed font | | .NET OCR SDK based on Cognitive Technologies' CuneiForm recognition engine. Wraps the Puma COM server and provides a simplified API for .NET applications. |
| ReadSoft | ? | ? | 14? | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | | Scans, captures and classifies business documents such as invoices, forms and purchase orders, integrated with business processes. |
| Scantron | ? | ? | ? | Proprietary | No | Yes | No | No | No | ? | ? | ? | ? | ? | ? | | For working with localized interfaces, corresponding language support is required. |
| SmartScore | 1991 | 10.5.8 | 2015 | Proprietary | No | Yes | Yes | No | No | ? | ? | ? | ? | ? | ? | | For musical scores |
| Tesseract | 1985 | 5.5.0 | 2024 | Apache | No | Yes | Yes | Yes | Yes | ? | ? | C++, C | Yes | 100+ [18] | Any printed font | Text, ALTO, hOCR, PAGE [19], PDF, others with different user interfaces [20] or the API | Developed at HP Labs (1985–1995) and Google (2006–2018) [21] |
Evaluation
A 2016 analysis compared the accuracy and reliability of four OCR packages: Google Docs OCR, Tesseract, ABBYY FineReader, and Transym. Using a dataset of 1,227 images from 15 different categories, it concluded that Google Docs OCR and ABBYY FineReader performed better than the others. [22]
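The scoring method used in that study is not described here. As a rough illustration of how OCR accuracy is commonly quantified, the sketch below computes a character-level accuracy (one minus the character error rate) from the Levenshtein edit distance between an engine's output and a ground-truth transcription. The function names `edit_distance` and `character_accuracy`, the engine labels, and the sample strings are hypothetical and are not taken from the cited evaluation.

```python
def edit_distance(reference: str, hypothesis: str) -> int:
    """Levenshtein distance: minimum number of single-character
    insertions, deletions and substitutions turning one string into the other."""
    # previous[j] holds the distance between the reference prefix processed
    # so far and the first j characters of the hypothesis.
    previous = list(range(len(hypothesis) + 1))
    for i, ref_char in enumerate(reference, start=1):
        current = [i]
        for j, hyp_char in enumerate(hypothesis, start=1):
            cost = 0 if ref_char == hyp_char else 1
            current.append(min(
                previous[j] + 1,         # delete a reference character
                current[j - 1] + 1,      # insert a hypothesis character
                previous[j - 1] + cost,  # substitute, or match at no cost
            ))
        previous = current
    return previous[-1]


def character_accuracy(reference: str, hypothesis: str) -> float:
    """Return 1 - CER, clamped to 0, where CER = edit distance / reference length."""
    if not reference:
        # Assumption: an empty ground truth is matched only by empty output.
        return 1.0 if not hypothesis else 0.0
    cer = edit_distance(reference, hypothesis) / len(reference)
    return max(0.0, 1.0 - cer)


if __name__ == "__main__":
    ground_truth = "Optical character recognition"
    # Made-up outputs for illustration only; these are not measured results.
    sample_outputs = {
        "engine_a": "Optical character recognition",
        "engine_b": "0ptical charactor recogniti0n",
    }
    for name, text in sample_outputs.items():
        print(name, round(character_accuracy(ground_truth, text), 3))
```

A comparison across engines would average such a score over every image in the dataset. Word-level accuracy can be computed the same way by applying the distance to lists of tokens instead of characters.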
References
- ^ "ABBYY FineReader 14: Technical Specifications". Finereader.abbyy.com. Retrieved 2017-02-23.
- ^ "ABBYY FineReader 11: Technical Specifications". Finereader.abbyy.com. Retrieved 2013-09-12.
- ^ "Top OCR Software". Ocrworld.com. 2010-03-30. Archived from the original on 2017-02-23. Retrieved 2013-09-12.
- ^ "AIDA". TCLAB. 2024-12-03. Retrieved 2024-12-03.
- ^ "Asprise OCR SDK Features". asprise.com. Retrieved 2014-06-21.
- ^ "Asprise Java OCR Library Features". asprise.com. Retrieved 2014-06-21.
- ^ "Asprise Java, C#/VB.NET OCR API". asprise.com. 2015-11-19. Retrieved 2015-11-19.
- ^ Debian manual page for Cuneiform for Linux version 1.1.0
- ^ "GOCR Homepage". wasd.urz.uni-magdeburg.de. Retrieved 2018-10-17.
- ^ "GOCR". Jocr.sourceforge.net. Retrieved 2013-09-12.
- ^ "Supported languages". Feb 11, 2022.
- ^ Ashok Popat (Sep 4, 2015). "IEEE SPS: Optical Character Recognition for Most of the World's Languages". YouTube. Archived from the original on 2021-12-20.
- ^ Diaz, Antonio (2024-01-20). "GNU Ocrad 0.29 released" (Mailing list). info-gnu.
- ^ OCRopus includes the ocropus-hocr tool which produces hOCR from the recognition results.
- ^ In combination with the hocr-tools
- ^ "OmniPage CSDK - OCR Document Capture Toolkit | Document Imaging & OCR". Nuance. Archived from the original on 2010-08-24. Retrieved 2013-09-12.
- ^ "OmniPage Standard Document Conversion". Nuance. Archived from the original on 2014-03-13. Retrieved 2014-02-25.
- ^ Based on count of language training files for version 3.04. Available at the download page.
- ^ Usage explained in the Tesseract Readme and FAQ
- ^ Such as ODF with OCRFeeder
- ^ "GitHub - tesseract-ocr/tesseract: Tesseract Open Source OCR Engine (main repository)". GitHub. Retrieved 2025-03-21.
- ^ Assefi, Mehdi (2016-12-01). "OCR as a Service: An Experimental Evaluation of Google Docs OCR, Tesseract, ABBYY FineReader, and Transym". ResearchGate. Retrieved 2019-01-31.