📞 +91-7667918914 | ✉️ iarjset@gmail.com
International Advanced Research Journal in Science, Engineering and Technology
International Advanced Research Journal in Science, Engineering and Technology A Monthly Peer-Reviewed Multidisciplinary Journal
ISSN Online 2393-8021ISSN Print 2394-1588Since 2014
IARJSET aligns to the suggestive parameters by the latest University Grants Commission (UGC) for peer-reviewed journals, committed to promoting research excellence, ethical publishing practices, and a global scholarly impact.
← Back to VOLUME 3, ISSUE 5, MAY 2016

DOCUMENT IMAGE ANALYSIS USING IMAGEMAGICK AND TESSERACT-OCR

Prof. Smitha M L, Dr. Antony P J, Sachin D N

👁 4 views📥 0 downloads
Share: 𝕏 f in

Abstract: Document image analysis is the field of converting paper documents into an editable electronic representation by performing optical character recognition (OCR). In recent years, there has been a tremendous amount of progress in the development of open source OCR systems. The tesseract-ocr engine, as was the HP Research Prototype in the UNLV Fourth Annual Test of OCR Accuracy, is described in a comprehensive overview. Emphasis is placed on aspects that are novel or at least unusual in an OCR engine, including in particular the line finding, features/classification methods, and the adaptive classifier. OCRopus is one of the leading open source document analysis systems using tesseract-ocr with a modular and pluggable architecture. Imagemagick is an open source image processing tool. This paper presents an overview of different steps involved in a document image analysis system and illustrates them with examples from Combination of imagemagick and OCRopus.

Keywords: Document Image Analysis, Imagemagick, tesseract-ocr, open source OCR, Free Software.

How to Cite:

[1] Prof. Smitha M L, Dr. Antony P J, Sachin D N, “DOCUMENT IMAGE ANALYSIS USING IMAGEMAGICK AND TESSERACT-OCR,” International Advanced Research Journal in Science, Engineering and Technology (IARJSET), DOI: 10.17148/IARJSET.2016.3523

Creative Commons License This work is licensed under a Creative Commons Attribution 4.0 International License.