FreeTechBooks.com
Free Online Computer Science and Programming Books, Textbooks, and Lecture Notes


 
Document Image Analysis

Author(s) : Lawrence O'Gorman and Rangachar Kasturi
ISBN : 0-8186-7802-X
Publication Date : 1997, 2009

Excerpts from the Preface:

Rangachar Kasturi wrote:
This book describes some of the technical methods and systems used for document processing of text and graphics images. The methods have grown out of the fields of digital signal processing, digital image processing, and pattern recognition. The objective is to give the reader an understanding of what approaches are used for application to documents and how these methods apply to different situations. Since the field of document processing is relatively new, it is also dynamic, so current methods have room for improvement, and innovations are still being made. In addition, there are rarely definitive techniques for all cases of a certain problem.

The intended audience is executives, managers, and other decision makers whose business requires some acquaintance or understanding of document processing. (We call this group "executives" in accordance with the Executive Briefing series.) Some rudimentary knowledge of computers and computer images will be helpful background for these readers. We begin at basic principles (such as, what is a pixel?), but do not belabor them. The reader is expected not so much to have knowledge of picture processing as to have a level of comfort with the tasks that can be accomplished on a computer and the digital nature by which any computer technique operates. A grasp of the terminology goes a long way toward aiding the executive in discussing the problem. For this reason, each section begins with a list of keywords that also appears in the index. With knowledge of the terminology and whatever depth of method or system understanding that he or she decides to take from the text, the executive should be well-equipped to deal with document processing issues.

Rangachar Kasturi wrote:
The book is organized in the sequence that document images are usually processed. After document input by digital scanning, pixel processing is first performed. This level of processing includes operations that are applied to all image pixels. These include noise removal, image enhancement, and segmentation of image components into text and graphics (lines and symbols). Feature-level analysis treats groups of pixels as entities, and includes line and curve detection, and shape description. The last two chapters separate text and graphics analysis. Text analysis includes optical character recognition (OCR) and page format recognition. Graphics analysis includes recognition of components of engineering drawings, maps, and other diagrams.
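
To give a feel for the pixel-level stage mentioned in the excerpt above, here is a minimal sketch in plain Python. It is not taken from the book: the function names `binarize` and `remove_isolated_pixels`, the fixed threshold of 128, and the 8-neighbour rule are all assumptions chosen for illustration. It thresholds a grayscale image into ink and background, then removes isolated "salt-and-pepper" specks.

```python
def binarize(gray, threshold=128):
    """Map a grayscale image (rows of 0-255 ints) to 1 = ink, 0 = background.

    The threshold of 128 is an illustrative assumption, not a recommended value.
    """
    return [[1 if px < threshold else 0 for px in row] for row in gray]


def remove_isolated_pixels(binary):
    """Flip any pixel whose neighbours (up to 8) all have the opposite value."""
    h, w = len(binary), len(binary[0])
    out = [row[:] for row in binary]  # copy so decisions use the original image
    for y in range(h):
        for x in range(w):
            neighbours = [
                binary[ny][nx]
                for ny in range(max(0, y - 1), min(h, y + 2))
                for nx in range(max(0, x - 1), min(w, x + 2))
                if (ny, nx) != (y, x)
            ]
            if neighbours and all(n != binary[y][x] for n in neighbours):
                out[y][x] = neighbours[0]
    return out


# A 5x5 page: a dark horizontal stroke on the top row plus one stray dark speck.
page = [[255] * 5 for _ in range(5)]
page[0] = [0] * 5
page[3][2] = 0

cleaned = remove_isolated_pixels(binarize(page))
for row in cleaned:
    print("".join("#" if px else "." for px in row))
```

In this toy page the stroke survives because each of its pixels has an ink neighbour, while the lone speck is flipped back to background. Real systems would typically use adaptive thresholds and more careful filters.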

View/Download Document Image Analysis | Book webpage
Well, I have not read this book and do not have much experience with the subject. But I have heard that a huge number of documents that were once available only in libraries are now on the web. These include the original page images as well as reference text for the reader, along with the input data that is processed to automatically detect the meaningful content of a document.