Guide to OCR for Indic Scripts: Document Recognition and Retrieval

Venu Govindaraju, Srirangaraj (Ranga) Setlur
Springer Science & Business Media, Sep 25, 2009 - Computers - 325 pages

This is the first comprehensive text on Optical Character Recognition (OCR) for Indic scripts. It covers both the recognition and the retrieval of Indic documents, and describes OCR systems for eight scripts: Bangla, Devanagari, Gurmukhi, Gujarati, Kannada, Malayalam, Tamil and Urdu.

Contents

Building Data Sets for Indian Language OCR Research
Bangla and Devanagari
A Complete Machine-Printed Gurmukhi OCR System
Progress in Gujarati Document Processing and Character Recognition
Design of a Bilingual Kannada-English OCR
Recognition of Malayalam Documents
A Complete OCR System for Tamil Magazine Documents
Experiments on Urdu Text Recognition
The BBN Byblos Hindi OCR System
Generalization of Hindi OCR Using Adaptive Segmentation and Font Files
Online Handwriting Recognition for Indic Scripts
Part II Retrieval of Indic Documents
Enhancing Access to Primary Cultural Heritage Materials of India
Digital Image Enhancement of Indic Historical Manuscripts
GFG-Based Compression and Retrieval of Document Images in Indian Scripts
Word Spotting for Indic Documents to Facilitate Retrieval
Indian Language Information Retrieval
Colour Plates
