This is the first comprehensive text on Optical Character Recognition for Indic scripts. It covers many topics and describes OCR systems for eight different scripts—Bangla, Devanagari, Gurmukhi, Gujarti, Kannada, Malayalam, Tamil and Urdu.
This is the fourth edition of the standard text and complete reference for scientists and engineers. This fully revised version includes important updates on articles and books as well as information on creating transparencies for classrooms and professional meetings.
This Second Edition explores the field of text mining. Coverage includes the use of machine learning in conjunction with natural language processing, information extraction, and algebraic/mathematical approaches to computational information retrieval.