Automatic Digital Document Processing and Management

Problems, Algorithms and Techniques

By Stefano Ferilli

Automatic Digital Document Processing and Management Cover Image

Examining the full range of a document’s lifetime, this volume reviews the issues involved in handling and processing digital documents. Topics include acquisition, representation, security, pre-processing, layout analysis and analysis of single components.

Full Description

  • ISBN13: 978-0-8572-9197-4
  • 324 Pages
  • User Level: Science
  • Publication Date: January 3, 2011
  • Available eBook Formats: PDF
  • eBook Price: $129.00
Buy eBook Buy Print Book Add to Wishlist

Related Titles

Full Description
This text reviews the issues involved in handling and processing digital documents. Examining the full range of a document’s lifetime, the book covers acquisition, representation, security, pre-processing, layout analysis, understanding, analysis of single components, information extraction, filing, indexing and retrieval. Features: provides a list of acronyms and a glossary of technical terms; contains appendices covering key concepts in machine learning, and providing a case study on building an intelligent system for digital document and library management; discusses issues of security, and legal aspects of digital documents; examines core issues of document image analysis, and image processing techniques of particular relevance to digitized documents; reviews the resources available for natural language processing, in addition to techniques of linguistic analysis for content handling; investigates methods for extracting and retrieving data/information from a document.
Table of Contents

Table of Contents

  1. Part I: Digital Documents.
  2. Documents.
  3. Digital Formats.
  4. Legal and Security Aspects.
  5. Part II: Document Analysis.
  6. Image Processing.
  7. Document Image Analysis.
  8. Part III: Content Processing.
  9. Natural Language Processing.
  10. Information Management.
Errata

Please Login to submit errata.

No errata are currently published