Apress

Intelligent Document Retrieval

Exploiting Markup Structure

By Udo Kruschwitz

  • ISBN13: 978-1-4020-3767-2
  • 216 Pages
  • User Level: Science
  • Publication Date: January 9, 2006
  • Available eBook Formats: PDF
  • eBook Price: $199.00
Full Description
Collections of digital documents, such as Web sites and intranets, are now common in institutions, universities, and companies. Yet searching them for information can still be painful: searches often return either a large number of matches or no suitable matches at all. Such collections vary widely in size and in how much structure they carry. What they have in common is that they typically have some structure and cover a limited range of topics; the second point distinguishes them from the Web in general. The type of search system proposed in this book can suggest ways of refining or relaxing a query to assist the user in the search process. To suggest sensible query modifications, the system needs to know what the documents are about, which calls for explicit knowledge about the document collection encoded in some electronic form. Such knowledge is typically not available, so we construct it automatically.
Table of Contents

From the contents:

  • Foreword.
  • Preface.
  • List of Figures.
  • List of Tables.
  • 1 Introduction.
  • Part I The Model.
  • 2 Related Work.
  • 3 Data Analysis and Domain Model Construction.
  • 4 Incorporating Additional Knowledge.
  • 5 A Dialogue System for Partially Structured Data.
  • Part II Practical Applications.
  • 6 UKSearch: Intelligent Web Search.
  • 7 UKSearch: Evaluation and Discussion.
  • 8 YPA: Searching Classified Directories.
  • 9 Future Directions and Conclusions.
  • References.
  • Index.
Errata

If you think that you've found an error in this book, please let us know about it. You will find any confirmed erratum below, so you can check if your concern has already been addressed.

No errata are currently published.