Apress Access

Web Archiving

By Julien Masanès

  • eBook Price: $69.99
Buy eBook Buy Print Book

Web Archiving Cover Image

  • Add to Wishlist
  • ISBN13: 978-3-5402-3338-1
  • 241 Pages
  • User Level: Science
  • Publication Date: February 15, 2007
  • Available eBook Formats: PDF
Full Description
The public information available on the Web today is larger than information distributed on any other media. The raw nature of Web content, the unpredictable remote changes that can affect it, the wide variety of formats concerned, and the growth in data-driven websites make the preservation of this material a challenging task, requiring specific monitoring, collecting and preserving strategies, procedures and tools. Julien Masanès, Director of the European Archive, has assembled contributions from computer scientists and librarians that altogether encompass the complete range of tools, tasks and processes needed to successfully preserve the cultural heritage of the Web. His book serves as a standard introduction for everyone involved in keeping alive the immense amount of online information, and it covers issues related to building, using and preserving Web archives both from the computer scientist and librarian viewpoints. Practitioners will find in this book a state-of-the-art overview of methods, tools and standards they need for their activities. Researchers as well as advanced students in computer science will use it as an introduction to this new field with a hopefully stimulating review of open issues where future work is needed.
Table of Contents

Table of Contents

  1. Web Archiving: Issues and Methods.
  2. Web Use and Web Studies.
  3. Selection for Web Archives.
  4. Copying Web Sites.
  5. Archiving the Hidden Web.
  6. Access and Finding Aids.
  7. Mining Web Collections.
  8. The Long
  9. Term Preservation of Web Content.
  10. Year
  11. by
  12. Year: From an Archive of the Internet to an Archive on the Internet.
  13. Small Scale Academic Web Archiving: DACHS.

If you think that you've found an error in this book, please let us know by emailing to editorial@apress.com . You will find any confirmed erratum below, so you can check if your concern has already been addressed.
No errata are currently published


    1. Information Extraction: Algorithms and Prospects in a Retrieval Context


      View Book