HAPPY HOLIDAYS: Get a special discount on Apress Access! Subscribe today >>

Theory and Applications of Natural Language Processing

Language Technology for Cultural Heritage

Selected Papers from the LaTeCH Workshop Series

Editors: Sporleder, Caroline, van den Bosch, Antal, Zervanou, Kalliopi (Eds.)

  • selected papers of high quality that highlight different applications and different approaches to a given task
  • First book dealing specifically with language technology for CH/SSH domains
  • Unique presentation with highlight on different applications and different approaches
see more benefits

Buy this book

eBook $149.00
price for USA
  • ISBN 978-3-642-20227-8
  • Digitally watermarked, DRM-free
  • Included format: PDF
  • ebooks can be used on all reading devices
  • Download immediately after purchase
Hardcover $189.00
price for USA
  • ISBN 978-3-642-20226-1
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Softcover $189.00
price for USA
  • ISBN 978-3-642-26988-2
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this book

The digital age has had a profound effect on our cultural heritage and the academic research that studies it. Staggering amounts of objects, many of them of a textual nature, are being digitised to make them more readily accessible to both experts and laypersons. Besides a vast potential for more effective and efficient preservation, management, and presentation, digitisation offers opportunities to work with cultural heritage data in ways that were never feasible or even imagined. 

To explore and exploit these possibilities, an interdisciplinary approach is needed, bringing together experts from cultural heritage, the social sciences and humanities on the one hand, and information technology on the other. Due to a prevalence of textual data in these domains, language technology has a crucial role to play in this endeavour. Language technology can break through the "Google barrier" by offering the potential to analyse texts at advanced levels, extracting information and knowledge at the level of the humanities or social sciences researcher, who wants to know about the who, what, where, and when, but also the how and the why. At the same time cultural heritage data poses considerable challenges for existing language technology: technology aimed at "generic" language has to face such disparate problems as historical language variation, OCR digitisation errors, and near-extinct academic expertise.
 
This book is primarily intended for researchers in information technology and language processing who would like to receive a state-of-the-art overview of the whole breadth of the new and vibrant field of language technology for cultural heritage and its associated academic research in the humanities and social sciences. Researchers working in the target domains of cultural heritage, the social sciences and humanities will also find this book useful, as it provides an overview of how language technology can help them with their information needs. The book covers applications ranging from pre-processing and data cleaning, to the adaptation and compilation of linguistic resources, to personalisation, narrative analysis, visualisation and retrieval.  

About the authors

Caroline Sporleder leads a junior research group in the Cluster of Excellence “Multimodal Computing and Interaction” at Saarland University, Germany. Her research interests include text mining and error detection for cultural heritage data, cross-domain language processing, and computational modelling of semantics and discourse. Before coming to Saarland University, she worked as a post-doctoral researcher on the MITCH project (Mining for Information from the Cultural Heritage), a joint research project between Tilburg University, and Naturalis, the Dutch National Museum of Natural History. The project aimed at developing technology to clean, enrich and structure field books and other natural history data sources.

Antal van den Bosch is a full professor of computational linguistics and artificial intelligence at Tilburg University. His research interests include machine learning of natural language; historical and heritage text mining; proofing and recommendation; and machine translation. He helped create the Dutch CATCH programme (Continuous Access to Cultural Heritage) funded by the Netherlands Organization for Scientific Research, and has been PI of two CATCH projects: the aforementioned MITCH (with Caroline Sporleder) and the current HitiMe project (with Kalliopi Zervanou). He was guest editor of the special issue of the Interdisciplinary Science Review journal on “Continuous access to cultural heritage” published in 2009.

Kalliopi A. Zervanou is a post-doctoral researcher in the HiTiME project (Historical Timeline Mining and Extraction), a joint research project by Tilburg University’s Centre for Cognition and Communication (TiCC) and the International Institute of Social History. The project aims at the development of a text analysis system for the recognition and extraction of historical events and facts from a variety of primary and secondary historical sources. Previous to HiTiME, she worked as a researcher at the University of Manchester, and the Technical University of Crete, in various information management and text mining projects. Her research interests include information extraction, knowledge acquisition and representation techniques, automatic term extraction and abstracting.

Table of contents (12 chapters)

  • Strategies for Reducing and Correcting OCR Errors

    Volk, Martin (et al.)

    Pages 3-22

  • Alignment between Text Images and their Transcripts for Handwritten Documents

    Toselli, Alejandro H. (et al.)

    Pages 23-37

  • A Diachronic Computational Lexical Resource for 800 Years of Swedish

    Borin, Lars (et al.)

    Pages 41-61

  • Morphosyntactic Tagging of Old Icelandic Texts and Its Use in Studying Syntactic Variation and Change

    Rögnvaldsson, Eiríkur (et al.)

    Pages 63-76

  • The Ancient Greek and Latin Dependency Treebanks

    Bamman, David (et al.)

    Pages 79-98

Buy this book

eBook $149.00
price for USA
  • ISBN 978-3-642-20227-8
  • Digitally watermarked, DRM-free
  • Included format: PDF
  • ebooks can be used on all reading devices
  • Download immediately after purchase
Hardcover $189.00
price for USA
  • ISBN 978-3-642-20226-1
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Softcover $189.00
price for USA
  • ISBN 978-3-642-26988-2
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
Loading...

Bibliographic Information

Bibliographic Information
Book Title
Language Technology for Cultural Heritage
Book Subtitle
Selected Papers from the LaTeCH Workshop Series
Editors
  • Caroline Sporleder
  • Antal van den Bosch
  • Kalliopi Zervanou
Series Title
Theory and Applications of Natural Language Processing
Copyright
2011
Publisher
Springer-Verlag Berlin Heidelberg
Copyright Holder
Springer-Verlag Berlin Heidelberg
eBook ISBN
978-3-642-20227-8
DOI
10.1007/978-3-642-20227-8
Hardcover ISBN
978-3-642-20226-1
Softcover ISBN
978-3-642-26988-2
Series ISSN
2192-032X
Edition Number
1
Number of Pages
XXXII, 232
Topics