Name: From Extractive to Abstractive Summarization: A Journey
ISBN: 978-981-13-8934-4

Overview

Authors:

Parth Mehta
1. Information Retrieval and Language Processing Lab, Dhirubhai Ambani Institute of Information and Communication Technology, Gandhinagar, India
View author publications

You can also search for this author in PubMed Google Scholar
Prasenjit Majumder
1. Information Retrieval and Language Processing Lab, Dhirubhai Ambani Institute of Information and Communication Technology, Gandhinagar, India
View author publications

You can also search for this author in PubMed Google Scholar

Provides an overview of the transition from extractive to abstractive summarization and how they can be combined to create a practical summarization system
Covers in detail ways to build ensembles from several existing techniques
Places special emphasis on domain-specific applications that are practically useful
Is accompanied by source codes and corpora, so that readers can use the book as a springboard for future experiments

3234 Accesses
5 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 84.99

Price excludes VAT (USA)

Softcover Book USD 109.99

Price excludes VAT (USA)

Hardcover Book USD 109.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (8 chapters)

Front Matter

Pages i-xi

Download chapter PDF
Introduction
- Parth Mehta, Prasenjit Majumder
Pages 1-9
Related Work
- Parth Mehta, Prasenjit Majumder
Pages 11-24
Corpora and Evaluation for Text Summarisation
- Parth Mehta, Prasenjit Majumder
Pages 25-34
Domain-Specific Summarisation
- Parth Mehta, Prasenjit Majumder
Pages 35-48
Improving Sentence Extraction Through Rank Aggregation
- Parth Mehta, Prasenjit Majumder
Pages 49-68
Leveraging Content Similarity in Summaries for Generating Better Ensembles
- Parth Mehta, Prasenjit Majumder
Pages 69-81
Neural Model for Sentence Compression
- Parth Mehta, Prasenjit Majumder
Pages 83-95
Conclusion
- Parth Mehta, Prasenjit Majumder
Pages 97-98
Back Matter

Pages 99-116

Download chapter PDF

Keywords

About this book

This book describes recent advances in text summarization, identifies remaining gaps and challenges, and proposes ways to overcome them. It begins with one of the most frequently discussed topics in text summarization – ‘sentence extraction’ –, examines the effectiveness of current techniques in domain-specific text summarization, and proposes several improvements.
In turn, the book describes the application of summarization in the legal and scientific domains, describing two new corpora that consist of more than 100 thousand court judgments and more than 20 thousand scientific articles, with the corresponding manually written summaries. The availability of these large-scale corpora opens up the possibility of using the now popular data-driven approaches based on deep learning. The book then highlights the effectiveness of neural sentence extraction approaches, which perform just as well as rule-based approaches, but without the need for any manual annotation. As a next step, multiple techniques for creating ensembles of sentence extractors – which deliver better and more robust summaries – are proposed. In closing, the book presents a neural network-based model for sentence compression. Overall the book takes readers on a journey that begins with simple sentence extraction and ends in abstractive summarization, while also covering key topics like ensemble techniques and domain-specific summarization, which have not been explored in detail prior to this.

Authors and Affiliations

Information Retrieval and Language Processing Lab, Dhirubhai Ambani Institute of Information and Communication Technology, Gandhinagar, India

Parth Mehta, Prasenjit Majumder

About the authors

Dr. Parth Mehta completed his M.Tech. in Machine Intelligence and his Ph.D. in Text Summarization at Dhirubhai Ambani Institute of ICT (DA-IICT), Gandhinagar, India. At the DA-IICT he was part of the Information Retrieval and Natural Language Processing Lab. He was also involved in the national project “Cross Lingual Information Access”, funded by the Govt. of India, which focused on building a cross-lingual search engine for nine Indian languages.
Dr. Mehta has served as reviewer for the journals Information Processing and Management and Forum for Information Retrieval Evaluation. Apart from several journal and conference papers, he has also co-edited a book on text processing published by Springer.
Prof. Prasenjit Majumder is an Associate Professor at Dhirubhai Ambani Institute of ICT (DA-IICT), Gandhinagar and a Visiting Professor at the Indian Institute of Information Technology, Vadodara (IIIT-V). Prof. Majumder completed his Ph.D. at Jadavpur University in 2008 and worked as a postdoctoral fellow at the University College Dublin, prior to joining the DA-IICT, where he currently heads the Information Retrieval and Language Processing Lab. His research interests lie at the intersection of Information Retrieval, Cognitive Science and Human Computing Interaction. He has headed several projects sponsored by the Govt. of India.
He is one of the pioneers of the Forum for Information Retrieval Evaluation (FIRE), which assesses research on Information Retrieval and related areas for South Asian languages. Since being founded in 2008, FIRE has grown to become a respected conference, drawing participants from across the globe. Prof. Majumder has authored several journal and conference papers, and co-edited two special issues of Transactions in Information Systems (ACM). He has co-edited two books: ‘Multi Lingual Information Access in South Asian Languages’ and ‘Text Processing,’ both published by Springer.

Bibliographic Information

Book Title: From Extractive to Abstractive Summarization: A Journey
Authors: Parth Mehta, Prasenjit Majumder
DOI: https://doi.org/10.1007/978-981-13-8934-4
Publisher: Springer Singapore
eBook Packages: Computer Science, Computer Science (R0)
Copyright Information: Springer Nature Singapore Pte Ltd. 2019
Hardcover ISBN: 978-981-13-8933-7Published: 30 August 2019
Softcover ISBN: 978-981-13-8936-8Published: 30 August 2020
eBook ISBN: 978-981-13-8934-4Published: 13 August 2019
Edition Number: 1
Number of Pages: XI, 116
Number of Illustrations: 461 b/w illustrations, 9 illustrations in colour
Topics: Performance and Reliability, Computer Communication Networks, Information Systems Applications (incl. Internet)

Publish with us

Policies and ethics

From Extractive to Abstractive Summarization: A Journey

Overview

Access this book

Other ways to access

Table of contents (8 chapters)

Front Matter

Back Matter

Keywords

About this book

Authors and Affiliations

Information Retrieval and Language Processing Lab, Dhirubhai Ambani Institute of Information and Communication Technology, Gandhinagar, India

About the authors

Bibliographic Information

Publish with us

Search

Navigation