Skip to main content
  • Textbook
  • © 2018

Machine Learning for Text

Authors:

  • The first textbook to cover machine learning of text in a holistic way, which includes aspects of mining, language modeling, and deep learning
  • Includes many examples to simplify exposition and facilitate in learning. Semantically understandable illustrations are provided, so that they can be used in classroom teaching
  • Provides comprehensive coverage of this field.The depth and breadth of coverage
  • is unique to this textbook
  • Request lecturer material: sn.pub/lecturer-material

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 84.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (14 chapters)

  1. Front Matter

    Pages i-xxiii
  2. Machine Learning for Text: An Introduction

    • Charu C. Aggarwal
    Pages 1-16
  3. Text Preparation and Similarity Computation

    • Charu C. Aggarwal
    Pages 17-30
  4. Matrix Factorization and Topic Modeling

    • Charu C. Aggarwal
    Pages 31-72
  5. Text Clustering

    • Charu C. Aggarwal
    Pages 73-112
  6. Text Classification: Basic Models

    • Charu C. Aggarwal
    Pages 113-157
  7. Linear Classification and Regression for Text

    • Charu C. Aggarwal
    Pages 159-207
  8. Classifier Performance and Evaluation

    • Charu C. Aggarwal
    Pages 209-234
  9. Joint Text Mining with Heterogeneous Data

    • Charu C. Aggarwal
    Pages 235-258
  10. Information Retrieval and Search Engines

    • Charu C. Aggarwal
    Pages 259-304
  11. Text Sequence Modeling and Deep Learning

    • Charu C. Aggarwal
    Pages 305-360
  12. Text Summarization

    • Charu C. Aggarwal
    Pages 361-380
  13. Information Extraction

    • Charu C. Aggarwal
    Pages 381-411
  14. Opinion Mining and Sentiment Analysis

    • Charu C. Aggarwal
    Pages 413-434
  15. Text Segmentation and Event Detection

    • Charu C. Aggarwal
    Pages 435-452
  16. Back Matter

    Pages 453-493

About this book

Text analytics is a field that lies on the interface of information retrieval,machine learning, and natural language processing, and this textbook carefully covers a coherently organized framework drawn from these intersecting topics. The chapters of this textbook is organized into three categories:

- Basic algorithms: Chapters 1 through 7 discuss the classical algorithms for machine learning from text such as preprocessing, similarity computation, topic modeling, matrix factorization, clustering, classification, regression, and ensemble analysis.

- Domain-sensitive mining: Chapters 8 and 9 discuss the learning methods from text when combined with different domains such as multimedia and the Web. The problem of information retrieval and Web search is also discussed in the context of its relationship with ranking and machine learning methods. 

- Sequence-centric mining: Chapters 10 through 14 discuss various sequence-centric and natural language applications, such as feature engineering, neural language models, deep learning, text summarization, information extraction, opinion mining, text segmentation, and event detection.

 This textbook covers machine learning topics for text in detail. Since the coverage is extensive,multiple courses can be offered from the same book, depending on course level. Even though the presentation is text-centric, Chapters 3 to 7 cover machine learning algorithms that are often used indomains beyond text data. Therefore, the book can be used to offer courses not just in text analytics but also from the broader perspective of machine learning (with text as a backdrop).

 This textbook targets graduate students in computer science, as well as researchers, professors, and industrial practitioners working in these related fields. This textbook is accompanied with a solution manual for classroom teaching.

Reviews

“The book discusses many key technologies used today in social media, such as opinion mining or event detection. One of the most promising new technologies, deep learning, is discussed as well. This book is an excellent resource for programmers and graduate students interested in becoming experts in the text mining field. … Summing Up: Recommended. Graduate students, researchers, and professionals.” (J. Brzezinski, Choice, Vol. 56 (04), December, 2018)

Authors and Affiliations

  • IBM T. J. Watson Research Center, Yorktown Heights, USA

    Charu C. Aggarwal

About the author

Charu C. Aggarwal is a Distinguished Research Staff Member (DRSM) at the IBM T. J. Watson Research Center in Yorktown Heights, New York. He completed his undergraduate degree in Computer Science from the Indian Institute of Technology at Kanpur in 1993 and his Ph.D. from the Massachusetts Institute of Technology in 1996. He has worked extensively in the field of data mining. He has published more than 350 papers in refereed conferences and journals and authored over 80 patents. He is the author or editor of 17 books, including textbooks on data mining, recommender systems, and outlier analysis. Because of the commercial value of his patents, he has thrice been designated a Master Inventor at IBM. He is a recipient of an IBM Corporate Award (2003) for his work on bio-terrorist threat detection in data streams, a recipient of the IBM Outstanding Innovation Award (2008) for his scientific contributions to privacy technology,and a recipient of two IBM Outstanding Technical Achievement Awards (2009, 2015) for his work on data streams/high-dimensional data. He received the EDBT 2014 Test of Time Award for his work on condensation-based privacy-preserving data mining. He is also a recipient of the IEEE ICDM Research Contributions Award (2015), which is one of the two highest awards for influential research contributions in the field of data mining. He has served as the general co-chair of the IEEE Big Data Conference (2014) and as the program co-chair of the ACM CIKM Conference (2015), the IEEE ICDM Conference (2015), and the ACM KDD Conference (2016). He served as an associate editor of the IEEE Transactions on Knowledge and Data Engineering from 2004 to 2008. He is an associate editor of the IEEE Transactions on Big Data, an action editor of the Data Mining and Knowledge Discovery Journal, and an associate editor of the Knowledge and Information Systems Journal. He has served as editor-in-chief of the ACM SIGKDD Explorations (2014–2017) and is currently an editor-in-chief of the ACM Transactions on Knowledge Discovery from Data. He serves on the advisory board of the Lecture Notes on Social Networks, a publication by Springer. He has served as the vice-president of the SIAM Activity Group on Data Mining and is a member of the SIAM industry committee. He is a fellow of the SIAM, ACM, and the IEEE, for “contributions to knowledge discovery and data mining algorithms.”

Bibliographic Information

  • Book Title: Machine Learning for Text

  • Authors: Charu C. Aggarwal

  • DOI: https://doi.org/10.1007/978-3-319-73531-3

  • Publisher: Springer Cham

  • eBook Packages: Computer Science, Computer Science (R0)

  • Copyright Information: Springer International Publishing AG, part of Springer Nature 2018

  • Hardcover ISBN: 978-3-319-73530-6Published: 03 April 2018

  • Softcover ISBN: 978-3-030-08807-1Published: 01 February 2019

  • eBook ISBN: 978-3-319-73531-3Published: 19 March 2018

  • Edition Number: 1

  • Number of Pages: XXIII, 493

  • Number of Illustrations: 76 b/w illustrations, 4 illustrations in colour

  • Topics: Data Mining and Knowledge Discovery, Artificial Intelligence

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info
Hardcover Book USD 84.99
Price excludes VAT (USA)
  • Durable hardcover edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access