Analyzing Emotion in Spontaneous Speech

  • Book
  • © 2017

Overview

  • Serves as a compact guide for researchers and students starting to explore the challenges of automatic emotion recognition in spontaneous speech
  • Explains, elaborates on, and proposes possible solutions to the existing challenges in spontaneous speech emotion recognition
  • Presents case studies based on real-life practical problems
  • Lists databases that are useful starting points for work in audio emotion analysis

Table of contents (6 chapters)

About this book

This book captures the current challenges in automatic recognition of emotion in spontaneous speech and makes an effort to explain, elaborate on, and propose possible solutions. Intelligent human–computer interaction (iHCI) systems thrive on several technologies, such as automatic speech recognition (ASR), speaker identification, language identification, image and video recognition, and affect/mood/emotion analysis and recognition. Given the importance of spontaneity in any human–machine conversational speech, reliable recognition of emotion from naturally spoken spontaneous speech is crucial. While emotions explicitly demonstrated by an actor are easy for a machine to recognize, the same is not true of day-to-day, naturally spoken spontaneous speech. The book explores several reasons for this, the main one being that people, especially non-actors, do not explicitly demonstrate their emotions when they speak, which makes it difficult for machines to distinguish the emotions embedded in their speech. This short book, based on some of the authors' previously published books in the area of audio emotion analysis, identifies the practical challenges in analyzing emotions in spontaneous speech and puts forward several possible solutions that can assist in robustly determining the emotions expressed.
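For readers new to the area, a minimal sketch may help make the challenge concrete: a typical speech emotion recognition pipeline summarizes each utterance with fixed-length acoustic features and feeds them to a standard classifier. This is an illustrative assumption on our part, not the authors' method, and the file paths and labels below are hypothetical placeholders for a real labelled emotional-speech corpus.

    # Minimal illustrative sketch (not the authors' method): a typical
    # speech emotion recognition pipeline of the kind the book's
    # challenges apply to.
    import numpy as np
    import librosa
    from sklearn.model_selection import train_test_split
    from sklearn.svm import SVC

    def utterance_features(path, sr=16000, n_mfcc=13):
        """Summarize an utterance as the mean and std of its MFCCs."""
        y, _ = librosa.load(path, sr=sr)
        mfcc = librosa.feature.mfcc(y=y, sr=sr, n_mfcc=n_mfcc)
        return np.concatenate([mfcc.mean(axis=1), mfcc.std(axis=1)])

    # Hypothetical placeholders; replace with a real labelled corpus,
    # one emotion tag per audio file.
    files = ["clip_001.wav", "clip_002.wav", "clip_003.wav", "clip_004.wav"]
    labels = ["happy", "angry", "happy", "neutral"]

    X = np.stack([utterance_features(f) for f in files])
    X_train, X_test, y_train, y_test = train_test_split(X, labels, test_size=0.25)

    # A standard classifier over fixed-length utterance features.
    clf = SVC(kernel="rbf").fit(X_train, y_train)
    print("held-out accuracy:", clf.score(X_test, y_test))

Pipelines of this fixed-feature kind can perform well on acted corpora; on spontaneous speech, where emotion is rarely demonstrated explicitly, their accuracy drops, which is exactly the gap the book examines.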

Authors and Affiliations

  • Speech and Natural Language Processing, TCS Research and Innovation - Mumbai, Tata Consultancy Services Limited, Thane, India

    Rupayan Chakraborty, Meghna Pandharipande, Sunil Kumar Kopparapu

About the authors

Rupayan Chakraborty (Member, IEEE) works as a scientist at TCS Research and Innovation - Mumbai. He has been working in the area of speech and audio signal processing and recognition since 2008 and was involved in academic research prior to joining TCS, working as a researcher at the Computer Vision and Pattern Recognition (CVPR) Unit of the Indian Statistical Institute (ISI), Kolkata. He obtained his PhD from the TALP Research Centre, Barcelona, Spain, in December 2013, in the area of acoustic event detection and localization using distributed sensor arrays in room environments, carried out under the "Speech and Audio Recognition for Ambient Intelligence (SARAI)" project. After completing his PhD, he was a visiting scientist at the CVPR Unit of ISI, Kolkata, for one year. He has published in top-tier conferences and journals and is currently working in the area of speech emotion recognition and analysis.


Meghna Pandharipande received her Bachelor of Engineering (BE) in Electronics and Telecommunication in June 2002 from Amravati University, Amravati. Between September 2002 and December 2003, she was a faculty member in the Department of Electronics and Telecommunication at Shah and Anchor Kutchhi Engineering College, Mumbai. In 2004, she completed a certification in Embedded Systems at CMC, Mumbai, and then worked as a Lotus Notes developer at ATS, a startup in Mumbai, for a year. Since June 2005 she has been with TCS (having first joined the Cognitive Systems Research Laboratory, Tata InfoTech, under Prof. P.V.S. Rao), and since 2006 she has been working as a researcher at TCS Research and Innovation - Mumbai. Her research interest is in speech signal processing, and she has been working extensively on building systems that can process all aspects of spoken language. More recently, she has been researching non-linguistic aspects of speech processing, such as speaking rate and emotion detection from speech.


Sunil Kumar Kopparapu (Senior Member, IEEE; ACM Senior Member India) obtained his doctoral degree in Electrical Engineering from the Indian Institute of Technology Bombay, Mumbai, India in 1997. His thesis “Modular integration for low-level and high-level vision problems in a multi-resolution framework” provided a broad framework to enable reliable and fast vision processing.


Between 1997 and 2000, he was with the Automation Group of the Commonwealth Scientific and Industrial Research Organisation (CSIRO), Brisbane, Australia, working on practical image processing and 3D vision problems, mainly for the benefit of the Australian mining industry.


Prior to joining the Cognitive Systems Research Laboratory (CSRL), Tata InfoTech Limited, as a senior research member in 2001, he worked on developing a virtual self-line of e-commerce products for the R&D Group at Aquila Technologies Private Limited, India.


In his current role as a principal scientist at TCS Research and Innovation - Mumbai, he is actively working in the areas of speech, script, image, and natural-language processing, with a focus on building usable systems for mass use in Indian conditions. He has co-authored a book titled "Bayesian Approach to Image Interpretation" and, more recently, a Springer Brief on Non-Linguistic Analysis of Call Center Conversations.
