
Multimodal Processing and Interaction

Audio, Video, Text

By Petros Maragos, Alexandros Potamianos, Patrick Gros

  • eBook Price: $159.00


This volume presents high-quality research ideas and results from theoretical, algorithmic, and application viewpoints. It focuses on interaction with multimedia content, with special emphasis on multimodal interfaces for accessing multimedia information.


  • ISBN13: 978-0-3877-6315-6
  • 400 Pages
  • User Level: Science
  • Publication Date: December 16, 2008
  • Available eBook Formats: PDF
Full Description
Multimodal Processing and Interaction: Audio, Video and Text presents high-quality, state-of-the-art research ideas and results from theoretical, algorithmic, and application viewpoints. This edited volume contains both state-of-the-art reviews and original contributions by leading experts in the scientific and technological field of multimedia. It grew out of a four-year collaboration among research groups participating in the European Network of Excellence on Multimedia Understanding, Semantics, Computation and Learning (MUSCLE). The book covers a broad spectrum of novel perspectives, analytic tools, algorithms, design practices, and applications in multimedia science and engineering, with emphasis on multimodal integration and modality fusion. It also contains contributions on interaction with multimedia, especially multimodal interfaces for accessing multimedia content. The book is designed for a professional audience of practitioners and researchers in industry and academia, and is also suitable for advanced-level students in computer science and engineering.
Table of Contents


  1. Preface.
  2. Tutorial Review of State of the Art.
  3. Cross-Modal Integration/Interaction for Performance Improving in Multimedia: State-of-the-Art Review.
  4. Human-Computer Interfaces for Multimedia Retrieval: State of the Art.
  5. New Research Directions: Integrated Multimedia Analysis And Recognition.
  6. Stochastic Models for Multimodal Video Analysis.
  7. Adaptive Multimodal Fusion by Uncertainty Compensation with Application to Audio-Visual Speech Recognition.
  8. Action Recognition in Multimedia Streams.
  9. Surveillance Using Both Video and Audio.
  10. Movie Analysis with Emphasis to Dialogue Detections.
  11. Audio-Visual Attention Modeling and Salient Event Detection.
  12. New Research Directions: Searching Multimedia Content.
  13. Interactive Image Retrieval using a Hybrid Visual and Conceptual Content Representation.
  14. Multimodal Analysis of Text and Audio Features for Music Information Retrieval.
  15. Toward the Integration of NLP and ASR: POS Tagging and Transcription.
  16. Design Principles for Multimodal Spoken Dialogue Systems.
  17. Eye Tracking for Image Retrieval.
  18. Natural/Novel User Interfaces for Mobile Devices.

If you think you have found an error in this book, please let us know by emailing editorial@apress.com. Any confirmed errata will be listed below, so you can check whether your concern has already been addressed.
No errata are currently published.

