Guide to Intelligent Data Analysis

How to Intelligently Make Sense of Real Data

By Michael R. Berthold , Christian Borgelt , Frank Höppner , Frank Klawonn

Guide to Intelligent Data Analysis Cover Image

This is a systematic overview and classification of tasks in data analysis, methods to solve them and typical problems encountered. The book combines views from classical and non-classical statistics like Bayesian inference and robust statistics.

Full Description

  • ISBN13: 978-1-8488-2259-7
  • 408 Pages
  • User Level: Students
  • Publication Date: June 23, 2010
  • Available eBook Formats: PDF
  • eBook Price: $89.95
Buy eBook Buy Print Book Add to Wishlist
Full Description
Each passing year bears witness to the development of ever more powerful computers, increasingly fast and cheap storage media, and even higher bandwidth data connections. This makes it easy to believe that we can now – at least in principle – solve any problem we are faced with so long as we only have enough data. Yet this is not the case. Although large databases allow us to retrieve many different single pieces of information and to compute simple aggregations, general patterns and regularities often go undetected. Furthermore, it is exactly these patterns, regularities and trends that are often most valuable. To avoid the danger of “drowning in information, but starving for knowledge” the branch of research known as data analysis has emerged, and a considerable number of methods and software tools have been developed. However, it is not these tools alone but the intelligent application of human intuition in combination with computational power, of sound background knowledge with computer-aided modeling, and of critical reflection with convenient automatic model construction, that results in successful intelligent data analysis projects. Guide to Intelligent Data Analysis provides a hands-on instructional approach to many basic data analysis techniques, and explains how these are used to solve data analysis problems. Topics and features: guides the reader through the process of data analysis, following the interdependent steps of project understanding, data understanding, data preparation, modeling, and deployment and monitoring; equips the reader with the necessary information in order to obtain hands-on experience of the topics under discussion; provides a review of the basics of classical statistics that support and justify many data analysis methods, and a glossary of statistical terms; includes numerous examples using R and KNIME, together with appendices introducing the open source software; integrates illustrations and case-study-style examples to support pedagogical exposition. This practical and systematic textbook/reference for graduate and advanced undergraduate students is also essential reading for all professionals who face data analysis problems. Moreover, it is a book to be used following one’s exploration of it. Dr. Michael R. Berthold is Nycomed-Professor of Bioinformatics and Information Mining at the University of Konstanz, Germany. Dr. Christian Borgelt is Principal Researcher at the Intelligent Data Analysis and Graphical Models Research Unit of the European Centre for Soft Computing, Spain. Dr. Frank Höppner is Professor of Information Systems at Ostfalia University of Applied Sciences, Germany. Dr. Frank Klawonn is a Professor in the Department of Computer Science and Head of the Data Analysis and Pattern Recognition Laboratory at Ostfalia University of Applied Sciences, Germany. He is also Head of the Bioinformatics and Statistics group at the Helmholtz Centre for Infection Research, Braunschweig, Germany.
Table of Contents

Table of Contents

  1. Introduction.
  2. Practical Data Analysis: An Example.
  3. Project Understanding.
  4. Data Understanding.
  5. Principles of Modeling.
  6. Data Preparation.
  7. Finding Patterns.
  8. Finding Explanations.
  9. Finding Predictors.
  10. Evaluation and Deployment.
  11. Appendix A: Statistics.
  12. Appendix B: The R Project.
  13. Appendix C: KNIME.
Errata

Please Login to submit errata.

No errata are currently published