Name: Data Mining Algorithms in C++
ISBN: 978-1-4842-3315-3

Authors:

Timothy Masters ⁰

Timothy Masters
1. Ithaca, USA
View author publications

You can also search for this author in PubMed Google Scholar

An expert-driven data mining and algorithms in C++ book
Data mining is an important topic in big data
Algorithms are also a critical topic of growing importance

14k Accesses
2 Citations

Buy it now

eBook USD 59.99

Price excludes VAT (USA)

Softcover Book USD 79.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Learn about institutional subscriptions

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (5 chapters)

Front Matter

Pages i-xiv

PDF
Information and Entropy
- Timothy Masters
Pages 1-73
Screening for Relationships
- Timothy Masters
Pages 75-166
Displaying Relationship Anomalies
- Timothy Masters
Pages 167-184
Fun with Eigenvectors
- Timothy Masters
Pages 185-265
Using the DATAMINE Program
- Timothy Masters
Pages 267-279
Back Matter

Pages 281-286

PDF

About this book

Discover hidden relationships among the variables in your data, and learn how to exploit these relationships. This book presents a collection of data-mining algorithms that are effective in a wide variety of prediction and classification applications. All algorithms include an intuitive explanation of operation, essential equations, references to more rigorous theory, and commented C++ source code.

Many of these techniques are recent developments, still not in widespread use. Others are standard algorithms given a fresh look. In every case, the focus is on practical applicability, with all code written in such a way that it can easily be included into any program. The Windows-based DATAMINE program lets you experiment with the techniques before incorporating them into your own work.

What You'll Learn

Use Monte-Carlo permutation tests to provide statistically sound assessments of relationships present in your data
Discover how combinatorially symmetric cross validation reveals whether your model has true power or has just learned noise by overfitting the data
Work with feature weighting as regularized energy-based learning to rank variables according to their predictive power when there is too little data for traditional methods
See how the eigenstructure of a dataset enables clustering of variables into groups that exist only within meaningful subspaces of the data
Plot regions of the variable space where there is disagreement between marginal and actual densities, or where contribution to mutual information is high

Who This Book Is For

Anyone interested in discovering and exploiting relationships among variables. Although all code examples are written in C++, the algorithms are described in sufficient detail that they can easily be programmed in any language.

Keywords

Authors and Affiliations

Ithaca, USA

Timothy Masters

About the author

Timothy Masters has a PhD in statistics and is an experienced programmer. His dissertation was in image analysis. His career moved in the direction of signal processing, and for the last 25 years he's been involved in the development of automated trading systems in various financial markets.

Bibliographic Information

Book Title: Data Mining Algorithms in C++
Book Subtitle: Data Patterns and Algorithms for Modern Applications
Authors: Timothy Masters
DOI: https://doi.org/10.1007/978-1-4842-3315-3
Publisher: Apress Berkeley, CA
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)
Softcover ISBN: 978-1-4842-3314-6Published: 19 December 2017
eBook ISBN: 978-1-4842-3315-3Published: 15 December 2017
Edition Number: 1
Number of Pages: XIV, 286
Topics: Programming Languages, Compilers, Interpreters, Big Data, Data Mining and Knowledge Discovery, Programming Techniques, Algorithms

Publish with us

Policies and ethics

Authors:

Sections

Buy it now

Buying options

Other ways to access

Table of contents (5 chapters)

Front Matter

Information and Entropy

Screening for Relationships

Displaying Relationship Anomalies

Fun with Eigenvectors

Using the DATAMINE Program

Back Matter

About this book

Keywords

Authors and Affiliations

Ithaca, USA

About the author

Bibliographic Information

Publish with us

Buy it now

Buying options

Other ways to access

Search

Navigation