Overview
- Explains the latest Scikit-Multiflow framework in detail
- Explains Supervised and Unsupervised Learning for streaming data
- One of the first books in the market on machine learning models for streaming data using Python
Access this book
Tax calculation will be finalised at checkout
Other ways to access
Table of contents (4 chapters)
Keywords
About this book
You'll start with an introduction to streaming data, the various challenges associated with it, some of its real-world business applications, and various windowing techniques. You'll then examine incremental and online learning algorithms, and the concept of model evaluation with streaming data and get introduced to the Scikit-Multiflow framework in Python. This is followed by a review of the various change detection/concept drift detection algorithms and the implementation of various datasets using Scikit-Multiflow.
Introduction to the various supervised and unsupervised algorithms for streaming data, and their implementation on various datasets using Python are also covered. The book concludes by briefly covering other open-source tools available for streaming data such as Spark, MOA (Massive Online Analysis), Kafka, and more.
What You'll Learn
- Understand machine learning with streaming data concepts
- Review incremental and online learning
- Develop models for detecting concept drift
- Explore techniques for classification, regression, and ensemble learning in streaming data contexts
- Apply best practices for debugging and validating machine learning models in streaming data context
- Get introduced to other open-source frameworks for handling streaming data.
Who This Book Is For
Machine learning engineers and data science professionals
Authors and Affiliations
About the author
Dr. Sayan Putatunda is an experienced data scientist and researcher. He holds a Ph.D. in Applied Statistics/ Machine Learning from the Indian Institute of Management, Ahmedabad (IIMA) where his research was on streaming data and its applications in the transportation industry. He has a rich experience of working in both senior individual contributor and managerial roles in the data science industry with multiple companies such as Amazon, VMware, Mu Sigma, and more. His research interests are in streaming data, deep learning, machine learning, spatial point processes, and directional statistics. As a researcher, he has multiple publications in top international peer-reviewed journals with reputed publishers. He has presented his work at various reputed international machine learning and statistics conferences. He is also a member of IEEE.
Bibliographic Information
Book Title: Practical Machine Learning for Streaming Data with Python
Book Subtitle: Design, Develop, and Validate Online Learning Models
Authors: Sayan Putatunda
DOI: https://doi.org/10.1007/978-1-4842-6867-4
Publisher: Apress Berkeley, CA
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)
Copyright Information: Sayan Putatunda 2021
Softcover ISBN: 978-1-4842-6866-7Published: 09 April 2021
eBook ISBN: 978-1-4842-6867-4Published: 09 April 2021
Edition Number: 1
Number of Pages: XVI, 118
Number of Illustrations: 16 b/w illustrations
Topics: Machine Learning, Professional Computing