Read While You Wait - Get immediate ebook access, if available*, when you order a print book

Next-Generation Machine Learning with Spark

Covers XGBoost, LightGBM, Spark NLP, Distributed Deep Learning with Keras, and More

Authors: Quinto, Butch

Free Preview
  • For the latest version of Spark and Spark MLlib
  • Covers powerful third-party machine learning algorithms and libraries not available in the standard Spark MLlib library such as XGBoost4J-Spark, LightGBM on Spark, Isolation Forest, Spark NLP, and Stanford CoreNLP
  • Includes distributed deep learning using convolutional neural networks with Spark and Keras
see more benefits

Buy this book

eBook $29.99
price for USA (gross)
  • ISBN 978-1-4842-5669-5
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $39.99
price for USA
  • ISBN 978-1-4842-5668-8
  • Free shipping for individuals worldwide
  • Immediate ebook access, if available*, with your print order
  • Usually dispatched within 3 to 5 business days.
About this book

Access real-world documentation and examples for the Spark platform for building large-scale, enterprise-grade machine learning applications.

The past decade has seen an astonishing series of advances in machine learning. These breakthroughs are disrupting our everyday life and making an impact across every industry.

Next-Generation Machine Learning with Spark provides a gentle introduction to Spark and Spark MLlib and advances to more powerful, third-party machine learning algorithms and libraries beyond what is available in the standard Spark MLlib library. By the end of this book, you will be able to apply your knowledge to real-world use cases through dozens of practical examples and insightful explanations. 


What You Will Learn

  • Be introduced to machine learning, Spark, and Spark MLlib 2.4.x
  • Achieve lightning-fast gradient boosting on Spark with the XGBoost4J-Spark and LightGBM libraries
  • Detect anomalies with the Isolation Forest algorithm for Spark
  • Use the Spark NLP and Stanford CoreNLP libraries that support multiple languages
  • Optimize your ML workload with the Alluxio in-memory data accelerator for Spark
  • Use GraphX and GraphFrames for Graph Analysis
  • Perform image recognition using convolutional neural networks
  • Utilize the Keras framework and distributed deep learning libraries with Spark 


Who This Book Is For

Data scientists and machine learning engineers who want to take their knowledge to the next level and use Spark and more powerful, next-generation algorithms and libraries beyond what is available in the standard Spark MLlib library; also serves as a primer for aspiring data scientists and engineers who need an introduction to machine learning, Spark, and Spark MLlib.

About the authors

Butch Quinto is founder and Chief AI Officer at Intelvi AI, an artificial intelligence company that develops cutting-edge solutions for the defense, industrial, and transportation industries. As Chief AI Officer, Butch heads strategy, innovation, research, and development. Previously, he was the Director of Artificial Intelligence at a leading technology firm and Chief Data Officer at an AI startup. As Director of Analytics at Deloitte, Butch led the development of several enterprise-grade AI and IoT solutions as well as strategy, business development, and venture capital due diligence. He has more than 20 years of experience in various technology and leadership roles in several industries including banking and finance, telecommunications, government, utilities, transportation, e-commerce, retail, manufacturing, and bioinformatics. Butch is the author of Next-Generation Big Data (Apress) and a member of the Association for the Advancement of Artificial Intelligence and the American Association for the Advancement of Science. 

Table of contents (7 chapters)

Table of contents (7 chapters)

Buy this book

eBook $29.99
price for USA (gross)
  • ISBN 978-1-4842-5669-5
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover $39.99
price for USA
  • ISBN 978-1-4842-5668-8
  • Free shipping for individuals worldwide
  • Immediate ebook access, if available*, with your print order
  • Usually dispatched within 3 to 5 business days.

Services for this book

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Next-Generation Machine Learning with Spark
Book Subtitle
Covers XGBoost, LightGBM, Spark NLP, Distributed Deep Learning with Keras, and More
Authors
Copyright
2020
Publisher
Apress
Copyright Holder
Butch Quinto
eBook ISBN
978-1-4842-5669-5
DOI
10.1007/978-1-4842-5669-5
Softcover ISBN
978-1-4842-5668-8
Edition Number
1
Number of Pages
XIX, 355
Number of Illustrations
67 b/w illustrations
Topics

*immediately available upon purchase as print book shipments may be delayed due to the COVID-19 crisis. ebook access is temporary and does not include ownership of the ebook. Only valid for books with an ebook version. Springer Reference Works are not included.