Practical Hadoop Ecosystem

A Definitive Guide to Hadoop-Related Frameworks and Tools

Authors: Vohra, Deepak

  • In-depth book covering topics that are not covered elsewhere, and how they all work together 
  • Provides practical examples
  • Presents one of the two most popular big data frameworks, Hadoop
see more benefits

Buy this book

eBook 29,74 €
price for Germany (gross)
  • ISBN 978-1-4842-2199-0
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover 39,58 €
price for Germany (gross)
  • ISBN 978-1-4842-2198-3
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.
About this book

Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.


What You Will Learn:
  • Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
  • Run a MapReduce job
  • Store data with Apache Hive, and Apache HBase
  • Index data in HDFS with Apache Solr
  • Develop a Kafka messaging system
  • Stream Logs to HDFS with Apache Flume
  • Transfer data from MySQL database to Hive, HDFS, and HBase with Sqoop
  • Create a Hive table over Apache Solr
  • Develop a Mahout User Recommender System

Who This Book Is For:
Apache Hadoop developers. Pre-requisite knowledge of Linux and some knowledge of Hadoop is required.

About the authors

Deepak Vohra is a coder, developer, programmer, book author, and technical reviewer.

Table of contents (11 chapters)

Buy this book

eBook 29,74 €
price for Germany (gross)
  • ISBN 978-1-4842-2199-0
  • Digitally watermarked, DRM-free
  • Included format: EPUB, PDF
  • ebooks can be used on all reading devices
  • Immediate eBook download after purchase
Softcover 39,58 €
price for Germany (gross)
  • ISBN 978-1-4842-2198-3
  • Free shipping for individuals worldwide
  • Usually dispatched within 3 to 5 business days.

Services for this book

Loading...

Bibliographic Information

Bibliographic Information
Book Title
Practical Hadoop Ecosystem
Book Subtitle
A Definitive Guide to Hadoop-Related Frameworks and Tools
Authors
Copyright
2016
Publisher
Apress
Copyright Holder
Deepak Vohra
Distribution Rights
Standard Apress Distribution
eBook ISBN
978-1-4842-2199-0
DOI
10.1007/978-1-4842-2199-0
Softcover ISBN
978-1-4842-2198-3
Edition Number
1
Number of Pages
XX, 421
Number of Illustrations and Tables
18 b/w illustrations, 293 illustrations in colour
Topics