  • Book
  • © 2016

Practical Hadoop Ecosystem

A Definitive Guide to Hadoop-Related Frameworks and Tools

Apress

Authors: Deepak Vohra

  • In-depth book covering topics that are not covered elsewhere, and how they all work together
  • Provides practical examples
  • Presents one of the two most popular big data frameworks, Hadoop


Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 54.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info


Table of contents (11 chapters)

  1. Front Matter (pages i-xx)
  2. Fundamentals
    1. Front Matter (page 1)
    2. Introduction (Deepak Vohra, pages 3-162)
    3. HDFS and MapReduce (Deepak Vohra, pages 163-205)
  3. Storing & Querying
    1. Front Matter (page 207)
    2. Apache Hive (Deepak Vohra, pages 209-231)
    3. Apache HBase (Deepak Vohra, pages 233-257)
  4. Bulk Transferring & Streaming
    1. Front Matter (page 259)
    2. Apache Sqoop (Deepak Vohra, pages 261-286)
    3. Apache Flume (Deepak Vohra, pages 287-300)
  5. Serializing
    1. Front Matter (page 301)
    2. Apache Avro (Deepak Vohra, pages 303-323)
    3. Apache Parquet (Deepak Vohra, pages 325-335)
  6. Messaging & Indexing
    1. Front Matter (page 337)
    2. Apache Kafka (Deepak Vohra, pages 339-347)
    3. Apache Solr (Deepak Vohra, pages 349-376)
    4. Apache Mahout (Deepak Vohra, pages 377-414)
  7. Back Matter (pages 415-421)

About this book

Learn how to use the Apache Hadoop projects, including MapReduce, HDFS, Apache Hive, Apache HBase, Apache Kafka, Apache Mahout, and Apache Solr. From setting up the environment to running sample applications, each chapter in this book is a practical tutorial on using an Apache Hadoop ecosystem project.

While several books on Apache Hadoop are available, most are based on the main projects, MapReduce and HDFS, and none discusses the other Apache Hadoop ecosystem projects and how they all work together as a cohesive big data development platform.




What You Will Learn:

  • Set up the environment in Linux for Hadoop projects using Cloudera Hadoop Distribution CDH 5
  • Run a MapReduce job
  • Store data with Apache Hive and Apache HBase
  • Index data in HDFS with Apache Solr
  • Develop a Kafka messaging system
  • Stream logs to HDFS with Apache Flume
  • Transfer data from a MySQL database to Hive, HDFS, and HBase with Sqoop
  • Create a Hive table over Apache Solr
  • Develop a Mahout User Recommender System




Who This Book Is For:

Apache Hadoop developers. Prerequisite knowledge of Linux and some knowledge of Hadoop are required.

Authors and Affiliations

  • Deepak Vohra, White Rock, Canada

About the author

Deepak Vohra is a coder, developer, programmer, book author, and technical reviewer.

Bibliographic Information

  • Book Title: Practical Hadoop Ecosystem

  • Book Subtitle: A Definitive Guide to Hadoop-Related Frameworks and Tools

  • Authors: Deepak Vohra

  • DOI: https://doi.org/10.1007/978-1-4842-2199-0

  • Publisher: Apress, Berkeley, CA

  • eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)

  • Copyright Information: Deepak Vohra 2016

  • Softcover ISBN: 978-1-4842-2198-3 (Published: 01 October 2016)

  • eBook ISBN: 978-1-4842-2199-0 (Published: 30 September 2016)

  • Edition Number: 1

  • Number of Pages: XX, 421

  • Number of Illustrations: 18 b/w illustrations, 293 illustrations in colour

  • Topics: Big Data, Database Management
