
Pro Microsoft HDInsight

Hadoop on Windows

By Debarchan Sarkar

  • eBook Price: $41.99
Pro Microsoft HDInsight is a complete guide for deploying and using Apache Hadoop on the Microsoft Windows Azure Platform.


  • ISBN13: 978-1-4302-6055-4
  • 272 Pages
  • User Level: Intermediate to Advanced
  • Publication Date: February 23, 2014
  • Available eBook Formats: EPUB, MOBI, PDF

Related Titles

  • Pro C# 5.0 and the .NET 4.5 Framework
  • Managing Risk and Information Security
  • Pro ASP.NET Web API
  • TouchDevelop
  • Pro HTML5 with Visual Studio 2012
  • High Performance SQL Server
  • Expert Scripting and Automation for SQL Server DBAs
  • Pro Hadoop Data Analytics
  • Practical Microsoft Visual Studio 2015
  • Develop Microsoft HoloLens Apps Now
Full Description

Pro Microsoft HDInsight is a complete guide to deploying and using Apache Hadoop on the Microsoft Windows Azure Platform. The information in this book enables you to process enormous volumes of structured and unstructured data easily using HDInsight, which is Microsoft’s own distribution of Apache Hadoop. Furthermore, the blend of Infrastructure as a Service (IaaS) and Platform as a Service (PaaS) offerings available through Windows Azure lets you take advantage of Hadoop’s processing power without the worry of creating, configuring, maintaining, or managing your own cluster.

With data volumes set to grow explosively, the open source Apache Hadoop framework is gaining traction, and it benefits from a large ecosystem that has grown up around the core functionality of the Hadoop Distributed File System (HDFS™) and Hadoop MapReduce. Pro Microsoft HDInsight equips you with the knowledge, confidence, and technique to configure and manage this ecosystem on Windows Azure. The book is an excellent choice for anyone aspiring to be a data scientist or data engineer, putting you a step ahead in the data mining field.

  • Guides you through installation and configuration of an HDInsight cluster on Windows Azure
  • Provides clear examples of configuring and executing MapReduce jobs
  • Helps you consume data and diagnose errors from the Windows Azure HDInsight Service

What you’ll learn

  • Create and manage HDInsight clusters on Windows Azure
  • Understand the different HDInsight services and configuration files
  • Develop and run MapReduce jobs using .NET and PowerShell
  • Consume data from client applications like Microsoft Excel and Power View
  • Monitor job executions and logs
  • Troubleshoot common problems

Who this book is for

Pro Microsoft HDInsight: Hadoop on Windows is an excellent choice for developers in business intelligence and predictive analysis who want an extra technological edge on the Microsoft Windows and Windows Azure platforms. The book is for people who love to slice and dice data and to identify trends and patterns through analysis, in support of creative and intelligent decision making.

Table of Contents

  1. Introducing HDInsight
  2. Understanding Windows Azure HDInsight Service
  3. Provisioning Your HDInsight Service Cluster
  4. Automating HDInsight Cluster Provisioning
  5. Submitting Jobs to Your HDInsight Cluster
  6. Exploring the HDInsight Name Node
  7. Using Windows Azure HDInsight Emulator
  8. Accessing HDInsight over Hive and ODBC
  9. Consuming HDInsight from Self-Service BI Tools
  10. Integrating HDInsight with SQL Server Integration Services
  11. Logging in HDInsight
  12. Troubleshooting Cluster Deployments
  13. Troubleshooting Job Failures
Source Code/Downloads

Downloads are available to accompany this book.

Your operating system can likely extract zipped downloads automatically, but you may require software such as WinZip for PC, or StuffIt on a Mac.


If you think that you've found an error in this book, please let us know by emailing editorial@apress.com. Any confirmed errata are listed below, so you can check whether your concern has already been addressed.
No errata are currently published.


  1. Big Data Made Easy
  2. Big Data Imperatives
  3. Practical Hadoop Security