Skip to main content
  • Book
  • © 2015

Practical Graph Analytics with Apache Giraph

Apress
  • Practical Graph Analytics with Apache Giraph helps you build data mining and machine learning applications using the Apache Foundation’s Giraph framework for graph processing.

  • This is the same framework as used by Facebook, Google, and other social media analytics operations to derive business value from vast amounts of interconnected data points.

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (13 chapters)

  1. Front Matter

    Pages i-xix
  2. Giraph Building Blocks

    1. Front Matter

      Pages 1-1
    2. Introducing Giraph

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 3-16
    3. Modeling Graph Processing Use Cases

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 17-41
    4. The Giraph Programming Model

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 43-70
    5. Giraph Algorithmic Building Blocks

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 71-105
  3. Giraph Overview

    1. Front Matter

      Pages 107-107
    2. Working with Giraph

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 109-136
    3. Giraph Architecture

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 137-162
    4. Graph IO Formats

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 163-186
    5. Beyond the Basic API

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 187-213
  4. Advanced Topics

    1. Front Matter

      Pages 215-215
    2. Exposing Parallelism in Giraph

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 217-239
    3. Advanced IO

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 241-253
    4. Tuning Giraph

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 255-275
    5. Giraph in the Clouda

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 277-293
    6. Install and Configure Giraph and Hadoop

      • Claudio Martella, Roman Shaposhnik, Dionysios Logothetis
      Pages 295-307
  5. Back Matter

    Pages 309-315

About this book

Practical Graph Analytics with Apache Giraph helps you build data mining and machine learning applications using the Apache Foundation’s Giraph framework for graph processing. This is the same framework as used by Facebook, Google, and other social media analytics operations to derive business value from vast amounts of interconnected data points.

Graphs arise in a wealth of data scenarios and describe the connections that are naturally formed in both digital and real worlds. Examples of such connections abound in online social networks such as Facebook and Twitter, among users who rate movies from services like Netflix and Amazon Prime, and are useful even in the context of biological networks for scientific research. Whether in the context of business or science, viewing data as connected adds value by increasing the amount of information available to be drawn from that data and put to use in generating new revenue or scientific opportunities.

Apache Giraph offers a simple yet flexible programming model targeted to graph algorithms and designed to scale easily to accommodate massive amounts of data. Originally developed at Yahoo!, Giraph is now a top top-level project at the Apache Foundation, and it enlists contributors from companies such as Facebook, LinkedIn, and Twitter. Practical Graph Analytics with Apache Giraph brings the power of Apache Giraph to you, showing how to harness the power of graph processing for your own data by building sophisticated graph analytics applications using the very same framework that is relied upon by some of the largest players in the industry today.

About the authors

Roman Shaposhnik is a vice-president and one of the lead developers of Apache Bigtop, a 100% open source and community-driven big data management distribution built on top of Apache Hadoop. He has been working on making Hadoop ecosystem components more accessible and easier to use, and he has contributed to a wide array of Apache projects from Avro to Zookeeper. In addition to his day job building Data Fabric APIs at Pivotal Inc., Roman currently serves as a vice-president of Apache Incubator, helping exciting and new open source projects join the Apache family.

Bibliographic Information

Buy it now

Buying options

eBook USD 39.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 49.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access