Skip to main content
  • Book
  • © 2016

Big Data SMACK

A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka

Apress
  • The first book presenting the SMACK stack
  • A practical guide teaching how to incorporate big data
  • Covers the full stack of big data architecture, discussing the practical benefits of each technology

Buy it now

Buying options

eBook USD 29.99
Price excludes VAT (USA)
  • Available as EPUB and PDF
  • Read on any device
  • Instant download
  • Own it forever
Softcover Book USD 39.99
Price excludes VAT (USA)
  • Compact, lightweight edition
  • Dispatched in 3 to 5 business days
  • Free shipping worldwide - see info

Tax calculation will be finalised at checkout

Other ways to access

This is a preview of subscription content, log in via an institution to check for access.

Table of contents (11 chapters)

  1. Front Matter

    Pages i-xxv
  2. Introduction

    1. Front Matter

      Pages 1-1
    2. Big Data, Big Challenges

      • Raul Estrada, Isaac Ruiz
      Pages 3-7
    3. Big Data, Big Solutions

      • Raul Estrada, Isaac Ruiz
      Pages 9-16
  3. Playing SMACK

    1. Front Matter

      Pages 17-17
    2. The Language: Scala

      • Raul Estrada, Isaac Ruiz
      Pages 19-40
    3. The Model: Akka

      • Raul Estrada, Isaac Ruiz
      Pages 41-66
    4. Storage: Apache Cassandra

      • Raul Estrada, Isaac Ruiz
      Pages 67-95
    5. The Engine: Apache Spark

      • Raul Estrada, Isaac Ruiz
      Pages 97-130
    6. The Manager: Apache Mesos

      • Raul Estrada, Isaac Ruiz
      Pages 131-163
    7. The Broker: Apache Kafka

      • Raul Estrada, Isaac Ruiz
      Pages 165-203
  4. Improving SMACK

    1. Front Matter

      Pages 205-205
    2. Fast Data Patterns

      • Raul Estrada, Isaac Ruiz
      Pages 207-224
    3. Data Pipelines

      • Raul Estrada, Isaac Ruiz
      Pages 225-250
    4. Glossary

      • Raul Estrada, Isaac Ruiz
      Pages 251-258
  5. Back Matter

    Pages 259-264

About this book

Learn how to integrate full-stack open source big data architecture and to choose the correct technology—Scala/Spark, Mesos, Akka, Cassandra, and Kafka—in every layer. 

Big data architecture is becoming a requirement for many different enterprises. So far, however, the focus has largely been on collecting, aggregating, and crunching large data sets in a timely manner. In many cases now, organizations need more than one paradigm to perform efficient analyses.

Big Data SMACK explains each of the full-stack technologies and, more importantly, how to best integrate them. It provides detailed coverage of the practical benefits of these technologies and incorporates real-world examples in every situation. This book focuses on the problems and scenarios solved by the architecture, as well as the solutions provided by every technology. It covers the six main concepts of big data architecture and how integrate, replace, and reinforce every layer:

  • The language: Scala
  • The engine: Spark (SQL, MLib, Streaming, GraphX)
  • The container: Mesos, Docker
  • The view: Akka
  • The storage: Cassandra
  • The message broker: Kafka
  • What You Will Learn:

    • Make big data architecture without using complex Greek letter architectures
    • Build a cheap but effective cluster infrastructure
    • Make queries, reports, and graphs that business demands
    • Manage and exploit unstructured and No-SQL data sources
    • Use tools to monitor the performance of your architecture
    • Integrate all technologies and decide which ones replace and which ones reinforce

    Who This Book Is For:

    Developers, data architects, and data scientists looking to integrate the most successful big data open stack architecture and to choose the correct technology in every layer

    Authors and Affiliations

    • Mexico City, Mexico

      Raul Estrada, Isaac Ruiz

    About the authors

    Raúl Estrada is the co-founder of Treu Technologies, an enterprise for Social Data Marketing and BigData research. He is an Enterprise Architect with more than 15 years of experience in cluster management and Enterprise Software. Prior to founding Treu Technologies, Estrada worked as an Enterprise Architect in Application Servers & evangelist for Oracle Inc. He loves functional languages like Elixir and Scala, and also has a Master of Computer Science degree.

    Isaac Ruiz has been a Java programmer since 2001, and a consultant and architect since 2003. He has participated in projects of different areas and varied scopes (education, communications, retail, and others). Ruiz specializes in systems integration and has participated in projects mainly related to the financial sector. He is a supporter of free software. Ruiz likes to experiment with new technologies (frameworks, languages, methods).

    Bibliographic Information

    • Book Title: Big Data SMACK

    • Book Subtitle: A Guide to Apache Spark, Mesos, Akka, Cassandra, and Kafka

    • Authors: Raul Estrada, Isaac Ruiz

    • DOI: https://doi.org/10.1007/978-1-4842-2175-4

    • Publisher: Apress Berkeley, CA

    • eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)

    • Copyright Information: Raul Estrada and Isaac Ruiz 2016

    • Softcover ISBN: 978-1-4842-2174-7Published: 29 September 2016

    • eBook ISBN: 978-1-4842-2175-4Published: 29 September 2016

    • Edition Number: 1

    • Number of Pages: XXV, 264

    • Number of Illustrations: 22 b/w illustrations, 52 illustrations in colour

    • Topics: Big Data, Database Management, Data Structures

    Buy it now

    Buying options

    eBook USD 29.99
    Price excludes VAT (USA)
    • Available as EPUB and PDF
    • Read on any device
    • Instant download
    • Own it forever
    Softcover Book USD 39.99
    Price excludes VAT (USA)
    • Compact, lightweight edition
    • Dispatched in 3 to 5 business days
    • Free shipping worldwide - see info

    Tax calculation will be finalised at checkout

    Other ways to access