Apress

Scalable Big Data Architecture

A practitioner's guide to choosing relevant Big Data architecture

  • Book
  • © 2016

Overview

  • This book not only gives a landscape of the Big Data ecosystem, but also guides readers on the reasons to use a given project for a Big Data use case
  • A step-by-step guide that walks you through common Big Data patterns, helping you understand the context and perimeter to focus on for your specific needs
  • This book helps readers see how Big Data can solve data processing problems through real industry use cases
  • Readers will understand the limits of each pattern and will then be able to compose them into a heterogeneous architecture
  • Understanding the fundamentals of machine learning and how to handle it

Table of contents (7 chapters)

About this book

This book highlights the different types of data architecture and illustrates the many possibilities hidden behind the term "Big Data", from the use of NoSQL databases to the deployment of stream analytics architectures, machine learning, and governance.

Scalable Big Data Architecture covers real-world, concrete industry use cases that leverage complex distributed applications, which involve web applications, RESTful APIs, and high-throughput processing of large amounts of data stored in highly scalable NoSQL data stores such as Couchbase and Elasticsearch. This book demonstrates how data processing can be done at scale, from the use of NoSQL datastores to the combination of Big Data distributions.

When the data processing is too complex and involves different processing topologies, such as long-running jobs, stream processing, correlation of multiple data sources, and machine learning, it’s often necessary to delegate the load to Hadoop or Spark and use the NoSQL store to serve processed data in real time.

This book shows you how to choose a relevant combination of Big Data technologies available within the Hadoop ecosystem. It focuses on processing long jobs, architecture, stream data patterns, log analysis, and real-time analytics. Every pattern is illustrated with practical examples that use different open source projects such as Logstash, Spark, and Kafka.

Traditional data infrastructures are built for digesting data and rendering synthesis and analytics from large amounts of data. This book helps you understand why you should consider using machine learning algorithms early in the project, before being overwhelmed by the constraints of dealing with the high throughput of Big Data.

Scalable Big Data Architecture is for developers, data architects, and data scientists looking for a better understanding of how to choose the most relevant pattern for a Big Data project and which tools to integrate into that pattern.

About the author

Bahaaldine Azarmi is the co-founder and CTO of reach five, a Social Data Marketing Platform. Bahaaldine has a strong background and expertise in REST APIs and Big Data architecture. Prior to founding reach five, Bahaaldine worked as a technical architect and evangelist for large software vendors such as Oracle and Talend. He has a master’s degree in computer science from the Polytech’Paris engineering school in Paris.
