Name: Scalable Big Data Architecture
ISBN: 978-1-4842-1326-1

Overview

Authors:

Bahaaldine Azarmi

Bahaaldine Azarmi

View author publications

You can also search for this author in PubMed Google Scholar

This book not only gives a landscape of Big Data ecosystem, but will guide the readers on the reasons to use a project regarding a Big Data use case, as well
A step by step guide that will walk you through common Big Data patterns--helping you to understand the context & perimeter one should focus on for their specific needs
This book help the readers to get visibility on how Big Data can solve data processing problem through real industry use cases
The readers will understand the limits of each pattern, and then will be able to compose them into a heterogeneous architecture
Understanding the fundamentals of machine learning and how to handle it

22k Accesses
8 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 39.99

Price excludes VAT (USA)

Softcover Book USD 54.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (7 chapters)

Front Matter

Pages i-xiii

Download chapter PDF
The Big (Data) Problem
- Bahaaldine Azarmi
Pages 1-16
Early Big Data with NoSQL
- Bahaaldine Azarmi
Pages 17-40
Defining the Processing Topology
- Bahaaldine Azarmi
Pages 41-56
Streaming Data
- Bahaaldine Azarmi
Pages 57-80
Querying and Analyzing Patterns
- Bahaaldine Azarmi
Pages 81-103
Learning From Your Data?
- Bahaaldine Azarmi
Pages 105-121
Governance Considerations
- Bahaaldine Azarmi
Pages 123-137
Back Matter

Pages 139-142

Download chapter PDF

Keywords

About this book

This book highlights the different types of data architecture and illustrates the many possibilities hidden behind the term "Big Data", from the usage of No-SQL databases to the deployment of stream analytics architecture, machine learning, and governance.

Scalable Big Data Architecture covers real-world, concrete industry use cases that leverage complex distributed applications , which involve web applications, RESTful API, and high throughput of large amount of data stored in highly scalable No-SQL data stores such as Couchbase and Elasticsearch. This book demonstrates how data processing can be done at scale from the usage of NoSQL datastores to the combination of Big Data distribution.

When the data processing is too complex and involves different processing topology like long running jobs, stream processing, multiple data sources correlation, and machine learning, it’s often necessary to delegate the load to Hadoop or Spark and use the No-SQLto serve processed data in real time.

This book shows you how to choose a relevant combination of big data technologies available within the Hadoop ecosystem. It focuses on processing long jobs, architecture, stream data patterns, log analysis, and real time analytics. Every pattern is illustrated with practical examples, which use the different open sourceprojects such as Logstash, Spark, Kafka, and so on.

Traditional data infrastructures are built for digesting and rendering data synthesis and analytics from large amount of data. This book helps you to understand why you should consider using machine learning algorithms early on in the project, before being overwhelmed by constraints imposed by dealing with the high throughput of Big data.

Scalable Big Data Architecture is for developers, data architects, and data scientists looking for a better understanding of how to choose the most relevant pattern for a Big Data project and which tools tointegrate into that pattern.

Reviews

About the author

Bahaaldine Azarmi is the co-founder and CTO of reach five, a Social Data Marketing Platform. Bahaaldine has a strong background and expertise skills in REST API and Big Data architecture. Prior to founding reach five, Bahaaldine worked as a technical architect & evangelist for large software vendors such as Oracle & Talend.He has a master’s degree of computer science from Polytech’Paris engineering school, Paris.

Bibliographic Information

Book Title: Scalable Big Data Architecture
Book Subtitle: A practitioners guide to choosing relevant Big Data architecture
Authors: Bahaaldine Azarmi
DOI: https://doi.org/10.1007/978-1-4842-1326-1
Publisher: Apress Berkeley, CA
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)
Softcover ISBN: 978-1-4842-1327-8Published: 30 December 2015
eBook ISBN: 978-1-4842-1326-1Published: 31 December 2015
Edition Number: 1
Number of Pages: XIII, 141
Number of Illustrations: 70 b/w illustrations
Topics: Big Data, Computer Appl. in Administrative Data Processing, Database Management

Publish with us

Policies and ethics

Scalable Big Data Architecture

Overview

Access this book

Other ways to access

Table of contents (7 chapters)

Front Matter

The Big (Data) Problem

Early Big Data with NoSQL

Defining the Processing Topology

Streaming Data

Querying and Analyzing Patterns

Learning From Your Data?

Governance Considerations

Back Matter