Name: Scala Programming for Big Data Analytics
ISBN: 978-1-4842-4810-2

Overview

Authors:

Irfan Elahi ⁰

Irfan Elahi
1. Notting Hill, Australia
View author publications

You can also search for this author in PubMed Google Scholar

Provides coverage of Scala with the prime goal of getting started with Apache Spark development
Covers the end-to-end development life cycle of Scala applications employing industry standards
Includes Scala programs that use the industry-standard practices and the Scala Build Tool

20k Accesses
1 Citations

This is a preview of subscription content, log in via an institution to check access.

Access this book

eBook USD 29.99

Price excludes VAT (USA)

Softcover Book USD 37.99

Price excludes VAT (USA)

Tax calculation will be finalised at checkout

Other ways to access

Licence this eBook for your library

Institutional subscriptions

Table of contents (14 chapters)

Front Matter

Pages i-xxi

Download chapter PDF
Scala Language
- Irfan Elahi
Pages 1-13
Installing Scala
- Irfan Elahi
Pages 15-26
Using the Scala Shell
- Irfan Elahi
Pages 27-44
Variables
- Irfan Elahi
Pages 45-55
Data Types
- Irfan Elahi
Pages 57-79
Conditional Statements
- Irfan Elahi
Pages 81-93
Code Blocks
- Irfan Elahi
Pages 95-101
Functions
- Irfan Elahi
Pages 103-122
Collections
- Irfan Elahi
Pages 123-164
Loops
- Irfan Elahi
Pages 165-173
Classes and Packages
- Irfan Elahi
Pages 175-201
Exception Handling
- Irfan Elahi
Pages 203-221
Building and Packaging
- Irfan Elahi
Pages 223-260
Hello Apache Spark
- Irfan Elahi
Pages 261-299
Back Matter

Pages 301-306

Download chapter PDF

Keywords

About this book

Gain the key language concepts and programming techniques of Scala in the context of big data analytics and Apache Spark. The book begins by introducing you to Scala and establishes a firm contextual understanding of why you should learn this language, how it stands in comparison to Java, and how Scala is related to Apache Spark for big data analytics. Next, you’ll set up the Scala environment ready for examining your first Scala programs. This is followed by sections on Scala fundamentals including mutable/immutable variables, the type hierarchy system, control flow expressions and code blocks.

The author discusses functions at length and highlights a number of associated concepts such as functional programming and anonymous functions. The book then delves deeper into Scala’s powerful collections system because many of Apache Spark’s APIs bear a strong resemblance to Scala collections.

Along the way you’ll see thedevelopment life cycle of a Scala program. This involves compiling and building programs using the industry-standard Scala Build Tool (SBT). You’ll cover guidelines related to dependency management using SBT as this is critical for building large Apache Spark applications. Scala Programming for Big Data Analytics concludes by demonstrating how you can make use of the concepts to write programs that run on the Apache Spark framework. These programs will provide distributed and parallel computing, which is critical for big data analytics.

What You Will Learn

See the fundamentals of Scala as a general-purpose programming language
Understand functional programming and object-oriented programming constructs in Scala
Use Scala collections and functions
Develop, package and run Apache Spark applications for big data analytics

Who ThisBook Is For

Data scientists, data analysts and data engineers who intend to use Apache Spark for large-scale analytics.

Authors and Affiliations

Notting Hill, Australia

Irfan Elahi

About the author

Irfan Elahi is a senior consultant in Deloitte Australia specializing in big data and machine learning. His primary focus lies in using big data and machine learning to support business growth with multifaceted and strong ties to the telecommunications, energy, retail and media industries. He has worked on a number of projects in Australia to design, prototype, develop, and deploy production-grade big data solutions in Amazon Web Services (AWS) and Azure to support a number of use-cases ranging from enterprise data warehousing, ETL offloading, analytics, batch processing and stream processing while employing leading commercial Hadoop solutions such as Cloudera and Hortonworks. He has worked closely with clients’ systems and software engineering teams using DevOps to enhance the continuous integration and continuous deployment (CICD) processes and manage a Hadoop cluster’s operations and security.

In addition to his technology competencies, Irfan has recently presented at the DataWorks Summit in Sydney on the subject of in-memory big data technologies and in a number of meetups all around the world. He also remains involved delivering knowledge-transfer sessions, training and workshops about big data and machine learning, both within his firm and at clients. He also has launched Udemy courses on Apache Spark for big data analytics and R programming for data science with more than 18,000 students from 145 countries enrolled.

Bibliographic Information

Book Title: Scala Programming for Big Data Analytics
Book Subtitle: Get Started With Big Data Analytics Using Apache Spark
Authors: Irfan Elahi
DOI: https://doi.org/10.1007/978-1-4842-4810-2
Publisher: Apress Berkeley, CA
eBook Packages: Business and Management, Apress Access Books, Business and Management (R0)
Softcover ISBN: 978-1-4842-4809-6Published: 06 July 2019
eBook ISBN: 978-1-4842-4810-2Published: 05 July 2019
Edition Number: 1
Number of Pages: XXI, 306
Number of Illustrations: 125 b/w illustrations
Topics: Big Data/Analytics, Programming Languages, Compilers, Interpreters, Open Source

Publish with us

Policies and ethics

Overview

Access this book

Other ways to access

Table of contents (14 chapters)

Front Matter

Back Matter

Keywords

About this book

Authors and Affiliations

Notting Hill, Australia

About the author

Bibliographic Information

Publish with us

Search

Navigation