Name: Docker for Data Science
ISBN: 978-1-4842-3012-1

Authors:

Joshua Cook (Santa Monica, USA)

Teaches Docker principles with practical examples
Covers high-performance interactive computing with Jupyter
Presents a unique development method geared toward interactive computing

51k Accesses
26 Citations
5 Altmetric

Table of contents (10 chapters)

Front Matter
Introduction
Docker
Interactive Programming
The Docker Engine
The Dockerfile
Docker Hub
The Opinionated Jupyter Stacks
The Data Stores
Docker Compose
Interactive Software Development
Back Matter

About this book

Learn Docker "infrastructure as code" technology to define a system for performing standard but non-trivial data tasks on medium- to large-scale data sets, using Jupyter as the master controller.

Real-world data sets are often hard to manage. A data set may not fit into memory (RAM) or may require prohibitively long processing. These are significant challenges even for skilled software engineers, and they can render the standard Jupyter system unusable.

As a solution to this problem, Docker for Data Science proposes using Docker. You will learn how to use existing prebuilt public images published by the major open-source projects (Python, Jupyter, Postgres), and how to use the Dockerfile to extend these images to suit your specific purposes. Docker Compose is examined, and you will learn how it can be used to build a linked system with Python churning data behind the scenes and Jupyter managing these background tasks. Best practices for using existing images are explored, as well as for developing your own images to deploy state-of-the-art machine learning and optimization algorithms.

What You'll Learn

Master interactive development using the Jupyter platform
Run and build Docker containers from scratch and from publicly available open-source images
Write infrastructure as code using the docker-compose tool and its docker-compose.yml file type
Deploy a multi-service data science application across a cloud-based system

Who This Book Is For

Data scientists, machine learning engineers, artificial intelligence researchers, Kagglers, and software developers

About the author

Joshua Cook is a mathematician. He writes code in Bash, C, and Python and has done pure and applied computational work in geo-spatial predictive modeling, quantum mechanics, semantic search, and artificial intelligence. He also has 10 years of experience teaching mathematics at the secondary and post-secondary levels. His research interests lie in high-performance computing, interactive computing, feature extraction, and reinforcement learning. He is always willing to discuss orthogonality or to explain why Fortran is the language of the future over a warm or cold beverage.

Bibliographic Information

Book Title: Docker for Data Science
Book Subtitle: Building Scalable and Extensible Data Infrastructure Around the Jupyter Notebook Server
Authors: Joshua Cook
DOI: https://doi.org/10.1007/978-1-4842-3012-1
Publisher: Apress, Berkeley, CA
eBook Packages: Professional and Applied Computing, Apress Access Books, Professional and Applied Computing (R0)
Softcover ISBN: 978-1-4842-3011-4 (published 25 August 2017)
eBook ISBN: 978-1-4842-3012-1 (published 23 August 2017)
Edition Number: 1
Number of Pages: XXI, 257
Number of Illustrations: 21 b/w illustrations, 76 illustrations in colour
Topics: Big Data, Artificial Intelligence, Open Source, Python
