Jupyter Cookbook

By: Dan Toomey

Overview of this book

Jupyter has garnered strong interest in the data science community of late, as it makes common data processing and analysis tasks much simpler. This book is for data science professionals who want to master various tasks related to Jupyter to create efficient, easy-to-share, scientific applications. The book starts with recipes on installing and running the Jupyter Notebook system on various platforms and configuring the various packages that can be used with it. You will then see how you can use different programming languages and frameworks, such as Python, R, Julia, JavaScript, Scala, and Spark, in your Jupyter Notebook. This book contains intuitive recipes on building interactive widgets to manipulate and visualize data in real time, sharing your code, creating a multi-user environment, and organizing your notebook. You will then get hands-on experience with JupyterLab, microservices, and deploying them on the web. By the end of this book, you will have taken your knowledge of Jupyter to the next level and be able to perform all the key tasks associated with it.

Obtaining a word count from a big-text data source


While a single text file is not a big data source, we will first show how to get a word count from a text file. Then we'll find a larger data file to work with.

How to do it...

We can use this script to see the word counts for a file:

import pyspark

# Create a SparkContext only if one does not already exist
# (a Jupyter kernel started with PySpark often provides 'sc' already)
if 'sc' not in globals():
    sc = pyspark.SparkContext()

# Read the source file as an RDD of lines
text_file = sc.textFile("B09656_09_word_count.ipynb")

# Split each line into words, pair each word with a count of 1,
# then sum the counts for each distinct word
counts = text_file.flatMap(lambda line: line.split(" ")) \
    .map(lambda word: (word, 1)) \
    .reduceByKey(lambda a, b: a + b)

# Bring the results back to the driver and print each (word, count) pair
for x in counts.collect():
    print(x)

When we run this in Jupyter, the output is a series of (word, count) tuples, one per line. The display continues for every individual word that was detected in the source file.
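
The pairs come back in no particular order. If you only want to see the most frequent words, a short variation like the following can be used; it is a minimal sketch that assumes the counts RDD from the script above and simply orders the pairs by their count:

# A minimal sketch (not part of the original recipe): print the ten most
# frequent words by ordering the (word, count) pairs on the count, descending
top_ten = counts.takeOrdered(10, key=lambda pair: -pair[1])
for word, count in top_ten:
    print(word, count)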

How it works...

The script starts with a standard preamble. Every Spark program needs a context to work with; the context defines settings such as the number of threads to use. Here we rely on the defaults. It's important to note that Spark will automatically utilize underlying multiple...
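
As an illustration of configuring the context instead of relying on the defaults, a sketch such as the following could be used; the master string local[4] (four local worker threads) and the application name are illustrative assumptions, not values from the recipe:

import pyspark

# A minimal sketch, assuming four local worker threads and an explicit
# application name (both values are illustrative, not from the recipe)
conf = pyspark.SparkConf().setMaster("local[4]").setAppName("WordCount")
sc = pyspark.SparkContext(conf=conf)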