Learning Jupyter

By: Dan Toomey

Overview of this book

Jupyter Notebook is a web-based environment that enables interactive computing in notebook documents. It allows you to create and share documents that contain live code, equations, visualizations, and explanatory text. The Jupyter Notebook system is used extensively in domains such as data cleaning and transformation, numerical simulation, statistical modeling, machine learning, and much more. This book starts with a detailed overview of the Jupyter Notebook system and its installation in different environments. Next, you will learn to integrate the Jupyter system with different programming languages such as R, Python, JavaScript, and Julia, and explore the various versions and packages that are compatible with the Notebook system. Moving ahead, you will master interactive widgets, namespaces, and working with Jupyter in multiuser mode. Towards the end, you will use Jupyter with a big data set and apply all the functionality learned throughout the book.

Spark word count

Now that we have seen some of the functionality, let's explore further. We can use a similar script to count the word occurrences in a file, as follows:

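import pyspark

# Reuse the SparkContext if an earlier cell already created one
if 'sc' not in globals():
    sc = pyspark.SparkContext()

# Read the notebook file as an RDD with one element per line
text_file = sc.textFile("Spark File Words.ipynb")

# Split lines into words, pair each word with 1, then sum the 1s per word
counts = text_file.flatMap(lambda line: line.split(" ")) \
             .map(lambda word: (word, 1)) \
             .reduceByKey(lambda a, b: a + b)

# Bring the (word, count) pairs back to the driver and print them
for x in counts.collect():
    print(x)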

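We have the same preamble as in the previous script. Then we point Spark at the text file. Note that sc.textFile returns an RDD of lines and is evaluated lazily: the file is only read when an action such as collect() runs.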

Once the lines are available, flatMap splits each line into words. The map step then uses a lambda function to tick off each occurrence of a word by emitting a (word, 1) record. The code really does create a new record for every occurrence: the first time a word appears in the stream, a record with a count of 1 is added for that word, and every later appearance adds another record with the same count of 1. The idea is that this process could be split over multiple processors, where each...
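
To make the record-per-occurrence idea concrete, here is a minimal sketch in plain Python (no Spark required) of what the three steps compute. The helper names flat_map_words, map_to_pairs, and reduce_by_key, as well as the sample lines, are hypothetical and chosen only for illustration. They imitate the results of flatMap, map, and reduceByKey on a single machine and are not how Spark implements them.

```python
from collections import defaultdict


def flat_map_words(lines):
    """Imitate flatMap: split every line on single spaces and flatten the result."""
    return [word for line in lines for word in line.split(" ")]


def map_to_pairs(words):
    """Imitate map: emit one (word, 1) record per occurrence of a word."""
    return [(word, 1) for word in words]


def reduce_by_key(pairs):
    """Imitate reduceByKey with addition: sum the values that share a key."""
    totals = defaultdict(int)
    for word, count in pairs:
        totals[word] += count
    return dict(totals)


# Hypothetical input standing in for the lines of the text file
sample_lines = ["to be or not", "to be"]

pairs = map_to_pairs(flat_map_words(sample_lines))
counts = reduce_by_key(pairs)

print(pairs)   # one record per word occurrence, each with a count of 1
print(counts)  # one total per distinct word
```

With this input, pairs holds six records, including two separate ('to', 1) entries, and only the final reduction collapses them into a single total of 2 for "to".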