Network Science with Python

By : David Knickerbocker

Network Science with Python

By: David Knickerbocker

Overview of this book

Network analysis is often taught with tiny or toy data sets, leaving you with a limited scope of learning and practical usage. Network Science with Python helps you extract relevant data, draw conclusions and build networks using industry-standard – practical data sets. You’ll begin by learning the basics of natural language processing, network science, and social network analysis, then move on to programmatically building and analyzing networks. You’ll get a hands-on understanding of the data source, data extraction, interaction with it, and drawing insights from it. This is a hands-on book with theory grounding, specific technical, and mathematical details for future reference. As you progress, you’ll learn to construct and clean networks, conduct network analysis, egocentric network analysis, community detection, and use network data with machine learning. You’ll also explore network analysis concepts, from basics to an advanced level. By the end of the book, you’ll be able to identify network data and use it to extract unconventional insights to comprehend the complex world around you.

Preface

Who this book is for

What this book covers

Download the example code files

Conventions used

Get in touch

Share Your Thoughts

Download a free PDF copy of this book

Part 1: Getting Started with Natural Language Processing and Networks

Free Chapter

Chapter 1: Introducing Natural Language Processing

Technical requirements

What is NLP?

Why NLP in a network analysis book?

A very brief history of NLP

How has NLP helped me?

Common uses for NLP

Advanced uses of NLP

How can a beginner get started with NLP?

Summary

Chapter 2: Network Analysis

The confusion behind networks

What is this network stuff?

Resources for learning about network analysis

Common network use cases

Advanced network use cases

Getting started with networks

Summary

Further reading

Chapter 3: Useful Python Libraries

Technical requirements

Using notebooks

Data analysis and processing

Data visualization

NLP

Network analysis and visualization

NetworkX

Summary

Part 2: Graph Construction and Cleanup

Chapter 4: NLP and Network Synergy

Technical requirements

Why are we learning about NLP in a network book?

Asking questions to tell a story

Introducing web scraping

Choosing between libraries, APIs, and source data

Using NLTK for PoS tagging

Using spaCy for PoS tagging and NER

Converting entity lists into network data

Converting network data into networks

Doing a network visualization spot check

Additional NLP and network considerations

Summary

Chapter 5: Even Easier Scraping!

Technical requirements

Why cover Requests and BeautifulSoup?

Getting started with Newspaper3k

Introducing the Twitter Python Library

Summary

Chapter 6: Graph Construction and Cleaning

Technical requirements

Creating a graph from an edge list

Listing nodes

Removing nodes

Quick visual inspection

Adding nodes

Renaming nodes

Removing edges

Persisting the network

Simulating an attack

Summary

Part 3: Network Science and Social Network Analysis

Chapter 7: Whole Network Analysis

Technical requirements

Creating baseline WNA questions

WNA in action

Comparing centralities

Visualizing subgraphs

Investigating islands and continents – connected components

Understanding layers with k_core and k_corona

Challenge yourself!

Summary

Chapter 8: Egocentric Network Analysis

Technical requirements

Egocentric network analysis

Investigating ego nodes and connections

Identifying other research opportunities

Summary

Chapter 9: Community Detection

Technical requirements

Introducing community detection

Getting started with community detection

Exploring connected components

Using the Louvain method

Using label propagation

Using the Girvan-Newman algorithm

Other approaches to community detection

Summary

Chapter 10: Supervised Machine Learning on Network Data

Technical requirements

Introducing ML

Beginning with ML

Data preparation and feature engineering

Selecting a model

Preparing the data

Training and validating the model

Model insights

Other use cases

Summary

Chapter 11: Unsupervised Machine Learning on Network Data

Technical requirements

What is unsupervised ML?

Introducing Karate Club

Network science options

Uses of unsupervised ML on network data

Constructing a graph

Community detection in action

Graph embeddings in action

Using embeddings in supervised ML

Summary

Index

Why subscribe?

Other Books You May Enjoy

Packt is searching for authors like you

Share Your Thoughts

Download a free PDF copy of this book

Customer Reviews

5 star

4 star

3 star

2 star

1 star

Introducing ML

ML is a set of techniques that enable computers to learn from patterns and behavior in data. It is often said that there are three different kinds of ML: Supervised, Unsupervised, and Reinforcement learning.

In supervised ML, an answer – called a label – is provided with the data to allow for an ML model to learn the patterns that will allow it to predict the correct answer. To put it simply, you give the model data and an answer, and it figures out how to predict correctly.

In unsupervised ML, no answer is provided to the model. The goal is usually to find clusters of similar pieces of data. For instance, you could use clustering to identify the different types of news articles present in a dataset of news articles, or to find topics that exist in a corpus of text. This is similar to what we have done with community detection.

In reinforcement learning, a model is given a goal and it gradually learns how to get to this goal. In many reinforcement...

Network Science with Python

By : David Knickerbocker

Network Science with Python

By: David Knickerbocker

Overview of this book

Related Content you might be interested in

Current Title:

Network Science with Python

Network Science with Python and NetworkX Quick Start Guide

Graph Machine Learning

Graph Data Science with Neo4j

Introducing ML