Book Image

Hands-On Graph Analytics with Neo4j

By : Estelle Scifo

Book Image

Hands-On Graph Analytics with Neo4j

By: Estelle Scifo

Overview of this book

Neo4j is a graph database that includes plugins to run complex graph algorithms. The book starts with an introduction to the basics of graph analytics, the Cypher query language, and graph architecture components, and helps you to understand why enterprises have started to adopt graph analytics within their organizations. You’ll find out how to implement Neo4j algorithms and techniques and explore various graph analytics methods to reveal complex relationships in your data. You’ll be able to implement graph analytics catering to different domains such as fraud detection, graph-based search, recommendation systems, social networking, and data management. You’ll also learn how to store data in graph databases and extract valuable insights from it. As you become well-versed with the techniques, you’ll discover graph machine learning in order to address simple to complex challenges using Neo4j. You will also understand how to use graph data in a machine learning model in order to make predictions based on your data. Finally, you’ll get to grips with structuring a web application for production using Neo4j. By the end of this book, you’ll not only be able to harness the power of graphs to handle a broad range of problem areas, but you’ll also have learned how to use Neo4j efficiently to identify complex relationships in your data.

Preface

Who this book is for

What this book covers

To get the most out of this book

Section 1: Graph Modeling with Neo4j

Section 1: Graph Modeling with Neo4j

Free Chapter

Graph Databases

Graph Databases

Graph definition and examples

Moving from SQL to graph databases

Neo4j – the nodes, relationships, and properties model

Understanding graph properties

Considerations for graph modeling in Neo4j

Further reading

The Cypher Query Language

The Cypher Query Language

Technical requirements

Creating nodes and relationships

Updating and deleting nodes and relationships

Pattern matching and data retrieval

Using aggregation functions

Importing data from CSV or JSON

Measuring performance and tuning your query for speed

Further reading

Empowering Your Business with Pure Cypher

Empowering Your Business with Pure Cypher

Technical requirements

Knowledge graphs

Graph-based search

Recommendation engine

Further reading

Section 2: Graph Algorithms

Section 2: Graph Algorithms

The Graph Data Science Library and Path Finding

The Graph Data Science Library and Path Finding

Technical requirements

Introducing the Graph Data Science plugin

Understanding the importance of shortest path algorithms through their applications

Dijkstra's shortest paths algorithm

Finding the shortest path with the A* algorithm and its heuristics

Discovering the other path-related algorithms in the GDS plugin

Optimizing processes using graphs

Further reading

Spatial Data

Technical requirements

Representing spatial attributes

Creating a geometry layer in Neo4j with neo4j-spatial

Performing spatial queries

Finding the shortest path based on distance

Visualizing spatial data with Neo4j

Further reading

Node Importance

Node Importance

Technical requirements

Defining importance

Computing degree centrality

Understanding the PageRank algorithm

Path-based centrality metrics

Applying centrality to fraud detection

Further reading

Community Detection and Similarity Measures

Community Detection and Similarity Measures

Technical requirements

Introducing community detection and its applications

Detecting graph components and visualizing communities

Running the Label Propagation algorithm

Understanding the Louvain algorithm

Going beyond Louvain for overlapping community detection

Measuring the similarity between nodes

Further reading

Section 3: Machine Learning on Graphs

Section 3: Machine Learning on Graphs

Using Graph-based Features in Machine Learning

Using Graph-based Features in Machine Learning

Technical requirements

Building a data science project

The steps toward graph machine learning

Using graph-based features with pandas and scikit-learn

Automating graph-based feature creation with the Neo4j Python driver

Further reading

Predicting Relationships

Predicting Relationships

Technical requirements

Why use link prediction?

Creating link prediction metrics with Neo4j

Building a link prediction model using an ROC curve

Further reading

Graph Embedding - from Graphs to Matrices

Graph Embedding - from Graphs to Matrices

Technical requirements

Why do we need embedding?

Adjacency-based embedding

Extracting embeddings from artificial neural networks

Graph neural networks

Going further with graph algorithms

Further reading

Section 4: Neo4j for Production

Section 4: Neo4j for Production

Using Neo4j in Your Web Application

Using Neo4j in Your Web Application

Technical requirements

Creating a full-stack web application using Python and Graph Object Mappers

Understanding GraphQL APIs by example – GitHub API v4

Developing a React application using GRANDstack

Further reading

Neo4j at Scale

Technical requirements

Measuring GDS performance

Configuring Neo4j 4.0 for big data

Other Books You May Enjoy

Other Books You May Enjoy

Leave a review - let other readers know what you think

Customer Reviews

5 star

0

4 star

0

3 star

0

2 star

0

1 star

0

Measuring the similarity between nodes

There are several techniques used to quantify the similarity between nodes. They can be divided into two categories:

Set-based measures: Compare the content of two sets globally. For instance, sets (A, B, C) and (C, D, B) have two common elements.
Vector-based measures: Compare vectors element-wise, meaning that the position of each element is important. Euclidean distance is an example of such measures.

Let's go into more detail about these metrics, starting from the set-based similarities.

Set-based similarities

The GDS 1.0 implements two variants of set-based similarities we'll cover here.

Overlapping

The overlapping similarity is a measure of the number of common elements between two sets, relative to the size of the smallest set.

Definition

This measure's mathematical definition is as follows:

O(A, B) = | A ∩ B | / min(|A|, |B|)

A ∩ B is the intersection between sets A and B (common elements) and |A| denotes the...