It is the basic abstraction of the
Graphx API. Property is a directed multi-graph where every vertex and edge is associated with a property. Each vertex in the Property Graph is also associated with a unique 64-bit long identifier (
VertexId). A directed multi-graph is defined as a directed graph where there can be multiple edges (
relationships) between the same vertices, such as A can be a friend and team mate of B.
The following is a logical representation of a Property Graph:
Logical representation of Property Graph
Here, we have a Property Graph consisting of five vertices. Each vertex in the graph consists of a
VertexId and a
property, which is a string object in this case, and every edge is also associated with a property, which is a string object as well, which describes the relation between the vertices.
Spark stores vertices and edges in different RDDs as follows:
Storage representation of Property Graph
Every element of the RDD of vertices contains a