sparkgraph | Spark-Gremlin is an implementation of the Blueprints Graph
kandi X-RAY | sparkgraph Summary
kandi X-RAY | sparkgraph Summary
Spark-Gremlin is an implementation of the Blueprints Graph API (and the Gremlin Graph Traversal Language (implmented using Spark (
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of sparkgraph
sparkgraph Key Features
sparkgraph Examples and Code Snippets
Community Discussions
Trending Discussions on sparkgraph
QUESTION
Hi I have created Graph using vertex and edge files. Size of graph is 600GB. I am querying this graph using motif feature of Spark GraphFrames. I have setup an AWS EMR cluster for querying graph.
cluster details:- 1 master and 8 slaves
Master Node:
...ANSWER
Answered 2020-Jun-21 at 20:51The problem with the implementation of Graphframes is that it makes self joins of the internal dataframes as many times as you use on the motifs. That means that you will have more a more shuffle as the length of the chain increases
You can see more details at https://www.waitingforcode.com/apache-spark-graphframes/motifs-finding-graphframes/read
I have also tried a similar approach and have seen that when the length of the chain is greater than 12 Spark starts being not responsive and connections are lost with executors, even if I increased resources.
If you are trying to do that, I would recommend using a graph database instead.
Hope this helps
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install sparkgraph
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page