spark-gremlin | implement implement the Tinkerpop Gremlin API
kandi X-RAY | spark-gremlin Summary
kandi X-RAY | spark-gremlin Summary
Spark-Gremlin is an attempt to implement implement the Tinkerpop Gremlin API (This is a complete re-write of the SparkGraph project (which was an attempt to implement the TinkerPop V2 API.
Support
Quality
Security
License
Reuse
Top functions reviewed by kandi - BETA
Currently covering the most popular Java, JavaScript and Python libraries. See a Sample of spark-gremlin
spark-gremlin Key Features
spark-gremlin Examples and Code Snippets
Community Discussions
Trending Discussions on spark-gremlin
QUESTION
I have wrote a program in order to perform some queries on top of Gremlin (I use Janus Graph with Cassandra and Solr as the engine) with the help of Spark, but the query result is terrible slow.
Most probably I have setup something not correctly.
Here is the code I have used.
Driver program:
...ANSWER
Answered 2020-Oct-26 at 19:31OLAP based Gremlin traversals will be much slower than standard OLTP traversals even for small datasets. There is considerable cost just in getting Spark primed up to process your traversal. That overhead alone might easily give your OLAP query a 1 minute handicap over OLTP. In the comments to your question you explained that your query is taking around six minutes. That does seem a bit on the long side but maybe in the realm of normal for OLAP depending on your environment??
Some graph will optimize for an OLAP count()
and get you a pretty speedy result but you tagged this question with "JanusGraph" so I don't think that applies here.
You typically don't see the value of OLAP based traversals until you start concerning yourself with large scale graphs. Compare counting 100+ million edges in OLAP versus OLTP and you probably won't mind waiting six minutes for an answer at all (as OLTP might not finish at all).
It's hard to say what you might do to make your current setup faster as you are really just proving things work at this point. Now that you have a working model, I would suggest that the next step would be to generate a significantly larger graph (10 million vertices maybe) and try your count again with a decent sized spark cluster.
Community Discussions, Code Snippets contain sources that include Stack Exchange Network
Vulnerabilities
No vulnerabilities reported
Install spark-gremlin
Support
Reuse Trending Solutions
Find, review, and download reusable Libraries, Code Snippets, Cloud APIs from over 650 million Knowledge Items
Find more librariesStay Updated
Subscribe to our newsletter for trending solutions and developer bootcamps
Share this Page