ClassNotFoundException running Gremlin on Spark
I'm attempting to transition from Titan to JanusGraph 0.1.0 and am having problems getting OLAP queries to work via Spark. I've loaded a graph with about 2 million vertices and tried to execute a simple count:
The job soon fails with "java.lang.ClassNotFoundException: org.apache.tinkerpop.gremlin.spark.structure.io.gryo.GryoSerializer", which I know is in spark-gremlin-3.2.3.jar. This appears to happen before the Spark executor has a chance to start. I tried adding this jar to spark.executor.extraClassPath, but it didn't help. Does HADOOP_GREMLIN_LIBS come into play? I've tried fiddling with it but to no avail.
I'm using HBase 22.214.171.124.5.3.0-37 and Spark 1.6 on HDP 126.96.36.199.
OLTP Gremlin queries work ok.
Here's my properties file:
This was working fine using Titan.
Jason Plurad <plu...@...>
toggle quoted messageShow quoted text
A similar message came up on the gremlin-users mailing list. You might want to compare notes with that thread.
On Wednesday, May 17, 2017 at 1:12:16 AM UTC-4, Jerrell Schivers wrote: