Compatibility with Spark 2.3
polaco...@...
Hi, is it possible to run OLAP queries using SparkGraphComputer via YARN with Apache Spark 2.3.2? https://docs.janusgraph.org/latest/version-compat.html states that only 2.2.x is supported. Has anyone tried compatibility with a newer version of Apache Spark? Thanks.
|
|
Evgeniy Ignatiev <yevgeniy...@...>
Hello. As far as I remember, Spark compatibility is dictated solely by the corresponding spark-gremlin module from TinkerPop, and the 3.3.x release train officially supports only Spark 2.2.
Best regards,
On 4/24/2019 2:25 PM, polaco...@... wrote:
|
|
HadoopMarc <marc.d...@...>
JanusGraph/TinkerPop have all necessary Spark dependencies included in their distributions, so Spark compatibility on Spark/YARN is not an issue as long as you keep the Spark jars of the YARN cluster out of the various CLASSPATHs involved. You can get the general idea by combining: and
What storage backend do you use for JanusGraph? SparkGraphComputer is known to be painfully slow on JanusGraph/HBase (although I am not sure whether reading from the HBase Snapshot feature was ever implemented in JanusGraph; I have never tried that).
Cheers, Marc
On Wednesday, 24 April 2019 12:42:17 UTC+2, Evgeniy Ignatiev wrote:
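The classpath advice above can be illustrated with a minimal Python sketch. It is not from the thread: the function name `strip_cluster_spark_jars` and all paths are hypothetical, and it only shows the idea of dropping classpath entries that point into the cluster's own Spark installation, so that the Spark jars bundled with JanusGraph/TinkerPop are the ones that get loaded.

```python
import posixpath


def strip_cluster_spark_jars(classpath, cluster_spark_dir, sep=":"):
    """Return `classpath` without entries located under `cluster_spark_dir`.

    Both arguments are plain strings; nothing is read from disk.
    `cluster_spark_dir` is a hypothetical location of the cluster's Spark install.
    """
    root = posixpath.normpath(cluster_spark_dir)
    kept = []
    for entry in classpath.split(sep):
        if not entry:
            continue  # skip empty segments such as "a::b"
        # Treat a wildcard entry like /dir/jars/* as its directory /dir/jars
        path = posixpath.normpath(entry[:-1] if entry.endswith("*") else entry)
        if path == root or path.startswith(root + "/"):
            continue  # entry points into the cluster's Spark install: drop it
        kept.append(entry)
    return sep.join(kept)


if __name__ == "__main__":
    # Hypothetical classpath mixing JanusGraph jars with cluster Spark jars
    cp = "/opt/janusgraph/lib/*:/usr/hdp/spark2/jars/*:/etc/hadoop/conf"
    print(strip_cluster_spark_jars(cp, "/usr/hdp/spark2"))
```

The Hadoop configuration directory is kept on purpose in this example, since only the cluster's Spark jars are meant to be excluded.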
|
|
Evgeniy Ignatiev <yevgeniy...@...>
By the way, some time ago we used the HBase Snapshot InputFormat for JanusGraph - https://github.com/JanusGraph/janusgraph/blob/master/janusgraph-hadoop-parent/janusgraph-hadoop-core/src/main/java/org/janusgraph/hadoop/formats/hbase/HBaseSnapshotBinaryInputFormat.java - and it is actually faster: in one of our tests it outpaced Cassandra by an order of magnitude. After that, however, we faced some extremely tricky issues with data distribution across the cluster (horrible data skew compared with the identical code used for the Cassandra backend) and were not able to track down the root cause during the time we were using HBase.
Best regards,
On 4/24/2019 5:11 PM, HadoopMarc wrote:
|
|
Juraj Polačok <polaco...@...>
Hi, currently I am getting this error:
TinkerPop 3.4+ should support Apache Spark 2.3, so I think it should work. Have you tried running some queries using Spark 2.3?
On Wednesday, 24 April 2019 15:11:26 UTC+2, HadoopMarc wrote:
|
|