Re: Compatiblity with Spark 2.3
Evgeniy Ignatiev <yevgeniy...@...>
By the way, some time ago we used HBase Snapshot InputFormat for JanusGraph - https://github.com/JanusGraph/janusgraph/blob/master/janusgraph-hadoop-parent/janusgraph-hadoop-core/src/main/java/org/janusgraph/hadoop/formats/hbase/HBaseSnapshotBinaryInputFormat.java - it is actually faster, in one of our tests it outpaced Cassandra by an order of magnitude, but after that we faced with some extremely tricky issues with data distribution across cluster (horrible data skew in comparison with the identical code used for Cassandra backend) and were not able to truly track the root cause in our timeframe of HBase use. Best regards, On 4/24/2019 5:11 PM, HadoopMarc wrote:
|
|