Janusgraph - OLAP using Dataproc
bobo...@...
Hi, We are using Janusgraph (0.5.2) with Scylladb as backend. So far we are only using OLTP capabilities but would now like to also do some more advanced batch processing to create shortcut edges, for example for recommendations. To do that, I would like to use the OLAP features. Reading the documentation this sounds pretty straightforward, assuming one has a Hadoop cluster up and running. But here comes my problem: I would like to use Dataproc - Google's managed solution for Hadoop and Spark. Unfortunately I couldn't find any further information on how to get those two things playing well together. Does anyone have any experience, hints or documentation on how to properly configure Janusgraph with Dataproc? In a very first step, a was trying the following (Java application with embedded Janusgraph)
the graph.properties file looking like this
If I just run the code like this, without specifying anything else, it just results in nothing happening, and endless log output like these
Additionally, I added the hdfs-site extracted from dataproc to my classpath, but that didn't help any. The same in the OLTP world works like a charm. (of course using a proper query, one not iterating over the whole graph .... :D ) Any hints, ideas, experiences or links are greatly appreciated. Looking forward to some answers, Claire
|
|