mixed index - Reindex is very slow


vamsi.lingala@...
 

Hi we have imported around 4 billion vertices in janus graph.
We are using big table and elastic search

reindexing speed is very slow..around 2000 records per second
is there any way to speed it up?


hadoopmarc@...
 

janusgraph-dev is for discussion on internal implementation details of JanusGraph itself. Questions about using JanusGraph, installation, configuration, and integrations should be posted on janusgraph-users. First-time posts are moderated and may not be visible immediately.

Could you please repost your question on janusgraph-users, so that I can copy the answer below on the right list?

Did you try to use the mapreduce way of reindexing?
https://docs.janusgraph.org/index-management/index-reindexing/#executing-a-reindex-job-on-mapreduce


hadoopmarc@...
 

The janusgraph user forum was moved to:

https://lists.lfaidata.foundation/g/janusgraph-users

You have to fill an e-mail address and acknowledge a request sent to that address.

Regarding the code lines of the mapreduce index:
// Run a JanusGraph-Hadoop job to reindex
mgmt = graph.openManagement()
mr = new MapReduceIndexManagement(graph)
mr.updateIndex(mgmt.getGraphIndex("mixedExample"), SchemaAction.REINDEX).get()

As far as I know this runs on your local machine and all dependencies are present in the JanusGraph distribution. In other words, no need for an hadoop or spark cluster for this. 

Cheers,     Marc