Questions about writing a storage backend for Solr for nodes/edges in addition to indexes
Russell Jurney <russel...@...>
Hello fellow JanusGraphers,
I'm interested in writing a storage backend for Solr to store nodes/edges in addition to indexes and would like some pointers on doing so. We would open source the implementation as a part of JanusGraph.
I work at an analytics startup and we're using JanusGraph to analyze the open source ecosystem. We have a Solr cluster to search over the same data that will appear in our JanusGraph database. It occurred to me that it would be nice to have one less system and so rather than store our graph data in duplicate some place like Cassandra or BigTable, why not access it on a Solr cluster? Solr is a capable database. It would be cool to use the same index but even if it were a different index served from different machines in the cluster, we would have significantly less operational overhead than if we ran something else and had to develop expertise in both systems.
So, my questions are:
* How much work would be involved? The existing storage backends vary a lot in how much code they are made up of, so I have no idea of an estimate.
Thanks in advance!
Russell Jurney, Founding Engineer @ Archipelo.co (we do foss graphs)