janusgraph and deeplearning


jonathan.mercier.fr@...
 

Dear,
I am looking to use both janusgraph with a deeplearning frameworks such as pytorch.
Does  anyone have some experience/example on this subject ?
Actually I use parquet -> dataframe -> pytorch

Thanks for your help


hadoopmarc@...
 

Hi Jonathan,

Can you elaborate on why you make the connection between janusgraph and deep learning? I can only imagine the wish to apply graph data stored in Janusgraph to train a GNN. I do not think however that you can leverage the message passing of TinkerPop VertexPrograms, because it is java based and cannot apply GPU's.

Best wishes,    Marc


jonathan.mercier.fr@...
 

Hi marc,
Thanks for your reply.
I have some knowledge data from multiple source, so firstly (i) I had to loaId those data to janusgraph, (ii) I need to apply a reconciliation algorithm which generate the knowledge graph. So I would like to train on this newly model with a graph neural network with pytorch or if not possible with deeplearning4j (I prefer python)

Thanks


hadoopmarc@...
 

Hi Jonathan,

One thing is not yet clear to me: does your graph fit into a single node (regarding memory and GPU) or do you plan to use distributed pytorch? Either way, I guess it would be most efficient to use a two step process:

  1. get all data from janusgraph and store it on disk in a suitable format
  2. run pytorch geometric (may be in a distributed way) from the files on disk
JanusGraph only supports the hadoop InputFormats to retrieve graph data in a distributed way. Some teams succeeded in retrieving data from partitions from the janusgraph storage backends (not using any janusgraph API, see here), which could be done in a custom pytorch loader, but this is not documented (yet).

Cool that you apply janusgraph to this use case, so do not hesitate to ask for more details!

Marc