Re: Nodes with lots of edges


Joe Obernberger
 

Hi Marc - yes, it takes minutes to do queries on nodes with lots of edges.  Like:

:> g.V().has("somevar","someVal").outE().has("indexedField","value")

I believe this is because of the large partition size.  I would love to use vertex cutting; but there seems to be a problem with it:

-----

Every-time, I built a small graph and exported to graphML for viewing in Gephi I would have node IDs that only existed in the edges list.
I printed the nodeID in my code everywhere it was used, and I would never see it in the output, but the graphML had it in the edges list and those 'zombie nodes' did exist in the graph as confirmed by gremlin queries. This was happening because I was using:
VertexLabel sourceLabel = mgmt.makeVertexLabel("source").partition().make(); Once I removed partition, the "zombie" node IDs disappeared.  I wanted to use partition for that since those particular vertexes can have a lot of edges; potentially billions. 

-----

Is there a bug with vertex cutting?  Thank you!

-Joe

On 7/8/2022 1:44 AM, hadoopmarc@... wrote:
Hi Joe,

You do not describe whether breaking this rule of thumb causes real performance issues in your case. Anyway, JanusGraph allows you to partition the stored edges of a node, see:
https://docs.janusgraph.org/advanced-topics/partitioning/#vertex-cut

Marc



AVG logo

This email has been checked for viruses by AVG antivirus software.
www.avg.com


Join janusgraph-users@lists.lfaidata.foundation to automatically receive all group messages.