Re: Nodes with lots of edges
Matthew Nguyen <nguyenm9@...>
Saw the same thing a while back. Boxuan opened an issue for it: https://github.com/JanusGraph/janusgraph/issues/2966
From: Joe Obernberger <joseph.obernberger@...>
Sent: Mon, Jul 11, 2022 7:32 am
Subject: Re: [janusgraph-users] Nodes with lots of edges
Hi Marc - yes, it takes minutes to do queries on nodes with lots of edges. Like:
I believe this is because of the large partition size. I would love to use vertex cutting, but there seems to be a problem with it:
Every time I built a small graph and exported it to GraphML for viewing in Gephi, I would have node IDs that existed only in the edge list.
I printed the node ID everywhere it was used in my code and never saw these IDs in the output, but the GraphML had them in the edge list, and those "zombie" nodes did exist in the graph, as confirmed by Gremlin queries. This was happening because I was using:
VertexLabel sourceLabel = mgmt.makeVertexLabel("source").partition().make();
Once I removed partition(), the "zombie" node IDs disappeared. I wanted to use partition() for that label because those particular vertices can have a lot of edges, potentially billions.
Is there a bug with vertex cutting? Thank you!
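For anyone trying to reproduce this, a quick way to spot the "zombie" IDs described above is to scan the GraphML export for edge endpoints that have no matching node element. The following is only a minimal sketch using the JDK's DOM parser. The class and method names (ZombieNodeCheck, findUndeclaredNodeIds) are made up for illustration and are not part of JanusGraph or TinkerPop. It assumes the export uses the usual unprefixed <node id="..."> and <edge source="..." target="..."> elements, and that the file is small enough to load into memory.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import java.util.HashSet;
import java.util.Set;
import java.util.TreeSet;
import javax.xml.parsers.DocumentBuilderFactory;
import org.w3c.dom.Document;
import org.w3c.dom.Element;
import org.w3c.dom.NodeList;

// Hypothetical helper, not part of JanusGraph or TinkerPop.
final class ZombieNodeCheck {

    // Returns IDs used as an edge source or target that have no <node> element.
    static Set<String> findUndeclaredNodeIds(String graphMl) throws Exception {
        Document doc = DocumentBuilderFactory.newInstance()
                .newDocumentBuilder()
                .parse(new ByteArrayInputStream(graphMl.getBytes(StandardCharsets.UTF_8)));

        // Collect every declared node ID.
        Set<String> declared = new HashSet<>();
        NodeList nodes = doc.getElementsByTagName("node");
        for (int i = 0; i < nodes.getLength(); i++) {
            declared.add(((Element) nodes.item(i)).getAttribute("id"));
        }

        // Collect edge endpoints that were never declared (sorted for stable output).
        Set<String> undeclared = new TreeSet<>();
        NodeList edges = doc.getElementsByTagName("edge");
        for (int i = 0; i < edges.getLength(); i++) {
            Element edge = (Element) edges.item(i);
            for (String attr : new String[] {"source", "target"}) {
                String id = edge.getAttribute(attr);
                if (!declared.contains(id)) {
                    undeclared.add(id);
                }
            }
        }
        return undeclared;
    }
}
```

Reading the exported file into a string (for example with Files.readString) and passing it to findUndeclaredNodeIds gives the set of suspect IDs, which can then be looked up with Gremlin. An empty result only means the export is self-consistent. It does not by itself rule out extra vertices in the graph.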
On 7/8/2022 1:44 AM, hadoopmarc@... wrote:
You do not describe whether breaking this rule of thumb causes real performance issues in your case. Anyway, JanusGraph allows you to partition the stored edges of a node, see: