GroupCount performance
ever...@...
I have the following query:

g.V().groupCount().by(T.label)

It does exactly what I want. The problem is that the label is not indexed, so the query takes ages on larger vertex counts: a set of 20k vertices takes about 5 seconds to process, and larger sets have run for up to 30 minutes without returning. How can I improve this performance without indexing the label? Note that labels cannot currently be indexed: https://github.com/JanusGraph/
Daniel Kuppitz <me@...>
An index wouldn't help: the query is a full scan either way. You should run it in OLAP instead, where you should gain performance through parallelisation.

Cheers,
Daniel

On Wed, Nov 29, 2017 at 9:50 AM, <ever...@...> wrote:
"Everly O." <ever...@...>
I have no experience with OLAP, so it sounds like I have some learning to do. I found this online; hopefully it's a good starting point: http://docs.janusgraph.org/latest/hadoop-tp3.html

If not, any references would be helpful!

On Wednesday, November 29, 2017 at 4:35:21 PM UTC-6, Daniel Kuppitz wrote:
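
For readers who are also new to OLAP, the suggestion above might look roughly like the following in the Gremlin Console. This is only a sketch under assumptions not stated in the thread: it assumes a JanusGraph distribution with the Hadoop/Spark dependencies available, and the properties file path is an example that has to be replaced with the hadoop-graph configuration matching your storage backend.

```groovy
// Gremlin Console sketch. Assumes JanusGraph's Hadoop/Spark plugins are loaded.
// The properties path below is an assumption; pick the hadoop-graph config
// that matches your storage backend and edit its connection settings.
graph = GraphFactory.open('conf/hadoop-graph/read-cassandra.properties')

// Bind the traversal source to Spark so the vertex scan is distributed (OLAP)
// instead of running in a single thread (OLTP).
g = graph.traversal().withComputer(SparkGraphComputer)

// Same query as in the original post. It is still a full scan,
// but the counting is now parallelised across partitions.
g.V().groupCount().by(T.label)
```

The query itself is unchanged; only the traversal source differs. Whether this is actually faster depends on the cluster setup and data size, since job startup adds overhead of its own.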