|
Re: Bulk loading using Json, python or Scala?
I see.. we need to use GraphML file. The issue here is our graph might contain billions of nodes, will Python client be able to handle that?
Thanks!
I see.. we need to use GraphML file. The issue here is our graph might contain billions of nodes, will Python client be able to handle that?
Thanks!
|
By
Yihang Yan <yanyi...@...>
·
#322
·
|
|
Time series modelling help needed...
I need a help in time series modelling, with Cassandra as the backend storage...
Consider a model as below
Has many Which Report
Domain
I need a help in time series modelling, with Cassandra as the backend storage...
Consider a model as below
Has many Which Report
Domain
|
By
Ravikumar Govindarajan <ravikumar....@...>
·
#321
·
|
|
Re: Bulk loading using Json, python or Scala?
Thanks, Dave! I am wondering what's the recommended format of the data, xml, txt or csv ?
Thanks, Dave! I am wondering what's the recommended format of the data, xml, txt or csv ?
|
By
Yihang Yan <yanyi...@...>
·
#320
·
|
|
Re: HBase unbalanced table regions after bulkload
Hi Ali,
OK, I overlooked your config line "storage.hbase.region-count=1024". This is far too large a number, since HBase likes regions with a size of the order of 10GB, rather than the 130MB you
Hi Ali,
OK, I overlooked your config line "storage.hbase.region-count=1024". This is far too large a number, since HBase likes regions with a size of the order of 10GB, rather than the 130MB you
|
By
marc.d...@...
·
#319
·
|
|
Re: HBase unbalanced table regions after bulkload
Hi Marc,
As far as I know, even if yarn schedules executors unevenly, it does not mean that the data written across hbase will be uneven.
The data is written hbase according to the key of the datum
Hi Marc,
As far as I know, even if yarn schedules executors unevenly, it does not mean that the data written across hbase will be uneven.
The data is written hbase according to the key of the datum
|
By
aoz...@...
·
#318
·
|
|
Re: Bulk loading using Json, python or Scala?
You should be able to bulk load with any client that allows you to submit raw gremlin scripts to the server. For example, you can do it with ipython-gremlin, check out cell # 2:
You should be able to bulk load with any client that allows you to submit raw gremlin scripts to the server. For example, you can do it with ipython-gremlin, check out cell # 2:
|
By
David Brown <dave...@...>
·
#317
·
|
|
Re: HBase unbalanced table regions after bulkload
Hi Ali,
I have never tried to optimize this myself, but maybe you should also look into the docs at
12.3.30. storage.hbase
...
storage.hbase.region-count
The number of initial regions set when
Hi Ali,
I have never tried to optimize this myself, but maybe you should also look into the docs at
12.3.30. storage.hbase
...
storage.hbase.region-count
The number of initial regions set when
|
By
HadoopMarc <m.c.d...@...>
·
#316
·
|
|
Bulk loading using Json, python or Scala?
Other than Groovy, is it possible to do bulk loading using Json, python or Scala? Any sample code could be provided?
Thanks!
Other than Groovy, is it possible to do bulk loading using Json, python or Scala? Any sample code could be provided?
Thanks!
|
By
Yihang Yan <yanyi...@...>
·
#315
·
|
|
HBase unbalanced table regions after bulkload
We are using cloudera 5.7.0 with java 1.8.0_74 and we have spark 1.6.0, janusgraph 0.1.1, hbase 1.2.0.
I managed to bulkload 130GB of data into 1024 region hbase table in 2 hours 30 minute with 1024
We are using cloudera 5.7.0 with java 1.8.0_74 and we have spark 1.6.0, janusgraph 0.1.1, hbase 1.2.0.
I managed to bulkload 130GB of data into 1024 region hbase table in 2 hours 30 minute with 1024
|
By
aoz...@...
·
#314
·
|
|
Re: how to load a CSV file into janusgraph
Hi Elizabeth,
For JanusGraph you should also take into account the TinkerPop documentation. A relevant pointer for you
Hi Elizabeth,
For JanusGraph you should also take into account the TinkerPop documentation. A relevant pointer for you
|
By
HadoopMarc <m.c.d...@...>
·
#313
·
|
|
Re: Finding supernodes with insufficient frame size
That won't work if the framesize is not large enough.
That won't work if the framesize is not large enough.
|
By
Adam Holley <holl...@...>
·
#311
·
|
|
Re: Finding supernodes with insufficient frame size
This should give you the counts, highest first, by vertex id:
g.V().group().by(id()).by(outE().count()).order(local).by(values,decr)
Robert Dale
This should give you the counts, highest first, by vertex id:
g.V().group().by(id()).by(outE().count()).order(local).by(values,decr)
Robert Dale
|
By
Robert Dale <rob...@...>
·
#312
·
|
|
Finding supernodes with insufficient frame size
Using cassandra as the backend, I was trying to count edges {g.E().count} and of course ran into the framesize problem because I had a supernode. I found that I could identify which node was the
Using cassandra as the backend, I was trying to count edges {g.E().count} and of course ran into the framesize problem because I had a supernode. I found that I could identify which node was the
|
By
Adam Holley <holl...@...>
·
#310
·
|
|
how to load a CSV file into janusgraph
Hi all,
I am new to Janusgraph, I have dived into docs of Janusgraph for almost two weeks, nothing found.
I could only gather the scatted information and most of the time it will prompt some
Hi all,
I am new to Janusgraph, I have dived into docs of Janusgraph for almost two weeks, nothing found.
I could only gather the scatted information and most of the time it will prompt some
|
By
Elizabeth <hlf...@...>
·
#309
·
|
|
Re: Index not being used with 'Between" clause
Outstanding - thank you Jason.
-gene
Outstanding - thank you Jason.
-gene
|
By
Gene Fojtik <genef...@...>
·
#308
·
|
|
call queue is full on /0.0.0.0.:60020, too many items queued? hbase
Here is my problem:
We are using cloudera 5.7.0 with java 1.8.0_74 and we have spark 1.6.0, janusgraph 0.1.1, hbase 1.2.0.
I try to load 200Gb of graph data and for that I run the following code in
Here is my problem:
We are using cloudera 5.7.0 with java 1.8.0_74 and we have spark 1.6.0, janusgraph 0.1.1, hbase 1.2.0.
I try to load 200Gb of graph data and for that I run the following code in
|
By
aoz...@...
·
#307
·
|
|
Re: Production users of JanusGraph
Hi Misha ,
Thanks a lot for your response and useful information much appreciated.
Thanks ,
Anurag
Hi Misha ,
Thanks a lot for your response and useful information much appreciated.
Thanks ,
Anurag
|
By
anurag <anurag...@...>
·
#306
·
|
|
Re: Another perspective on JanusGraph embedded versus server mode
Hi Jamie,
Good question, and I dig the ASCII art. To answert your question, they will describe the same graph as if you were running the Janus instances in their own JVMs. I've used both approaches.
Hi Jamie,
Good question, and I dig the ASCII art. To answert your question, they will describe the same graph as if you were running the Janus instances in their own JVMs. I've used both approaches.
|
By
Ted Wilmes <twi...@...>
·
#305
·
|
|
Another perspective on JanusGraph embedded versus server mode
We have a _domain specific_ REST API that is architecturally decoupled from JanusGraph. In other words, users of the REST API have no indication that their calls interact with JanusGraph, or even with
We have a _domain specific_ REST API that is architecturally decoupled from JanusGraph. In other words, users of the REST API have no indication that their calls interact with JanusGraph, or even with
|
By
Jamie Lawson <jamier...@...>
·
#304
·
|
|
Re: Index not being used with 'Between" clause
Make sure you're using a mixed index for numeric range queries. Composite indexes are best for exact matching. The console session below shows the difference:
gremlin> graph =
Make sure you're using a mixed index for numeric range queries. Composite indexes are best for exact matching. The console session below shows the difference:
gremlin> graph =
|
By
Jason Plurad <plu...@...>
·
#303
·
|