Re: HBase unbalanced table regions after bulkload
aoz...@...
Hi Marc, As far as I know, even if yarn schedules executors unevenly, it does not mean that the data written across hbase will be uneven. The data is written hbase according to the key of the datum and the key-ranges of the regions, it does nothing to do with the node that the writer jvm is working on. My executors are working on %90 of my nodes (it is not that uneven) however percentage of my empty regions is %90(900 of 1024 regions). If you were right the latter percentage would be %10 instead of %90. If there is some other mechanism while assigning ids in distributed fashion, may you please keep me updated and elaborate on the mechanism. Best, Ali 15 Haziran 2017 Perşembe 22:51:14 UTC+3 tarihinde HadoopMarc yazdı:
|
||||||||||
|