105. Upper limit on Hive dynamic partitions
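The number of dynamic partitions is capped at 1000 by default, so the limits need to be raised along these lines:
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;
SET hive.exec.max.dynamic.partitions=1000000;
SET hive.exec.max.dynamic.partitions.pernode=1000000;
SET hive.exec.max.created.files=15000000;
(The numbers are arbitrary. I don't know what the downsides are; I assume things simply get slower as the partition count grows.)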
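As a rough sketch of where these limits come into play: a dynamic-partition insert leaves the partition column without a value in the PARTITION clause, and Hive creates one partition per distinct value it finds. The table and column names here (access_log_raw, access_log, user_id, url, dt) are made up for illustration and are not from the original slides.

```sql
-- Hypothetical tables: access_log_raw (unpartitioned) and access_log (partitioned by dt).
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;

-- dt has no value in the PARTITION clause, so it is a dynamic partition column.
-- Its value is taken from the last column of the SELECT list.
INSERT OVERWRITE TABLE access_log PARTITION (dt)
SELECT user_id, url, dt
FROM access_log_raw;
```

If the SELECT produces more distinct dt values than hive.exec.max.dynamic.partitions allows, the query fails, which is why the limits are raised beforehand.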
106. Notes on options for running Hive
-- Dynamic partitioning
SET hive.exec.dynamic.partition=true;
SET hive.exec.dynamic.partition.mode=nonstrict;
SET hive.exec.max.dynamic.partitions=1000000;
SET hive.exec.max.dynamic.partitions.pernode=1000000;
SET hive.exec.max.created.files=15000000;
-- Compression of final and intermediate output
SET hive.exec.compress.output=true;
SET io.seqfile.compression.type=BLOCK;
SET hive.exec.compress.intermediate=true;
SET hive.intermediate.compression.type=BLOCK;
SET mapred.output.compress=true;
SET mapred.output.compression.type=BLOCK;
SET mapred.output.compression.codec=org.apache.hadoop.io.compress.SnappyCodec;
-- Merging of small output files
SET hive.merge.mapfiles=true;
SET hive.merge.mapredfiles=true;
SET hive.merge.size.per.task=256000000;
SET hive.merge.smallfiles.avgsize=16000000;