Hadoop Conference Japan 2013ã§è©±ãããã¨ã¨æã£ããã¨
Hadoop Conference Japan 2013
http://hcj2013w.eventbrite.com/
å
é±çµäºãããªãã®çæ³ã§çµãã£ãæãã§ããã¾ãã¯éå¬ããµãã¼ããã¦é ããç¸å½ã®è² æ
ã¾ã§é ãããªã¯ã«ã¼ãã»ãã¯ããã¸ã¼æ§ã«æè¬ç³ãä¸ãã¾ããã©ãããããã¨ããããã¾ããã
ãã¦ããã£ã¨ãååããããããã¤ã ã£ãã®ããè¯ãè¦ãã¦ãªãããã§ã2011 Fallã ã£ããããªã
http://hadoop-conference-japan-2011-fall.eventbrite.com/
2011å¹´ã®9æãªã®ã§ã1å¹´4ã¶æã¶ãã¨ããæãã§ãããTrackæ°ãå¢ãã¦2ãã3ã§ãä¼å ´ããã«ãµã¼ã«ããããã°ãµã¤ãã«ãªã£ã¦ãã¾ããã人æ°ã1000人è¶
ã«ãªã£ã¦ããã¾ãã
以ä¸ãææ³æã§ããè¨é²ã¨ãã¦ããã¦ããæãã§ã
ã»å 容ã§å°è±¡ã«æ®ã£ããã®
ã»HBase~LINEã®ããã¯ãã¼ã³ã§ä½¿ã£ã¦ããã¨ããå
容ãããããã¦ã¼ã¶ã¼æ°ã1åçªç ´ãã¦ããã¨ããå ±åã§ãããªã大å¤ãããªå°è±¡ã§ããã
ãDCã®ãªã³ã©ã¤ã³ç§»è¡ã»ã¼ãã¼ã«é¡ã¯èªåã§æºåããæãã ã¨ããNNé害ã¯ã¾ããã®èªårsyncã ã¨ãããªãã¨ãããæ°åãã§ä¹ãåãæãã¨ããããµã¼ãã¹ã®æé·ã«å¨ãã追ãã¤ãããã®ã«ç²¾ä¸æ¯æãããã¾ãã¦ããã®ããããé常ã«å°è±¡çã§ããããããHBaseãç«å±±ãçãªè©±ã¯åã§èãã¦ãã¾ããããå°è±¡ã¨ãã¦ã¯ããã¾ããå«ãã¦å
¨é¨ç«å±±ãã¦ãªæããã¨ãããããHBaseã¯ãªãã¨ãé å¼µã£ã¦ã¾ã£ãç¶æ
ã«è¦ãã¾ããããããæªããããããªæå³ã§åããªãã¨ããã ãã¾ã¼ãã®ã¾ã¾ããããã¤ãé
·ãäºæ
èµ·ããããã§æãã§ãããã»ã»ã»
ã»TDã®å±æãã¬ã¼ã³ãããããæ¥æ¬ã®ITãã³ãã£ã¼ã®æ´å²ã«åãæ®ãããTreasure Dataã®å¤ªç°ããã®ãã¬ã¼ã³ãã¨ãããããããã§ã¨ãï¼ã¨ããæãã話ã®å 容ã¯ãTDã§ã®ãä½ãã©ããã¦ãããã¨ãããã¸ãã¹ã©ã¤ã³ã®è©±ã§é常ã«åèã«ãªãã¾ããã話ã®é åºã¨ãã¦ã¯ãããªãã¯ã©ã¦ããï¼ãã¨ãã話ããå§ã¾ã£ã¦ããã¸ãã¹çã«ã¯æ¡å¤§è·¯ç·ä¸ã§æ次ã§+40%ã®ãã¼ã¿æé·ã®è©±ã¸ã¨ç¶ãæãããããã«åãã£ãã§ããã
ãå人çã«ãã£ã¨ãå°è±¡æ·±ãã£ãã®ã¯ããã©ã¼ã«ã¹ãã¤ã³ãã¯åæã§ã¯ãªããã¬ãã¼ããã¨ããç¹ãããã¯é常ã«è³æã§ããå¤åããã®ãã©ã¼ã«ã¹ã¯æ£è§£ã§ããã¸ãã¹ã¢ãã«ã¨ãã¦ã¯ã·ã³ãã«ãªåã ããé常ã«åªãã¦ãããã¨ã«å ããä¸åã·ã§ã¢ãã¨ã£ã¦ãã¾ãã°ã競ååªä½ã常ã«åããã¨ããä½ç½®ã§ããããã®ãããã¯ããã¾ã注ç®ããã¦ããªãããã§ããããããã¯ãã¼ã¿ãã§è¸ã£ã¦ããã¡ã¼ã«ã¼ç³»SIå±ããã¹ã³ãã»è©è«å®¶ã®äººãã¡ã¯ãæ£åº§ãã¦èãã¹ãã§ããããå¥ã«TDã«åæãã§ããªã人æãããªãããã§ããªãã§ããããããã太ç°ããã®å身ãPFIã ã£ããã¨ãèããã°ãããªãæå³æ·±é·ã§ããè¦ã¯ãITã®ä½¿ãæ¹ã¨ãã¦æå³ããããã¨ãããã¨ã¨ãã¾ãã¯ãã¸ãã¹ã«ãªããã¨ãããã¨ã¯å¿
ãããã¤ãªãããªãã¨ãããã¨ã§ãããã
ãAWSã®RedShiftã¨æ¯ã¹ã¦ã©ã¼ãªãã ãã¨ããæè¦ã¯ããã¨åæã§ããã¨ã¯æãã¾ãããTDã®ãã®ã¢ãã«ãã¿ãéããå½åå®æ³°ã ãªãã¨ããå°è±¡ã§ããããã¨ã¯ããé²ãã ãã§ãä¸çªé£ããå±é¢ã¯ä¹ãåã£ãããã«è¦ãã¾ãã競åãåºãã¨ãã¦ãã追ãã¤ãã®ã¯é£ããã§ãããã
ãã¨ã¯ãããä¸çªå°è±¡ã«æ®ã£ãã®ã¯ã太ç°ããã»å¤æ©ãããä¸å¹´åã¨ã¾ã£ããå¤ãã£ã¦ããªãã£ãã¨ããã¨ãããã¨ãããã ãæåè·¯ç·ã«ä¹ãã¨ãã¡ãã£ã¨æµ®ã¤ããã¨ãããã§ãã®ã人éã§ããããã¼ãã¼ã¨ããããªããç«æ´¾ã§ãããTDã®æ¬æ¥ã®å¼·ã¿ã¯å½¼ãã®äººéæ§ã§ã¯ãªãããªã¨æã£ãããã¾ãã
ã»ã»ã»
ãã¨ã¯ã«ã³ãã¡ã¬ã³ã¹ã§ã¯ããã£ã¨ããããèãããã£ãã®ã§ããããªããªãæéãåãã¾ããã§ãããéä¸ã§ã客ããã¨Mtgãå
¥ã£ããã§ãä¸åº¦ããã°ãµã¤ããé¢ãã¦ãèªåã®è©±ãæéã«åããã¦æ»ã£ãæãã§ããï¼ã¾ããHadoopã使ããã¨ããã客ããã¨ã®Mtgã§ãããHCJã«ã¯ã客ããã¯åå ãã¦ããªãããã§ã»ã»ã»è¦ããã«ããããã¨ã³ã¿ã¼ãã©ã¤ãºã®æµãã¨ã³ãã¥ããã£ã®æµãã¯é¢ãã¤ã¤ãããªããã¨ãæãã¾ãããï¼
ã»ããã¹ã£ãå 容
ã以ä¸ãèªåã®ããã¹ã£ãå 容ã§ããã»ãã·ã§ã³çã«ã¯è¡¨ãTDã®å¤æ©ããã®è©±ã ã£ãã®ã§ãå®å ¨ã«è£çªçµç¶æ ã§ããã¹ã³ãç³»ã®äººã¯å ¨é¨åããã«ãã£ãã®ã§ããé°ã§ãã¡ãã¯è¨ãããäºãè¨ããæãã§ãããæè¿ã¯ãã¼ããã¼ããã¨ã®æåããã¸ã·ã§ã³ãã¼ã¯ãå¤ãã®ã§ãä»åã¯ãå®æ ã赤裸裸ã«ãã¨ããæãã§è©±ãã¾ããã端ããè¦ãã¨æ¯ã¯åãã¾ããã«è¦ããã§ãããããæ¯ã®ãªãç¾å®ã¯ããããªãããé¢ç½ãããªãã¨ããªãã®ã§ããã®è¾ºã¯æµ èã»å¯å¸çãªæãã§è©±ãã¾ãããå²ã¨è£è©±ãããã®ã§ãå½ç¶ãã¬ã¼ã³è³æãUstã¯ç¾æç¹ã§ã¯éå ¬éã«ããã¦ããã£ã¦ãã¾ããã¨ã¯ããããã¼ãã¼ãã¨ã¯ä»åãæå¾ã§ãå¤åãä»å¾ã¯å ¬éã®å ´æã§ããããdisãã¨ãããã¨ã¯ãªãã§ããããHadoopé¢é£ã¯ãããããããã§ã¼ãºã¯éãã¾ããããèªåãç«å ´çã«ãããã話ããªãç¶æ ã«ãªã£ã¦ãã¾ãã
ãã¾ãã¯Asakusaã®è©±ãä¸å¿ã§ãä»ã®èª²é¡ã¨ãã®è§£æ±ºçãã©ããã¾ããï¼ã¨ãã話ãä¸å¿ã«è©±ãã¾ãããæ¥åç³»ã®å¦çãHadoopã§ãã£ã¦ã¿ãã¨ãã試ã¿ã¯ãããããããã¨ã«å¸å ´ã§ã¯ã¢ã¯ã»ããããã¤ã¤ããã¨æãã¾ããæ¥åç³»ã®å©ç¨ã®éãæãã¦ããã®ã¯ãHadoopã³ãã¥ããã£ã«ã¨ã£ã¦ãããäºã§ã¯ãªããã¨æãã¾ãã
ã欧米ã®ç¾ç¶ã®ãããªããã°ãã¼ã¿ä¸æ¬æ§ã ã¨ãããã°ãã¼ã¿ã®å®ä½ã®ãªãæ¥æ¬ã§ã¯ããã«ãæ½°ããã¨ãã«ãåæ£ã»ä¸¦åã®å©ç¨ãã®ãã®ãæ½°ããããªãã§ãããã®æå³ã§ã¯ãã³ãã¥ããã£çã«ã¯ãæå¹ãªæè¡ããããªãã«æ®ãã¨ãããã¨ã«æå³ãããã¨ã¯æã£ã¦ãã¾ããä»ã®ãã¼ã±ãã£ã³ã°ã®æµãã¯ãå¾åã®CRMã®ããã«ãã£ãããªã®ã§ãå·éã«èããã°ãã©ã£ãã§æ½°ããããªãã¨ã¯æãã¦ãã¾ãããã®æã«ã確å®ã«æ®ãæµãã«ã¯ãããã§ãããï¼ã¨ã¯ãããèªåãã¯Hadoopã使ããã¨ãç®çã§ã¯ãªããæ¥åã¡ãªãããåããæè¡ã®å°å
¥ã»å±éãç®çã§ã¯ããã¾ãããï¼
話ãã骨åã¯å²ã¨ç°¡åã§ã大ä½ä»¥ä¸ã®è©±ã«ãªã£ãã¨æã£ã¦ãã¾ãã
1. æ¥åãããå¦çã§ã®Hadoop
ãç¾å¨ã®ã¨ãããåºæ¬çã«å¤é度ã§åè² ãã大è¦æ¨¡ãããã¯ã»ã¼Hadoopã«è»é
ãããã£ã¦ãã¾ããä»ã¾ã§ã¯ãåççã«ã§ããã¨ããããã°ã§ããã¨ããç¶æ
ã ã£ãã®ã§ãããã»ã¼ä¸å¹´è¶
ã®éç¨ããã£ã¦ã¿ã¦ååå®ç¨å¯è½ãªé åã«å°éãã¦ãããã¨ãããã£ã¦ãã¾ãããã®ç¯å²ã§ã¯æ±ç¨æ©ã§ãããã¨ãRDBMSã§ãããã¨å®éç¨ã¬ãã«ã§ã®ããã©ã¼ãã³ã¹ã§ã¯ãHadoopã®å§åãã¨ãããã¨ã«ãªãã¾ãããçµè«ã¯ã§ã¦ãã¾ãã
ãã¨ããããç¾å®ã¯ããããã®ãæ°ãããèªä½ã§ããã°ãå¤é度ã§åè² ãã大è¦æ¨¡ããããããå°è¦æ¨¡ã»å°ãã¼ã¿ã§ã®ãããå¦çã®æ¹ãå¤ãã§ããããããå§åçã«å¤ããå¿è«ãããã®å
訳ã¨ãã¦ã¯ãã¯ã¼ã¯ãã¼ãããèªä½ã¯ããã¼ããããééããã§ããã¡ãã«ç®ãè¡ã£ã¦ãã¾ãããå®éã«Hadoopã§ã®ããã¼ããããç縮ãããã¨ãã·ã§ã¼ãããããç®ã«ã¤ãããã«ãªãã¾ãããããã©ããããï¼ã課é¡ã§ãã
2. Asakusaã®å¯¾å¿
ããã¯ãèããã°è³æ¥µå½ããåã®åççãªå¯¾å¿ãã§è§£æ±ºãã¾ããããªãã¡ã·ã§ã¼ããããã¯RDBMSå®è¡ããããã¼ãããã¯Hadoopã§å®è¡ãããã¨ãã風ã«å¶å¾¡ãã¦ãåçã«Asakusaãæé©åãã¾ããï¼èª¤è§£ã®ãªãããã«ããã¨æé©åã®ç¨åº¦ã¯å¶å¾¡å¯è½ã§ããããç¨åº¦ãå¦çæéãèªã¿ããã±ã¼ã¹ãããã¨æãã¾ãã®ã§ãï¼ã¤ã¾ãAsakusaDSLã§ããããæ¸ãã¦ããã°ããã¨ã¯ã³ã³ãã¤ã©ããããªã«æé©åãã¦ããããã¨ããä»çµã¿ã«ãªãã¾ãã
ãå
ã
ãAsakusaã¯RDBMSã¨Hadoopã®é£æºãã³ã¢ã«ããã¾ãããããã£ã¦ãå®è¡åºç¤èªä½ã¯æ¢ã«ããããã§ãæ°è¦ã«RDBMSã追å ãã¦ãã ãããã¨ãããã¨ã§ã¯ããã¾ããããã¨ãã¨ããRDBMSã®ãã®ä¸ã§ãå¦çãå®è¡ãã¾ãããã¨ãããã¨ã«ã表é¢ä¸ã¯è¦ããã¨æãã¾ããå®ä½ã¨ãã¦ã¯ãAsakusaãè¤æ°ã®å®è¡ã¨ã³ã¸ã³ã¨ãã¦ãRDBã¨Hadoopãå¾ããã¨ããã«ã¿ãã«ãªãã¾ãã
3. æå³ä»ã
ãããã¯ä¼å ´ã§ã¯è©±ããªãã£ããã©ãã¤ã¾ãAsakusaã®ä½ç½®ä»ããæ確ã«ããä¸ä½ã«é²ãã¦è¡ãã¨ãããã¨ã§ãããã¾ããä»ã¾ã§ã®ãã¼ã±ããã§ã®ç«ã¡ä½ç½®ã¯ãHadoopã§ãããã®éçºã»å®è¡ã®ããã®ããã«ãã¨ãããã¨ã§ãã£ããã©ã次ã¯ãAsakusaã¯æ¥åç³»ã®ãããå¦çã®éçºã»å®è¡åºç¤ã§ãããè¤æ°ã®å®è¡åºç¤ã®ä¸ã¤ã¨ãã¦Hadoopãé¸æãããã¨ããæå³ä»ãã«ãªãã¾ãã
ãå®ã¯ããã¨ãã¨ã®åºçºç¹ã¯å¾è
ã§ãã£ãã®ã ãã©ãããã°ãã¼ã¿ã»Hadoopã®ãã¼ã±ãã£ã³ã°ã®ããºã¯ã¼ãããã¾ãå©ç¨ããã¦ãããããã«ãç¨åº¦ã®éãã§ã¯ããã¾ãããåè
ã§ã®ã¡ãã»ã¼ã¸ãå¼·ãåºãã¦ãã¾ãããããããHadoopãååã ãã¯æ®åãã¦ããã®ã§ãããããããã¡ãã»ã¼ã¸æ§ã¯èãã¦ãããé åãã ã¨ãå人çã«ã¯ãæãã¾ããã¾ããã ãä¼ç¤¾ã®å¤æãã©ããªããã¯å¥ã§ããã»ã»
ããã¨ãã¨èªåãã¯åé¡è§£æ±ºå¿åã§ãã£ã¦ããã®ã§ãHadoopã§è§£æ±ºã§ããªãã®ã§ããã°ãä»ã使ãã°ãããã¨ããã¹ã¿ã³ã¹ã§ãã解決ãåªå
ã§ãã£ã¦ãHadoopã使ããã¨ãããèªä½ããç®çã§ã¯ããã¾ããããªã®ã§ãä»ãé å¼µã£ã¦ãããããªHadoopèªä½ã§ä½ã¬ã¤ãã³ã·ã¼åã«ã¯ãå®ã¯å¦å®çã§ããããHadoopã§ã¯ãªãã§ããããã¦ããæ®éã«RDB使ãã°ããã§ãããããªãããããããã«å°ãããã¼ã¿ãµã¤ãºã§ãHadoopéããçãªåããå£éè¦ãã¾ãããã¾ãRDBMã§å¦çããã°è¯ããã¨ã
ãAWSã®ã¤ã³ã¹ã¿ã³ã¹ãæè¿ã®ãµã¼ãã¼ã¹ããã¯ãè¦ãã°åããã¨ãããåãã¼ãã«æè¼ã§ããã¡ã¢ãªã¼éã¯æ¥½åã§100Gãè¶ãã¦ãã¦ãã¾ãããã®ãã¡æ®éã«1Tã«ãªããããããã®ã¬ãã«ã®ãããã°ãã¼ã¿ããã¤ã¾ãTãã¤ãã¢ã³ãã¼ã®ãã¼ã¿ãµã¤ãºã§ãMPPç³»ããããé å¼µã£ãã¨ããã§ãRDBã«ã¯åã¦ãªãã§ãããã
3.éç¨ã®å¼·å
ãããããå®éç¨ä¸ã®éç¨åºç¤ã®å¼·åã§ãããAsakusaãããã§ãå¦çãéä¸ã§æ¢ãããã¾ãæ¢ãã¦ã¨ããããåéãããã¨ãã£ããæ®éã«ã§ããªãã¨ã¾ãããã¨ããæ®éã«ã§ããããã«ãã¾ããããã¨ãã話ã§ããBIç³»ã ã¨ã¯ã¨ãªã¼æãã¦ãé§ç®ãªãæåããã§ãªãã¨ãéç¨ã§ãã¾ãããæ¥åç³»ã ã¨ããã¯è¡ãã¾ããããããã¾ã¼ãå°å³ããã¦ãã¾ãç®ç«ããªãæãã§ã¯ããã¾ãããä½æ°ã«çµæ§éè¦ã ã¨æã£ã¦ãã¾ãã
æ¦ãã以ä¸ã®ãããªè©±ãã¨ã
ã»ãã£ããææ³ã§ããå
¨ä½çã«ã¾ã¼ãããªãçãä¸ãã£ã¦ããã£ããªãã¨ãæåã®ãã¼ãã¼ãã®éå§æç¹ã§ã®äººæã®å°ãªãã¯ã¤ããæãããªãããã¾ããããæå¾ã¯å¾ãã§ç«ã£ã¦ãã人ãåºã¦ãããããªæãã§ããã¨ã¯ãããä»åããã¼ã¯ãªãããªæ°ããã¾ããä»å¾ã¯æ¥çµãã主ä½ã®ãå売ã¢ã¼ãå
¨éããã¸ã·ã§ã³ãã¼ã¯ããã¼ã±ãã£ã³ã°ã¯ã¼ããããã°ãã¼ã¿ææºè¼ãã®ã¨ã³ãã©ã»ã»ããã¼ãåå°ã§éãããã§ããããããã¨ã³ãã©ç³»ã®äººã¯ãã£ã¡ã§è©±ãããã§ãããããã³ãã¥ããã£çã«ã¯ãã以ä¸ã®æé·ã¯ã¾ãéã£ã話ã«ãªã£ã¦ãã¾ããã¨ã
ãã¾ãã¶ã£ã¡ããç¾è¡ã®Hadoopã§ã¯ãããæè¡ãã¿ã¯ãªãã£ããwãæ¬ã沢山åºã¦ãããèãäºãªãã§ããã»ã»ã»Hadoopãã©ããªãã®ã¯ãããã§ãã§ãã§ãããããããã¯ããã§å¥ã§å
輪ã§ãããã¿ãã¨ã
ãããªæãã