æ¥å¹´ã®Hadoop
Hadoopã¢ããã³ãã»ã«ã¬ã³ãã¼ã®å¤åæçµæ¥ã®ã¯ãã
ãã£ãããªãã§ãæ¥å¹´ã®äºæ³ã§ããã¦ã¿ãããã¨ã
æ¥æ¬ã®è©±ã§ããä¸çã®ãã¨ã¯ãããããã¾ãããæ¬å½ã®ãã¨ã¯ãæ¥æ¬ã«ã¯ä¼ãããªãï¼è¡¨åãã®è©±ã¯ã¨ããããç¾ç¶ã§ã¯VCãããã®å¤éã®æ¹ãçºè¨åãããã¨æãããåããã§ãããã®è¾ºã®æ£ç¢ºãªæ å ±ã¯ä¼æãã¦ãæ°ããã¾ããï¼ã¨æãã®ã§ãã¨ã¯ãããæ¥æ¬ã®Hadoopãã¼ã±ããã¯ããããªãããã£ã¦ããï¼ã¨ããããããã£ã¦ããªãã¨ã¾ããï¼æãã¿ãããªã®ã§ã»ã»ã»åæã«ãæ¥å¹´ã®Hadoopã¨ãäºæ³ãã¾ããå¤ãããç¼ãèãããã¾ãã
1 大éãã¼ã¿å¦çã§ã®ããã¡ã¯ãå ã»ããããWebç³»ã§ã¯ã¤ãã£ã¦ããªãã¨ããã¯ä¸ç¤¾ããªããªã
ç¹ã«ã¬ã³ã¡ã³ãã¼ã·ã§ã³ã¨ã³ã¸ã³ãããã¯ãããæ®éã«å®è£
ãã¦ä½¿ãããã ãããã以ä¸ã®ãã®ã¯åºãªããéè¨å¦çã¨æ¨è«ããã¾ãå©ç¨ããã¬ã³ã¡ã³ãã¼ã·ã§ã³ã¨ã³ã¸ã³ï¼ã¨ãã®äºæµï¼ã徿¥ããã®ãã£ã«ã¿ãªã³ã°ã®ã¨ã³ãã³ã¹ã§ã®å©ç¨ã§æ®éã«ä½¿ããããä½¿ãæ¹çã«ã¯ããã®è¾ºã§ã¡ãã£ã¨é æã¡ã«ãªããå®è£
ã¯ãèªåæ´¾ã¨ãã³ãã¼ãä»»ãæ´¾ã¨AWSã®ï¼ã£ã¤ãæ½®æµã«ãªã£ã¦ãå¤åèªåæ´¾ã¯éç¨ã§ããªãã¦ç¸å½æ°ãçç ããããã®ä¸æ¹ã§ã徿¥ã®ããã«æè¶³ã®ããã«ã¡ããã¨ä½¿ã人ãã¡ããããªãã«åºã¦ãããä¸ã¤ã®ãªãã§ã¯ãå人çã«ã¯AWSç³»ããã£ã¨ãå¹çãè¯ãã¨æãã¾ãããã¾ãã§ã
2 ãã£ã¹ããªãã¥ã¼ã·ã§ã³ã®å¤æ§åã»Apache (Horton)
ã»CDH (Cloudera)
ã»MapR (EMC)
ã»Windows Hadoop (MS)
ã»IBM
ãããªãã¨ã5ã£ã¤ã®ãã©ãããã©ã¼ã ããããªãã«åºã¦ããã
ãããã«ãã¦ãããã¼ã±ããã¯ã¨ã³ã¿ã¼ãã©ã¤ãºä¸»å°ã«ãªããWebç³»ã®BIã¨ã§ã¯åãã¦ããéã®æ¡ãéãã
ã¨ã³ã¿ã¼ãã©ã¤ãºã§ã®æ¬å½ã¯MapRã«ãªãããã¯ãEMCã®å¶æ¥åã¯åãã®ã§ãã¾ã¨ãã«ãã¼ã©ã¼å§ããããå¸å ´ãå¸å·»ããã¨æãã
ã¨ã³ãã©å¸å ´ã§ã¿ãã¨ãOSSç³»ã§ã¯æ¥æ¬ã§ã¯ä¸æããã¨Apacheãããã¼ã®æ¹ãã¤ã³ã¹ãã¼ã«ãã¼ã¹ã§CDHãæããããããªãã1.0.0ã«ãã¼ã¸ã§ã³ã決ããã®ã¯æ®åã¨ããç¹ã¯ãã©ã¹ã«æ¯ãããCDHã«ã¤ãã¦ã¯ãæ¥æ¬æ³äººã®ç¾ç¶ã®ãªãã¬ã¼ã·ã§ã³ãå¾®å¦ã§ãç¾æç¹ã§å è¡ã®è²¯éã使ãæãããæãããããï¼ããä¸å¹´ã§ãã¼ããã¼ãNTTDã®ä¸ç¤¾ã ãã§ãã¡ãã£ã¨ãã¬ã¼ã³ã¹ããããå人çã«ã¯@shiumachiããã«ã¯ãæ¥æ¬äººHadooperã¨ãã¦è¶ é å¼µã£ã¦ã»ããã®ã§ãããé å¼µãï¼ï¼Apacheã§ã¯ãHortonãã¹ããã¯ãªã¼ãã¼ãªæãã«ãªã£ã¦ãã¦ããã®ã§ããããªClouderaã¨ã¯ãããããªãè¦æ¦å¿ æ»ããæ¬å®¶ã®çæ¿ãã¯æ¥æ¬ã§ã¯Apacheã®æ¹ã«ãªã£ã¦ãã¾ãããã§ã
ãã¼ã¯ãã¼ã¹ã¯MSãDryadããã£ã±ãåã£ããã¨ããè¯ãæ¹åã«é²ãå¯è½æ§å¤§ããã£ã±ãwindowsã§ä½¿ããã使ãã®ãæ¥æ¬äººã¨ããããæ¥æ¬ã®ä¼ç¤¾ã§ããIBMãçµæ§å¼·ã説ãããã¾ãããããã¯å¾æ¥ã®IBMã ã£ãå ´åã®è©±ã ã¨æããç¾ç¶ã§ã¯å¤±éæ°å³ãç¹ã«å¤æ®µã
ã¾ãè¦ããã«ãCDHãããã¡ã¯ãç¶æ ã§ã¯ãªããªã£ã¦ãæ¬æ ¼çãªç«¶äºãå§ã¾ãã¾ãããã¨ããæããã¨ã
3 ã»ã³ãµã¼ãã¼ã¿ã§ã®å©ç¨ãçé¢ç®ã«æ¤è¨ããå§ããç¹ã«ãæ¥æ¬ã®ãéå°å質ãªçµã¿è¾¼ã¿ãã¼ã¿åéãã¯ä»ã¾ã§çé¢ç®ã«ä½¿ãã¦ã¾ããçãªæ±ãã ã£ãã®ã§ããããã©ãåæãã¦ããããï¼çãªè©±ãåºã¦ãããç¹ã«ãã»ã³ãµã¼ãã¼ã¿ç³»ã¯çµã¿è¾¼ã¿ç³»ã¨ã®æ¥ç¶ã«ãªã£ã¦ããã®ã§ãæè¡çãªåé¡ããããã³ãã¼ã®çµã¿æ¹ã®åé¡ãã¯ãã¼ãºã¢ãããããã
æè¡çã«ã¯ãã»ã³ãµã¼ããã®ãã¼ã¿åéã®æ¬¡ã«ãã£ã¼ãããã¯ãã©ãæ»ããï¼ã¨ããäºãå½ç¶è°è«ã«ãªãã®ã§ããããªã¢ã«ã¿ã¤ã æ§ãæ±ãããããåæã«ãCEPã®æªå¤¢åã³çãªè©±ãåºã¦ãã¦ãæ··ä¹±ããæãã«ãªããããããHadoopã¯ã¹ã«ã¼ãããã追æ±ããä»çµã¿ãªã®ã§ãã¬ã¤ãã³ã·ã¼ã®è¦æ±ã¯çãéããæ¹åã¨ãã¦ã¯ããªã¬ãªã¬Hadoopã¯ãªã¢ã«ã¿ã¤ã ã§éããããçãªãå¾®å¦ãªä»çµã¿ã¨ãHadoopã§è¡ãã¾ãã£ã¦ã¹ã¿ã¼ããã¦ãçµå±ä½¿ãã¾ããã§ããã¨ããæ®å¿µããã¸ã§ã¯ããå²ã¨ç®ã«ä»ãããã«ãªãã
ãã¸ãã¹çã«ã¯ããªããã§ããããããªãã¯å¤§æµã¯å ¨é¨é§ç®ãç¸å ´ããã ããå°æ°ã§ã¯ããããããããåæ©è½ã§ãå²ãåã£ããããã½ãªã¥ã¼ã·ã§ã³ã§ä»ãå§åããã¨ãããåºã¦ããã¨æãããããåã¡çµãããããããããã¯æ¦å¨ã®ã¯ãããã£ãããªã¼ã«çµã¯å ¨æ» ã
4 å°å³ãªé¨åã®é£æºæ©è½ã¨æ°¸ç¶åå±¤ã«æ³¨ç®ããã¤ã¾ã飿ºã«ã¤ãã¦ã¯ãæ£ç´ãä»ã¾ã§ã¯æ¬å½ã«ããªãã¡ãã£ã¦é£æºããã»ã¨ãã©ãåºæ¬ãèªåã§ã¤ãªããéåã«ãªã£ã¦ããã®ã§ãMãªäººããã¨ã¦ãããã¦ãªããè·äººä¾å度ãå¼·ãããã®è¾ºãã¡ããã¨ã©ãããããï¼ã¨ãã話ã常ã«ããããçãã¦ããæããä»ã®ã¾ã¾ã ã¨ä¸å®å ¨çç¼ã§ä¸é ¸åçç´ ä¸æ¯ãããã©ã¼ãã³ã¹ã«åãæ¹ã¨ãå®å ¨ã«åãæ¹ã®ï¼æ¹ååºã¦ãããä»ã¯ãã°ãã¼ã¿ã®åæ¾ã§ãã©ãããã°è¯ãçãªè©±ãå¤ããããã®ãã¡æ§é åãã¼ã¿ãåãè¾¼ã¿ãããã¨ãã話ãåºã¦ãã¦ãHadoopã£ã¦å®ã¯æ§é åãã¼ã¿ã«æãããå¼±ããªãï¼ã£ã¦ãä»é æ°ãã¤ã人ãåºã¦ããããã®è¾ºã¯ãHadoopã®ããæ¹ã«ãé¢ããã
æ°¸ç¶å層ã«ã¤ãã¦ã¯ãå®å ¨æ§ã»ä½¿ãåæã«ããæ³¨ç®ãéã¾ãã ãããMR2.0ç³»ã«ã¤ãã¦ã¯ãä¸ä½ã®ã¢ã«ã´ãªãºã ããããããæ°¸ç¶å層ã®HAãããã®æ¹ã注ç®ãããã忣ãã¡ã¤ã«ã·ã¹ãã ã«HDFSã®APIã使ã£ããã®ããå²ã¨ç®ã«ã¤ãããã«ãªããåç´ãªæ°¸ç¶åã ãã§ã¯ãªãã¦ããããã¯ã¼ã¯ã¨ãã¿ã§è°è«ãããããã«ãªããä½ãããã¡ã¯ãã«ãªããã¯ãå¸å ´æ¬¡ç¬¬ã
5 OSSã®æå³ä»ããå¸èã«ãªãããã¨ã³ã¿ã¼ãã©ã¤ãºãã®æå³ãã¯ãã¼ãºã¢ãããããã¨æããä»ã®çHadoopã¯ãã¨ã¦ãã¨ã³ã¿ã¼ãã©ã¤ãºã¨ã¯è¨ããªããã«ãããããããHadoopã¨ã³ã¿ã¼ãã©ã¤ãºã¨ãã声ã大ããããããã¯ãã¸ã·ã§ã³ãã¼ã¯ã¨ããã£ã¦è¨ã£ã¦ããã¯ããï¼ããã¾ãã¯æ¬å½ã«ä½ãç¥ããªãã ãã®å¯è½æ§ããããï¼ãã¨ã³ã¿ã¼ãã©ã¤ãºãªããããã§ãããããã§ãããã£ã¦è¨ããã¦ãããããHadoopã¯OSSãªãã§ãèªåã§ãã£ã¦ãããã£ã¦è¨ãã¨ããã¯ï¼ãã¨ãè¨ãããããã§ããããããBIã¯ãæ¬æ¥çã«ã¨ã³ã¿ã¼ãã©ã¤ãºã§ã¯ãªããã¨ãããããã¿ã³ã®æãéãã®å§ã¾ãã
ç¹ã«ã¨ã³ã¿ã¼ãã©ã¤ãºã ã¨ããã³ãã¼ä¸»å°ã«ãªãã®ã§ãèªåã§ãªãã¨ãããOSSãã¯å½±ãæ½ããããããã¯ãè²ã®å¼·ãOSSãã«è»¸ãç§»ããå種ã®ãã£ã¹ããªãã¥ã¼ã·ã§ã³ã«ããåæ§ã®è¦æ±ãçªãã¤ããããã
6 ãã©ãã«å¤çºHadoopã¯æ·å± ããããããã®åããã¸ã§ãªãã£ã¼ãããããã«ãªã£ããå½ç¶ã®çµæãç¾é¬¼å¤è¡ã®æªå¤¢åã³ããã»ã»ããããä½ãã¦ãã§ãããããããçãªè»å£ãã©ãã©ãåºã¾ãããã©ãã«åºã¾ããã¾ããä¸ã®ä¸ã®å¸¸ãªã®ã§ããããå«ãã¦ãæ®åãã¨ããã®ã§ããããã»ãã¨ã«ä½¿ããã®ãï¼ããããããä¸å¿OSSã§ããããããããã§ããåãã¨ã³ã¿ã¼ãã©ã¤ãºã£ã¦è¨ã£ããããï¼ã»ã¼ããºã¼ãã«æ¸ãã¦ãããããwwwwwwwwwwããããªä¼è©±ãåæã§èãããã
7.ããã°ãã¼ã¿ããã«ã®æ®ãç«ãç¦ç¹ã¾ãè¦ããã«ããã°ãã¼ã¿ã»ããã«ãå¼¾ãã¾ãããã ãæ¥å¹´ã®å¾åã¾ã§ã¯æã¤ã¨æããããç¨åº¦ãéãã¯ãããããã¨ã¯ããã2012å¹´å¾åã¯ã¡ãã£ã¨å¤±éæ°å³ã«ãªããã ã£ã¦ãéã«ãªããªããããå»¶å½ãã¦ããéã«å¥ã®ãã¿ãã§ããã©ããï¼ããã¤ã³ãã«ãªããCRMã¨ãSCMã¨ãã¯2-3å¹´ãã£ããã©ãããã°ãã¼ã¿ã¯ã¡ãã£ã¨è³å³æéåããæ©ãããã
8ãçµå±ä½ã«ä½¿ãã®ãï¼ãåé¡ã®åçå¨ããããã¾ãåãäºãã£ã¦ã®ãï¼ã¨ãæãããããããã¾ãããããã¯ãã¢ã¦ãã§ããã¾ããããããæ¥æ¬ã®ITãªãã§wã
ã¨ã¯ãããæ¥å¹´ã¯ãã忣å¦çï¼ãï¼ Hadoopã®ãã¨ã§ããï¼ãããªç¹å¥ãªãã¨ã§ããªãã§ãããã¨ããè¨èãããããããã«ãªãã§ãããããããããã«å¤©å°é©æãªãã¨ããå·éã«èããã°ãããã¨æãã¾ãããã¤ããã¼ã·ã§ã³ã¨ããã®ã¯ãèµ·ãã¦ã¿ãã¨å½ããåã«ãªã£ã¦ãããã¨ãããã¨ããããã®ãã2012å¹´ã®Hadoopã®æ¬è³ªã§ããããã¾ãéã¯ãã¼ãã§ãããçããé å¼µãã¾ãããã
æ¬å¹´ã¯å¤§å¤ãä¸è©±ã«ãªãã¾ãããæ¥å¹´ããããããé¡ããããã¾ãã