ã¨ã³ã¿ã¼ãã©ã¤ãºåãã«ãã¡ã¤ã«ã·ã¹ãã ãé²åããã
ãå¹ççã«ä¸¦ååæ£å¦çãè¡ããHadoopã®ã¢ã¼ããã¯ãã£ã¯ãããã°ãã¼ã¿æ代ã«ããªãæå¹ãªä»çµã¿ã ããããã¨ã¯ãããHadoopãããä¼æ¥ãå©ç¨ãããã¨ããã¨ããã¾ãã¾ãªåé¡ãçºçãã¾ããçºçããåé¡ã®å¤ãã¯ãHadoopã®åæ£å¦çãå®ç¾ããããã®ãã¡ã¤ã«ã·ã¹ãã ãHDFSãã«ããã¾ããã¨è¿°ã¹ãã®ã¯ããããã¢ã¼ã«ã»ãã¯ããã¸ã¼ãºã®ã»ã¼ã«ã¹ãã£ã¬ã¯ã¿ã¼ å¹³æè¯ææ°ã ã
ããã®HDFSã®èª²é¡ã解決ããããã«ãMapRã§ã¯ç¬èªã®MapRãã¡ã¤ã«ã·ã¹ãã ãçã¿åºããããã®MapRãã¡ã¤ã«ã·ã¹ãã ã®Lockless Storage Serviceã¢ã¼ããã¯ãã£ãèãåºããã®ããå社ã®åµæ¥ã¡ã³ãã¼ã§ããChief Technology Officerã®M.C. Srivasæ°ãå½¼ã¯ãNetAppã®å身ã§ããSpinnaker Networksã§Chief Architectãå¤ãã人ç©ã§ãããå°éã¯ãã¡ã¤ã«ã·ã¹ãã ã ã
ãMapRãã¡ã¤ã«ã·ã¹ãã ã®å¤§ããªç¹é·ã®1ã¤ããNFSï¼Network File Systemï¼ã¨ãã¦ãã¦ã³ãã§ãããã¨ãHadoopã§ããã°ãã¼ã¿æ´»ç¨ããããã¨ããéã«ã大ããªåé¡ã¨ãªãã®ããªã¬ã¼ã·ã§ãã«ãã¼ã¿ãã¼ã¹ãå¤é¨ã®ãã¼ã¿ã½ã¼ã¹ããã®ãã¼ã¿ç§»åã ããã¹ã¦ã®ãã¼ã¿ãæåããHDFSã«ããã°åé¡ãªããã大æµã¯Hadoop以å¤ã®ãªã¬ã¼ã·ã§ãã«ãã¼ã¿ãã¼ã¹ãã¹ãã¬ã¼ã¸ã«ãã¼ã¿ã¯åå¨ãããããããHadoopä¸ã§å¦çããããã°ãHDFSã®ä¸ã«è¼ããªããã°ãªããªãããã®ãã¼ã¿ç§»åã®å¦çãããã¼ã¿ã大ãããªãã°ãªãã»ã©å¤§ããªè² è·ãä¼´ãæéãããããã¨ã«ãªãã
ãMapRãã¡ã¤ã«ã·ã¹ãã ã¯NFSã§ãã¦ã³ãã§ããããã¦ã³ãããã°Hadoop以å¤ã®ã¢ããªã±ã¼ã·ã§ã³ããã¯ãåãªããããã¯ã¼ã¯ã¹ãã¬ã¼ã¸ã«è¦ãããããã«ããããã¼ã¿ãå°ç¨ã®ãã¼ã«ãã¹ã¯ãªããã使ç¨ãHDFSã«ç§»åããã¨ããæéãªããä»ã®ã¢ããªã±ã¼ã·ã§ã³ããã¼ã¿ãã¼ã¹ã¨ã®éã§ãã¼ã¿å ±æãå¯è½ã¨ãªãã
ããWebãµã¼ãã¼ã®ãã°ãã¡ã¤ã«ãHadoopã§åæããããé常ã¯ãµã¼ãã¼ããåãåºããããã°ãã¡ã¤ã«ãåå¾ãããããå å·¥ãã¦HDFSã«è¼ããå¦çãå¿ è¦ã«ãªãã¾ããMapRã®å ´åã¯NFSãã¦ã³ããããMapRã®ãã¡ã¤ã«ã·ã¹ãã ã«Webãµã¼ãã¼ãç´æ¥ãã°ãåãåºãããã«è¨å®ããã°ãMapRã®Hadoopã§ããããã®ã¾ã¾å¦çã§ãã¾ãããããå¦çã«ãããã¡ã¤ã«è»¢éãªã©ãå¿ è¦ãªãã®ã§ã常ã«ææ°ã®Webãã°ãåç §ãåæãè¡ãã¾ããï¼å¹³ææ°ï¼
ã ã¾ããç¬èªãã¡ã¤ã«ã·ã¹ãã ã¨ãããã¨ã§ãä¼æ¥å©ç¨ã§å¿ è¦ãªãã¼ã¿ä¿è·æ©è½ãå¼·åã§ããããªã¼ãã³ã½ã¼ã¹çã®Hadoopã§ãJobTrackerãé害æã«ãã§ã¤ã«ãªã¼ãã¼ããããã¨ã¯ã§ãããããããªãããé害çºçæã®å¦çã¯ç¶ç¶ããããã¸ã§ãèµ·ååã¾ã§æ»ã£ã¦ãã¾ããMapRã§ã¯ãã¨ãã¨æ§æãHAï¼High Availabilityï¼åãã¦ããããã§ã¤ã«ãªã¼ãã¼æã«ãå¦çã¯ç¶ç¶ãããããã«ãªã£ã¦ãããæéã®ããããããå¦çãªã©ã§ã¯ãå¦çãç¶ç¶ã§ãããã§ããªããã®éãã¯éç¨ã«å¤§ããå½±é¿ããã
ãããã«ãMapRã«ã¯ãã«ãããã³ãæ©è½ãããããã¡ã¤ã«ã·ã¹ãã ã®ä¸ãããªã¥ã¼ã ã¨ããåä½ã«åãããã¨ã§ã§ããããªã¥ã¼ã ãã¨ã«å©ç¨ã§ãããã¼ã¿å®¹éã«å¶éãè¨ããããããç´°ããã¢ã¯ã»ã¹å¶éãããããããã¨ãã§ãããããã«ãããã³ãã¯MapRãªãã§ã¯ã®æ©è½ã§ãããã¾ãã«ã¨ã³ã¿ã¼ãã©ã¤ãºåãã®ãã®ã§ããã¨å¹³ææ°ã¯è¨ãã
ä¼æ¥ã¦ã¼ã¹ã®ããã«ã¯ããã¡ã¤ã«ã·ã¹ãã ã®ã¢ã¼ããã¯ãã£ã®è¦ç´ããå¿ è¦ã ã£ã
ããMapRã§ã¯ããã¡ã¤ã«ã·ã¹ãã ã®ã¢ã¼ããã¯ãã£ãå¤ãã¦ãã¾ããããã«ããããªã¼ãã³ã½ã¼ã¹Hadoopã®ãã¾ãã¾ãªããã«ããã¯ã解æ¶ãã¦ãã¾ããã¨è¿°ã¹ãã®ã¯ããããã¢ã¼ã«ã»ãã¯ããã¸ã¼ãº ã·ã¹ãã ã¨ã³ã¸ãã¢ã®èèæ彦æ°ã ãå··ã§ã¯ãããMapRã¯é«éåã®ããã«HDFSã®Javaã®å®è£ ãC++ã§æ¸ãæããã¨è¨ããã¦ããããããã«C++ã使ã£ã¦æ¸ãæãã¦ã¯ããããåç´ã«ãã©ãããã©ã¼ã ã®ãã¤ãã£ãå®è£ ã«ãããã¨ã§é«éåãããã¨ããã®ã§ã¯ãªãã
ã ãHDFSã«ãããªã¼ãã¼ããããåãé¤ãããããã®1ã¤ãJavaã®ã¬ã¤ã¤ã¼ã ã£ãã®ã§ãããããåãé¤ãã¹ãã¨å¤æããçµæçã«C++ã§æ¸ãæãããã¨ã«ãªã£ãã®ã§ããï¼èèæ°ï¼
ãä¸ä¾ã¨ãã¦ãJavaã¬ã¤ã¤ã¼ãå ¥ã£ã¦ãã¾ãã¨ãã©ããã¦ãã¬ãã¼ã¸ã³ã¬ã¯ã·ã§ã³ã®åé¡ãçºçããã大容éã¡ã¢ãªã¼ã管çãã¦ããç¶æ³ã§ãå é¨çã«ãã¼ã¿ã®çæãåé¤ãç¹°ãè¿ãã¨ã¬ãã¼ã¸ã³ã¬ã¯ã·ã§ã³ãèµ·ããåãã極端ã«é ããªã£ã¦ãã¾ãããã®åé¡ããJavaã¬ã¤ã¤ã¼ããªãããã¨ã§è§£æ¶ãããããã®çµæãC++ã®æ¡ç¨ã¨è¨ãããã ã
ã åæ§ãªæ¹éã§ãMapRã§ã¯ãã¼ã ãã¼ãé¨åã®ã¢ã¼ããã¯ãã£ãå¤æ´ããããªã¼ãã³ã½ã¼ã¹ã®Hadoopã§ã¯ãåºæ¬çã«ãã¼ã ãã¼ãã1ã¤ã®ã¯ã©ã¹ã¿ã«1ã¤ã¨ããæ§æã¨ãªã£ã¦ããããã®ãããé害ãçºçããã°ã¯ã©ã¹ã¿å ¨ä½ã使ããªããªãããã¼ã ãã¼ãã®HAåã«ãã£ã¦é害ã«å¯¾å¿ãããã¨ãã§ããããå¦çã1ç®æã«éä¸ãããã«ããã¯ã«ãªãã¨ããåé¡ã¯å¤ãããªãã¾ã¾ã§ããã
ãMapRã§ã¯ããã¼ã ãã¼ããã¯ã©ã¹ã¿ã®ä¸ã§åæ£ããã¢ã¼ããã¯ãã£ãã¨ã£ã¦ãããããã§åä¸é害ãã¤ã³ãã¯ãªããªããããã«ãã¼ã ãã¼ãå¦çã®ããã«ããã¯ããªããªããããã以å¤ã«ããç¬èªã®ãã¡ã¤ã«ã·ã¹ãã ã¨ãããã¨ã§ãã©ã¼ãªã³ã°æ©è½ã«ããç½å®³å¯¾çããã¹ãããã·ã§ããæ©è½ã«ãããã¼ã¿ä¿è·ã¨ä»»æã®æç¹ã¸ã®ãªã«ããªã¨ãã£ããã¨ã³ã¿ã¼ãã©ã¤ãºåãã®ã¹ãã¬ã¼ã¸è£ ç½®ãæã¤ãããªæ©è½ããã¡ã¤ã«ã·ã¹ãã ã®ã¬ãã«ã§å®è£ ãã¦ãããã¾ãApache Hadoopã§ã¯å ¨ä½ã®1/3ç¨åº¦ããå©ç¨ããããªããã¼ãã¦ã§ã¢ãªã½ã¼ã¹ããMapRã§ããã°ã»ã¼æ§è½éçã¾ã§ä½¿ãåããããã«ããªã£ã¦ããã
ãããã®ããã«ãã¡ã¤ã«ã·ã¹ãã ã®ã¢ã¼ããã¯ãã£ãå¤æ´ãã¦ãã¦ããã¤ã³ã¿ã¼ãã§ã¤ã¹é¨åã¯100ï¼ Apache Hadoopã¨ã®äºææ§ãå®ã£ã¦ãã¾ããã¨å¹³ææ°ã¯è¨ããããã¯ãMapRã®é¡§å®¢ãæãã§ãããã¨ã§ããããããããããã®æ¹éã¯å¤ãããã¨ã¯ãªããäºææ§ãããã®ã§ãHadoopã®ã¨ã³ã·ã¹ãã ã§ãããImpalaãHiveãªã©ã®ãã¾ãã¾ãªHadoopä¸ã®ã¢ããªã±ã¼ã·ã§ã³ããã¹ã¦ãã®ã¾ã¾åãããã¨ãã§ããã
ããApache Hadoopã§åãã¢ããªã±ã¼ã·ã§ã³ãã©ã¤ãã©ãªãæ¢ã«ãã£ã¦ãä½ãæãå ããã«åãã¾ããæ¢åã®è³ç£ã¯ç¡é§ã«ãã¾ãããï¼èèæ°ï¼
ãäºææ§ãããã ãã§ãªããéç¨ç°å¢ã®æè»æ§ãé«ããHadoo 2.xã§å°å ¥ãããæ°ããã¢ããªã±ã¼ã·ã§ã³ãã¬ã¼ã ã¯ã¼ã¯ã«YARNããããHadoop 1.0ãåããã¦ãã¦ããã«YARNãåãããããã°ãé常ã¯å¥ã®ã¯ã©ã¹ã¿ãå¿ è¦ã ãMapRã§ã¯ããã1ã¤ã®ã¯ã©ã¹ã¿ã ãã§åããããããã«ããéç¨ã³ã¹ãã®åæ¸ãå¯è½ã¨ãªããããã«å¤ãã¢ããªã±ã¼ã·ã§ã³ãã¬ã¼ã ã¯ã¼ã¯ããã®ç§»è¡ã®éã®æéã¨ãªã¹ã¯ãä½æ¸ã§ããã
ãã¾ãä¸è¬ã«ãã£ã¹ããªãã¥ã¼ã·ã§ã³ãå©ç¨ããå ´åã«ã¯Hadoopã¯ã©ã¹ã¿ã®ãã¼ã¸ã§ã³ãä¸ããéã«ããã®ä¸ã§åãã¨ã³ã·ã¹ãã ã®ã¢ããªã±ã¼ã·ã§ã³ããã¼ã¸ã§ã³ãä¸ããªããã°ãªããªããMapRã§ã¯ãã¯ã©ã¹ã¿ã¨ã¨ã³ã·ã¹ãã ãå¥ã ã«éç¨ã§ããããã«ãã¦ããããã®ãããã¯ã©ã¹ã¿ããã¼ã¸ã§ã³ã¢ãããã¦ããå¿ ãããä»åãã¦ããã¢ããªã±ã¼ã·ã§ã³ã®ãã¼ã¸ã§ã³ãä¸ããªãã¦ãããã®ã ãã¦ã¼ã¶ã¼ã¯ãå®å®ããã·ã¹ãã ãé·ã使ãç¶ãããã¨ãå¯è½ã¨ãªãã