å称表è¨ãæºãã¦ã¦å¾®å¦ã ãã© Hbase at FaceBook on Zusaar ãã®ã¤ãã³ãã«è¡ã£ã¦ãããFacebookã®äººã¯ "HBase Tokyo meetup" ã¨èªèãã¦ããããã ã
å
容ã®ã¾ã¨ãã¯ãããªãã®ã§ã以ä¸ã®åãã¼ã¸ãªã©ãã覧ã«ãªãã¨è¯ãã®ã§ã¯ãªãã§ããããã
Tokyo HBase Meetup - Realtime Big Data at Facebook with Hadoop and HB…
Hbase at FaceBookのまとめ - Togetterまとめ
FacebookがHBaseを大規模リアルタイム処理に利用している理由(前編) - Publickey
FacebookがHBaseを大規模リアルタイム処理に利用している理由(後編) - Publickey
ã»ãã·ã§ã³ã®å 容ã¨èªåãèãããã¨ã¨äººã¨ããã¹ã£ããã¨ããã£ããããã«ããã«æ¸ãã¦ãããåããã®ãããã©ãã
ãã¼ã¿éã¨æ¸ãè¾¼ã¿ã¹ã«ã¼ãããã®åé¡
- ãã¼ã¿éçã«HDFSã«æ¸ããã
- 1ãã¡ã¤ã«ã«è¤æ°ã®å ´æããæ¸ããããã¡ã¤ã«ãåããã®ãå²ã¨é¢åã§ããã£ã¬ã¯ããªåããã¨MRæã«ããã©ããã
- ã²ã¨ã¤ã®æ¸ãè¾¼ã¿å¯¾è±¡ãªã½ã¼ã¹ã¨ãã¦æ±ãã¦ãé«ã¹ã«ã¼ããããåºããã®ã欲ãã
- hbase !
- ãã ããããã¾ã§ã®æ¸ãè¾¼ã¿éã«ãªããã¼ã¿ãæã£ã¦ããã¨ãããã©ãã ããããï¼ ã¯å¾®å¦
å¦çã¢ãã«ã¨å®æé追å¾ã®è©± (ãããã¯Puma欲ãã)
- appendã¨close(rotate)ã¨MR対象
- rotateåä½ãå¦çããMR(ãHive)ã¯ãããªãã«éãå¦çã«ãªã
- ããã§HDFSä¸ã§ã tail ãã¦å¦çã«ããããã PTail
- ãã¼ã¿èªä½ã¯ãã£ã¨å°ããåä½ã§ãã£ã¦ãããã ããããã£ã¦ãã端ããé次å¦çããã°ããããå®ä¸çã§ã®ã¤ãã³ãã«å¯¾ããé 延ã¯å°ãããã
- ããããã®è¨ç®ã¢ãã«ã¯ããæ¢åã®hadoopä¸ã«ã¯ãªãä½ãã ãªãåæ£å¦çã ãã©ãHDFSãã¼ã¹ã®ãã¤ãã©ã¤ã³
- ã©ã®ããã shard/pipeline æ¬æ°ãæ§æãããã¨ããã®ãèªåå¶å¾¡ãªã®ãæåãªã®ããããã¯å®éã«ã¯ã©ãã§åãã¦ããã®ã
- ã¨ããã®ãèãå¿ãã¦ããã¨ããã¾æãåºããâ¦â¦
- ãã¨ãããå¦çã«è¼ã¹ãã¨å¦çãããã®ã³ã¹ãã¯å½ç¶é«ããªã
- ãã許容ã§ããã¬ãã«ã ã¨æããå¤å
- ã·ã£ããã«ã¯hbaseã®ã½ã¼ãã«ãä»»ãã¨ããç解ã§ããã®ããªï¼
- ãããçè¦ç¹ã ã¨èªã¿åºãããªã¢ã¯ç¡ã
- ã¤ã³ã¯ãªã¡ã³ã¿ã«ãªããã¤èªã¾ãã¦ãç ´ç¶»ããªããã¼ã¿ã§ãªãã¨ãã¡
hadoopã®ä¾¡å¤ã¨ãããhdfsã®ä¾¡å¤
- ã¨ããããHDFSã«æ¸ã
- ãã¨ããMRããããããå¯è½æ§ãèãããããã¨ãHDFSã«ã¨ããããæ¸ããã®ã¯ã¢ãªã ããã
- PTailããããªãããã£ããå°ããã®HDFSã«æ¸ãã¦ããã¦ãããããé次å¦çã§ãã¼ã¿ãåãåºãã¦å¤æãã¤ã¤ãã«ãHDFSã«ããããã
- å¿ è¦ãããã°å°ããHDFSããMRã§ã¬ãã¨åãåºãç´ããã¨ãã§ãã
- PTailãããã°ãããª
- PTailãMapã«ãªããHBaseã«å¯¾ããã¯ã¨ãªãReduceã«ãªã
- HBaseã¸ã®æ¸ãè¾¼ã¿ã¾ã§ãã»ã¼å®æé追å¾ã§è¡ãããã¨ããã°ãå®æéã«å¯¾ããéè¨çµæãã¼ã¿ã®é
延ã¯HBaseã«å¯¾ããã¯ã¨ãªåã®ã¿ã«ãªã
- å¤åãããPumaã§è¨ã£ã¦ãã10ç§ã30ç§ã®é 延ã®å¤§é¨å
- PTailçãªä½ãã¯æ¬²ãããªã
- PTailå ¬éï¼ å ¬éï¼ ã¨ããããæ£ä½ãè¦ã¦ã¿ãªãã¨
- ã¾ããªããèªåéã§æ¸ãããæ°ãããããã¼ãã¨ããã®ãæ親ä¼ã§ã®ã話
- HBaseã¸ã®æ¸ãè¾¼ã¿ã¾ã§ãã»ã¼å®æé追å¾ã§è¡ãããã¨ããã°ãå®æéã«å¯¾ããéè¨çµæãã¼ã¿ã®é
延ã¯HBaseã«å¯¾ããã¯ã¨ãªåã®ã¿ã«ãªã
èªåçã¾ã¨ã
ãã°((ãã¡ãã»ã¼ã¸(Titan)ãå種ã¡ããªã¯ã¹(ODS)))ãåå¦çãã¦ããã®çµæãéç´å¦çç¨ã®ãã¼ã¿ã¨ãã¦ã©ããã«æ¸ãè¾¼ãã§ãããã¨ãããã¿ã¼ã³èªä½ã¯æ¢åã®ãã®ã¨å¤ãããªãã¦ã以ä¸ã®ãããªå¯¾å¿ã
- åå¦ç
- éç´å¦çç¨ãã¼ã¿ã¹ãã¢
- Puma以å: Hiveç¨ãã¼ãã«ã¨ãã¦ã®HDFSãã¡ã¤ã«
- Puma以å¾: HTable on HBase
ãã ã以ä¸ã®ãããªéãããããããã«ãããå®ä¸çã§èµ·ããã¤ãã³ã(ãã°æ¸ãè¾¼ã¿)ãéè¨çµæã«åæ ãããã¾ã§ã®æéãå¤§å¹ ã«ç縮ãã¦ããã
- ãã¼ã¿æ¸ãè¾¼ã¿ã«å¯¾ããé次å¦ççãªMap
- PTailãé 次ãã°ã®åå¦çãè¡ããã¨ã§ãMapã«ãããªã¼ãã¼ããããå®æéã¸ã®é 延ã«ã«ã¦ã³ãããªãã¦ãããªã£ã
- HBaseã(ãã¼ã¿æ¸ãè¾¼ã¿æã«)åçã«ã½ã¼ãå¦çãè¡ããã¨ã§ãéç´å¦çæã®ä»¥ä¸ã®å¦çã®ãªã¼ãã¼ããããããããªããªã£ã
- éè¨å¯¾è±¡çµãè¾¼ã¿ã®ããã®ã½ã¼ã
- ããã³ã½ã¼ãã®ããã®å ¨ä»¶æ¢ç´¢
- HBaseãå¤ã®ã¢ãããã¯ã»ã¤ã³ã¯ãªã¡ã³ãããµãã¼ããã¦ãã®ã§éè¨å¦çã§ãããã¨ã大å¹
ã«æ¸ããã
- ä¾ãã°Page Viewsã¿ãããªææ¨ã¯ããã¼ã¸ãã¨ã«ã«ã¦ã³ã¿ãã¤ãã£ã¦ã¤ã³ã¯ãªã¡ã³ããã¦ããã°ãããããå¦çã¿ã¤ãã³ã°ãåå¦çå´ã«åãã¦éè¨ãã§ã¼ãºã§ãããªãã¦ãããªã
ããã¦ããã¡ããè¯ããã¨ãããã§ã¯ãªãã¦ã以ä¸ã®ãããªãã¡ãªãããããã
- HBaseç¨ã¯ã©ã¹ã¿ã(ãããã)ã»ã¼å¿
é
- HBaseã¯ãã¼ã¿ã®ã½ã¼ããã¡ã¢ãªä¸ã§è¡ãããããã¡ã¢ãªä¸ã«ãã¼ã¿ãä¿æããã¾ã¾ã«ããã®ã§ããã®ãããã¡ã¢ãªé£ããã
- ãããå¦çç¨ã®MapReduceãèµ°ãããã¯ã©ã¹ã¿ã¨åå± ãããã®ã¯ãã¶ããã®ããããªã¹ã¯é«ã
- ããããããªãã®å°æ°ããªãã¨æ§è½åºãªããã
- ãã¼ãã®å¢æ¸ã«å¯¾ãã¦shardã®åé ç½®ãèªåã§è¡ããã¨ãããã¨ã¯ãshardåé ç½®ã®ãªã¼ãã¼ããããåé¡ã«ãªããªãã¾ã§ã«ãã¼ãæ°ãå¢ãããªãã¨ä½¿ããã®ã«ãªããªããã¨ãããã¨
- PTailç¨ã¯ã©ã¹ã¿ã(ãããã)ã»ã¼å¿
é
- PTailèªä½ãã©ã®ããã«åä½ããã¦ãããããããããªãã®ã§æè¨ã¯ã§ããªãããhadoop MapReduceã¨åãåºç¤ã§ã¯åããªãã¨æã
- PTailã®å ´åã¯MapReduceã¨éã£ã¦ã²ã¨ã¤ã®å¦çããã»ã¹ãé·æéåãç¶ãããã¨ã«ãªãã®ã§ããã®åã®ã¡ã¢ãªã¨CPUã確ä¿ããªãã¨ãããªã
- ããããã£ããã³ã¹ãé«ãã
ãã¦ãã¦ãã¨ããããPTailãæ°é±éã§å ¬éãã¦ããããããã®ã§ããããè¦ã¦ããããªï¼