ã¿ãã§ã.
ã¯ã©ã¦ããµã¼ãã¹ã®æ´»ç¨ãå¢ãã¦ããä¸ã§ä¼æ¥ã®æ±ããã¼ã¿ã¯å¢ãã¦ãã£ã¦,ãã®ãã¼ã¿ãæ´»ç¨ã§ãããã£ã¦ä¼æ¥æ´»åã«ããã¦ããã大äºã ã¨æãã¦ãã¼ã¿åæã«èå³ãæã£ãã®ã§ãã,ãã¼ã¿åæåºç¤ã®åå¼·ä¼ãData Engineering Study #1 ããéå¬ãããã®ã§åå ãã¦ãã¾ãã.ä»åã®è¨äºã§ã¯çºè¡¨ãèãã¦æãããã¨ãã¾ã¨ãã¦ããã¾ã.
ã¤ãã³ãæ¦è¦
æ¬ã¤ãã³ãã§ã¯ãããããæ°ï¼ @yuzutas0 ï¼ã«ã¢ãã¬ã¼ã¿ã¼ãä¾é ¼ããè¤æ°åã«ããã£ã¦ãååãã¼ãã«æ²¿ã£ãå 容ã§ååéã§ãæ´»èºããã¦ããã¨ã³ã¸ãã¢ï¼ç 究è ã«è¬æ¼ããã ãã¾ãã
ã¾ããè¬æ¼å¾ã«ã¯è¦è´è ã®æ¹ãåå ã§ããäºæ¬¡ä¼ä¼å ´(Zoom)ãç¨æãã¦ãã¾ããç»å£è ã¨å ±ã«ãã¼ã¿ã¨ã³ã¸ãã¢ãªã³ã°ã«é¢ããå¦ã³ãæ·±ãã¾ãããã
æ¬ç·¨åç»
å½æ¥ã®ãã¤ã¼ãã¾ã¨ã togetter.com
å 容
åºèª¿è¬æ¼ãData Platform Guide - äºæ¥ãæé·ããããã¼ã¿åºç¤ãä½ãã«ã¯
å人çãªå¤§ãã¡ã³ã§ãã yuzutas0 ãã( id:yuzutas0 )ã®çºè¡¨ã§ãã.åããã¼ã¿åæåºç¤ã«é¢ããããã¨æã£ãã®ã¯ yuzutas0 ããã®ä¸è¨ã®ã¨ã³ããªã¼ãè¦ãã®ããã£ããã ã£ããããã®ã§,楽ãã¿ã§ãã.
çºè¡¨ã¯æ¢ã«çºå£²ããã¦ãã Software Design ã®è¨äºããã¼ã¹ã«ããã¦ãã¾ã.çºè¡¨ã§ã¯ä¼æ¥ã®ä¸ã§ä½¿ããããã¼ã¿åºç¤ãä½ã£ã¦éç¨ãã¦ããããã®è¦ç¹ã¨ã¢ã¯ã·ã§ã³ãèããã¨ãã§ãããªã¨æãã¾ãã.ç¹ã«,å©ç¨è ã®ãã¼ãºã¨å©ç¨ã·ã¼ã³ã«ç®ãåãã¦å©ç¨ãã人ã使ã£ã¦ããããããã«ä¸ç·ã«ãã¼ã«ã試ããã,æä¾ãããã¼ã¿åºç¤ã®ãµã¼ãã¹ã¬ãã«ãåæå½¢æãã¦ããã¨ããã¡ãã»ã¼ã¸ãç´å¾ã§ãã¾ãã.yuzutas0 ããæ°ããã¼ã¿åºç¤ã§ã®åãçµã¿ã¯,å©ç¨è ã®ç®ç·ã«ç«ã£ã¦ãã¯ããã¸ã¼ãçµã¿åããã¦ããç·åæ ¼éæã ã¨ããã®ãä»å¾ãã¼ã¿åæã«é¢ããããã¨æã£ã¦ããåã«ã¨ã£ã¦è¸ã«å»ãã§ããããã§ã.
çºè¡¨ã¨ã¯å¥ã§ãã,yuzutas0 ãããå·çããããã¼ã¿ããã¸ã¡ã³ãæ¬ã¯ Kindle Unlimited ã«å å ¥ãã¦ããã°ç¡æã§èªããã®ã§ãã,ãã¼ã¿ããã¸ã¡ã³ãã«é¢ããç¥èã¨ã¢ã¯ã·ã§ã³ããµã¯ãã¨åå¼·ã§ãããªã¨æããã®ã§èå³ããæ¹ã¯æã«ã¨ã£ã¦ã¿ã¦ã¯ã©ãã§ããããï¼ãªã¹ã¹ã¡ã®ä¸åã§ã!
çºè¡¨è³æ
äºä¾ç´¹ä»1ãZOZOTOWNã®äºæ¥ãæ¯ããBigQueryã®è©±ã
ç¶ãã¦,ZOZO ãã¯ããã¸ã¼ã®å¡©å´ããã®çºè¡¨ã§ããã,çºè¡¨åé ããããã³ãã¹ã¯ãåºã¦ãã¦ãã¡ããã¡ãç¬ãã¾ããw ãã¯ãã§ããã®äººããããã¦ããããã§ãð
çºè¡¨ã¯ãã¼ã¿ã®åºç¤ã¨ãã¦æå㯠Redshift ã使ã£ã¦ãããã© BigQuery ã«ç½®ãæãã¦ããã®éç¨ã®ã話ã¨ç¤¾å ã§ã® BI ãã¼ã«ã®å©ç¨ä¾ãèãã¾ãã.BigQuery ã®éç¨è©±ã§é¢ç½ãã£ãã®ãã¯ã¨ãªãæãããã¦ãéãããã£ã¦ããæã«è¬ç¿ãããã,ã»ãã¥ãªãã£é¢ã§ã®å¯¾ç㯠GCP ãªãã§ã¯ã®å¯¾å¿ãèå³æ·±ãã£ãã§ã.yuzutas0 ããã®çºè¡¨ã§ãããã¾ããã,å©ç¨è ã«ãã£ã¦ BI ãã¼ã«ãæ§ã ã ã£ã¦ããã話ã®éã㧠ZOZO ããã®ä¸ã§ã PowerBI,Looker,Redash,Google Spreadsheet ã¨ãã£ãå種ã®ãã¼ã«ãå©ç¨è ã«ãã£ã¦ç°ãªã£ã¦ãã¦ããããã§ã.ä¸ã§ã Looker ã¯ããååãèãã¦ããã®ã§ãã,LookML ã LookML ã®ã¯ã¨ãªã GitHub ã§ç®¡çãã¦ãããããªãã¯ã¨ãªãè¦ã¤ãããã¨ãã£ãã¬ããã³ã¹ãå¹ããããã«ä½¿ã£ã¦ããã¨ããã®ã BI ãã¼ã«ã®å©ç¨ç®çã§èãã§åãã¦èãã¦é©ãã§ãã.
Looker é¢é£è¨äº techblog.zozo.com
ã¾ã,ZOZO ããã§ã¯ AI ã®æ´»ç¨ãå¢ãã¦ãã¦ãã鮮度ã®è¯ããã¼ã¿ãæä¾ãã¦æ¬²ããã¨ããã¨ãããããªã¢ã«ã¿ã¤ã ç³»ã®ãã¼ã¿åºç¤ãä»æããä¸ã¨ã®ãã¨ã§ã.ãã¡ããæ°ã«ãªãã¾ã.
çºè¡¨è³æ
é¢é£è³æã¨è¨äº
äºä¾ç´¹ä»2ãfreeeã®ãã¼ã¿åºç¤ã«ãããDWH/BIã®éç¨äºä¾ç´¹ä»ã
æå¾ã«,freee ã®ä¸å±±ããã®çºè¡¨ã§ DWH ã¨ã㦠Redshift,BI ã¨ã㦠Redash ã使ã£ã¦ããã話ã§ZOZO ãã¯ããã¸ã¼ããã®äºä¾ã¨ã¯ã¾ãéã£ãé¢ç½ããããã¾ãã.å人çã«çºè¡¨ã§æ±ããã¾ããã§ããã,å²æãããä¸é¨ã«ä½¿ããã¦ãã GCP ã LakeFormation ã使ã£ã¦ IAM ãã¼ã«ãã¦ã¼ã¶ã¼ãã¼ã¹ã§ã«ã©ã ã¬ãã«ã®ã¢ã¯ã»ã¹å¶å¾¡ãã¦ããã話ã¯æ°ã«ãªãã¾ãã.
Redshift ã®éç¨ã§ã¯ãã¼ã¿ã¯ãã¹ã¯å¦çããã,ã«ã©ã é¸å¥ãã¦ããã,æã«å©ç¨è ãå ¥ãã¦ãã¾ã£ãå人æ å ±ãã¼ã¿ãé©åãªå½¢ã«ã¯ã¬ã³ã¸ã³ã°ãã¦ããããã§ã.ã³ã¹ãé¢,éè¨å¦çã®åãããã,S3 ã¨ã®é£æºãããããããä¸æ¹,ãã£ãã·ãã£ãã©ã³ãã³ã°(æ°ã¥ããããã£ã¹ã¯å®¹éã 100% ã«ãªã£ã¦ãããããããã§ã...)ããã¼ãã«ã®ãã¥ã¼ãã³ã°ãå¿ è¦ã§è¦å´ããå ´é¢ãããããã§ã.Redash ã®éç¨ã«ãã㦠EC2 on Docker ã§åãã¦ã㦠Mackerel ã§ç£è¦ãã¦ããããã§ã.çµç¹çã«ç´ æµã ãªã¨æã£ããã¨ã freee ããã§ã¯ã©ã®é¨éã®äººã§ã SQL ãããããã¨ãæ±ãããã¦ããããã ã¨æãã®ã§ãã,å¤ãªã¯ã¨ãªãå°ãªãéç¨ã®è² æ 軽æ¸ã«ç¹ãã£ã¦ããã¨ã®ãã¨ã§ã.使ã£ã¦ããããããã«ãªãããã«ã¯å©ç¨è ã®ã¯ã¨ãªãµãã¼ããå¿ è¦ã ãªã¨æãã¾ãã.
ä»å¾ã®èª²é¡ã« Redshift ã® ra3 ã¤ã³ã¹ã¿ã³ã¹ã¿ã¤ãã試ããã,ãã¼ã¿ã«ã¿ãã°ã®æ´åã ETL å¨ãã®ã¬ã¬ã·ã¼ãªé¨åããªãã¡ã¯ã¿ãªã³ã°ãã¦ããããã¨ã®ãã¨ã§ãã.Redshift ã®æ°ããã¤ã³ã¹ã¿ã³ã¹ã¿ã¤ãã¯ç´è¿åºãã°ãããªãã§ãã.ãã£ããã¢ããã§ãã¦ã¾ããã§ãã.
çºè¡¨è³æ
ã¾ã¨ã
ãData Engineering Study #1 ãã§èããçºè¡¨ãã¨ã«ææãã¾ã¨ãã¦ããã¾ãã.ãã¼ã¿åºç¤ã«ç¹åããåå¼·ä¼ã£ã¦ãããªãã£ãã®ã§çµç¹å ã§ã®ãã¼ã¿ã®æ´»ç¨ã®é²ãæ¹ã¨æ´»ç¨äºä¾ãèãã¦ã¨ã¦ãåå¼·ã«ãªãã¾ãã.æ¢ã«ç¬¬2åãã¤ãã³ããå ¬éããã¦ããã®ã§èå³ããæ¹ã¯ãã²åå ãæ¤è¨ãã¦ã¿ã¦ã¯ãããã§ããããï¼ð