å²ã¨ã©ãã®ç¾å ´ã§ããã£ã¦ããåæã¨ãã¦ãCVRåä¸ã«æãè²¢ç®ããè¡åãã¿ã¼ã³ãã®åå®ã»æ½åºã¨ããã®ãããã¨æããã§ãããããã£ã¦ç°¡åãªããã§æå¤ã¨é£ãããã¤ã³ããã´ãã´ããã¦ããã§ãããã
ä¾ãã°ã¦ã¼ã¶ã¼è¡åãã°DBããã½ã·ã£ã²ã®ã¤ãã³ãA, B, C...ããã£ãããããªããããã©ã°ã¨ãããã¼ãã«ãæ½åºããCVã®ã©ãã«ã¨ãã¦ã1é±é以å ã«èª²éããorããªãã*1ã¿ãããªã®ãä¸ãã¦ã
UserID | Event A | Event B | Event C | Event D | ... | CV |
---|---|---|---|---|---|---|
1001 | 1 | 0 | 1 | 1 | ... | Yes |
1002 | 1 | 1 | 1 | 0 | ... | Yes |
... | ... | ... | ... | ... | ... | ... |
10X4 | 0 | 1 | 0 | 0 | ... | No |
10X5 | 0 | 0 | 0 | 1 | ... | No |
... | ... | ... | ... | ... | ... | ... |
ã¨ãããããªçãã¼ã¿*2ãå¾ã¦ãããã®ã¨ããã§ã¯ä»®å®ãã¾ãããããããCVRåä¸ã«æãè²¢ç®ããè¡åãã¿ã¼ã³ãçªãæ¢ããï¼ã¨ãã課é¡ãèãã¦ã¿ã¾ãããã
ã¢ã¯ã·ã§ã³çã§æ¯è¼ããoræ©æ¢°å¦ç¿ã§æ¯è¼ãã
ã©ããã£ã¦ãããã¨æããã§ãããåããã£ããã¨ã®ãããã¿ã¼ã³ã2ã¤ãç´¹ä»ã
ã¢ã¯ã·ã§ã³ç
ãã¡ãã¯å²ã¨ç°¡åã§ãæ®éã«Excelã§ãããã¾ãããDBä¸ã§ãã¯ã¨ãªããã¡ãã¨æ¸ãã°ããåºãã¾ã*3ãããããã®ã¤ãã³ããã¨ã®ãã©ã°æ°ã®ç·åãè¨ç®ããä¸ã§ãCV = Yesã®ç¾¤ã¨CV = Noã®ç¾¤ã«åãã¦ãããããã®ç¾¤ã®UUæ°ã§å²ã£ããã®ãã¢ã¯ã·ã§ã³çããã®ã¢ã¯ã·ã§ã³çå士ã®å·®ãä¾ãã°ãã¢ã¯ã·ã§ã³ç{Yes} - ã¢ã¯ã·ã§ã³ç{No}ãã®ããã«ãã¦æ±ãã¦ããã©ã¹ãªãCVRå¢å ã«ããã¤ãã¹ãªãCVRä½ä¸ã«å¯ä¸ããã¨å¤å®ãããã¨ãããã®ã§ãã
ä¾ãã°ã§ãããä¸ã®ä¾ã§è¨ãã°
Event A | Event B | Event C | Event D | ... | |
---|---|---|---|---|---|
Yes | 60% | 30% | 45% | 20% | ... |
No | 20% | 70% | 55% | 50% | ... |
å·®å | 40% | -40% | -10% | -30% | ... |
ã¿ãããªçµæãå¾ããã¾ããããã§ä¸çªä¸ã®è¡ã®ãã¢ã¯ã·ã§ã³çã®å·®åãããã©ã¹ããã¤ãã¹ãï¼ããã¦ãã®ããªã¥ã¼ã ãã©ãããããï¼ã§ãããããã®ã¤ãã³ããCVRåä¸ã«è²¢ç®ãããã©ããã測ãã*4ãã¨ããããã§ãã
æ©æ¢°å¦ç¿
ããæ¹æ¬¡ç¬¬ã§ãããRã¨ãã使ãã°ãã®ããã¼ãç°¡åã§ããã¨ãããããã¸ã¹ãã£ãã¯å帰ã¨ã*5ã§è¯ãã¨æãã¾ãï¼ããæ¹ã¯以前の記事1 or 以前の記事2ããããåç §ï¼ãããã§ã¯exp(coefficients)ãã¦åã ã®å¤æ°äºæ¸¬éè¦åº¦ãåºããå ã®coefficientsã®ã符å·ãããCVRå¢å ï¼æ£ï¼oræ¸å°ï¼è² ï¼ã«å¯ä¸ããã¨ã¿ãªããã¨ã«ãã¾ãããã
ããã¨ãä¾ãã°ã§ãã
Event A | Event B | Event C | Event D | ... | |
---|---|---|---|---|---|
å¤æ°äºæ¸¬éè¦åº¦ | 25.0 | -15.0 | 10.0 | -7.5 | ... |
ã¿ãããªçµæãå¾ããã¾ããããã®å¤ã大ãããã°ããã ãCVRå¢å ã«å¯ä¸ããããå°ãããã°ï¼ããã¦è² ã§ããã°ï¼CVRæ¸å°ã«å¯ä¸ãã¦ãã¾ããã¨ãããã¨ãè¨ããããã§ãã
ããï¼ã¢ã¯ã·ã§ã³çã«ããçµæã¨æ©æ¢°å¦ç¿ã«ããçµæã¨ãé£ãéãï¼
ã¨ããã§å
ã»ã©ã®ä¾ãè¦ã¦ãå¤ã ã¨æãã¾ããã§ãããï¼ãåãªãä»®æ³ä¸ã®ãã¼ã¿ã§ã¯ããã¾ããã
Event A | Event B | Event C | Event D | ... | |
---|---|---|---|---|---|
ã¢ã¯ã·ã§ã³çå·®å | 40% | -40% | -10% | -30% | ... |
å¤æ°äºæ¸¬éè¦åº¦ | 25.0 | -15.0 | 10.0 | -7.5 | ... |
Event Cã®è©ä¾¡ãéã«ãªã£ã¦ã¾ããããããªãã¨ããå¾ãã®ãï¼ï¼ï¼ã¨æã人ãå¤ãã§ãããããå®éã«ã¢ã¯ã·ã§ã³çå·®åã¨æ©æ¢°å¦ç¿ã§åºããå¤æ°äºæ¸¬éè¦åº¦ã¨ãé£ãéãã¨ãããã¨ã¯ããå¾ããã§ãã
å®ã¯ãçµã¿åããããåé¡
ä½ã§ãããªçç¾ãçããã®ãã¨ããã¨ããã®æã®è¡åãã¿ã¼ã³ã®é¸æè¢ãå¤ãç¶æ³ä¸ã§ã®åé¡ã§ã¯å¾ã
ã«ãã¦ãçµã¿åããããCVãåããè¦å ã«ãªã£ã¦ããããã§ãã
ä¸è¨ã®ä¾ã ã¨ãä¾ãã°"Event C"ã¯"Event B"ã¨ã¨ãã«ãã¬ã¤ããã¦ã¼ã¶ã¼ã®å ´åã¯CVRãä½ããªãã¨ãããã£ã¨è¤éã«è¨ãã°"Event B"ã¯ï¼ä¸ã®ä¾ã§ã¯çç¥ããï¼"Event E", "Event H"ã¨ã¨ãã«ãã¬ã¤ããã¦ã¼ã¶ã¼ã®å ´åã¯ãã®ãããCVRãä¸ããã¨ãããªã®ã«ã"Event B, E, H"ã¨ã¨ãã«ãã¬ã¤ãã¦ããªãã¦ã¼ã¶ã¼ã®å ´åã§ã¯CVRããã®ãããé«ããã¨ãããã
ããã§å ¨ä½æ°ã¨ãã¦ãæªãæ¹ã®çµã¿åãããããè¯ãæ¹ã®çµã¿åããããä¸åã£ã¦ããå ´åã«ãä¸æ¹ã§ãè¯ãæ¹ã®çµã¿åãããã®æ¹ãCVRã§å¤§ããåªãã¦ãããããã¨ãã¢ã¯ã·ã§ã³çã®å·®åã§ã¯ãã¤ãã¹ãªã®ã«æ©æ¢°å¦ç¿ã§åºããå¤æ°äºæ¸¬éè¦åº¦ã§ã¯ãã©ã¹ã«ãªãã¨ãããã¨ãããå¾ããã¨ããããã§ãã
ããã¯ãã®ãããããããã話ã§ãããå¤æ°ï¼ããã§ã¯åæ対象ã®ã¤ãã³ãï¼ãå¢ããã°å¢ããã»ã©èµ·ããå¾ãé¢åãªåé¡ã§ãã
ãçµã¿åãããã調ã¹ãã«ã¯ï¼
ã¨ãããããã®ãçµã¿åãããã調ã¹ããã¨ããã¨çµæ§å´åãè¦ãã¾ããããã ã5å¤æ°ããããªãå
¨ãã¿ã¼ã³èª¿ã¹ã¦ãä½ã¨ããªãã§ãããããããã50å¤æ°ã¨ããã£ãããæä¸ã*6ã§ããããããã
èãæ¹ã¯è²ã ããã¾ãããåã®å ´åã¯ã¢ã½ã·ã¨ã¼ã·ã§ã³åæï¼ãã¹ã±ããåæãassociation rulesï¼ãç¨ãããã¨ã«æ±ºãã¦ãã¾ããããæ¹ã¨ãã¦ã¯ãé常ã®ã¢ã½ã·ã¨ã¼ã·ã§ã³åæã®ãã©ã³ã¶ã¯ã·ã§ã³ãã¼ã¿ã«ãããã«CVãèå¥ããããã¼å¤æ°ã足ãã¦ããã¨ããæãã§ãããã®ä¸ã§âãããªæãã®
以前の記事ã§ãã£ããããªã°ã©ãæ§é ã¸ã®å¯è¦åãè¡ã£ãä¸ã§ãå ·ä½çã«ãCV = Yesã®ãã¼ããã¨ãCV = Noã®ãã¼ããã¨ãä¸è¨ã®ãããªããããããã¼ãï¼ä»åã®ä¾ã§ã¯"Event C"ï¼ã¨ãã©ãããé¢ä¿æ§ã«ãããã調ã¹ãã¨ãããã®ã§ãã
ããããå ã¯åãã¾ã 模索ä¸ãªã®ã§ä½ã¨ãè¨ããªããã§ããããã¿ã«Fruchterman-Reingoldã¢ã«ã´ãªãºã ã§æç»ããã®ãè¯ãããªã¼ãã¨æã£ã¦ã¾ããä½ã¨ãªãã
ãããã«
ãã¡ããããã§åãæ示ããæ¹æ³è«ãå
¨ã¦ã¨ããããã§ã¯ãªãã§ãããçµã¿åãããã®åé¡ã解決ããã«ã¯ãã£ã¨è¯ãæ¹æ³ããã£ã¨ããããããªããã¨æãã¾ãã®ã§ãæ°åã®è¯ãæ¹ã¯ãã²åã«æãã¦ä¸ããï¼ç¬ï¼ã
å¾ã¯ããã£ã±ãåç´ã«ãçµã¿åãããã表ç¾ããããã®ä½ãè¯ãæ°çã¢ãã«ãªãããªã¼ãã¨ããã¨ããã§ããããç¾ç¶ã§ã¯ã°ã©ãçè«ã¨ãã°ã©ãã£ã«ã«ã¢ãã«ã¨ããè¯ãããªãã§ããããã£ã¨åå¼·ããªãããããªããã§ããããªãããã
*1:ã½ã¼ã·ã£ã«ãªããç¿é±ãæ¥ãï¼å®çï¼oræ¥ãªãï¼é¢è±ï¼ãã§ãããããåºåãªããè³æè«æ±ããorããªããã¨ã
*2:ç´ æ§ï¼ãããï¼ãã¯ãã«ï¼åé¡ã©ãã«ãã¨ãããã¿ã¼ã³ã§ãã
*3:Hiveã ã¨å¤å°ãã¯ãã«ã«ã§ããããã¯ææ ¢
*4:è£ãè¿ãã¨ãã¤ãã¹ã®ãã®ã¯ãé¿ããã¹ããã¨ãããã¨ã«ãªã
*5:å¥ã«SVMã§ãã©ã³ãã ãã©ã¬ã¹ãã§ãè¯ããã©ããåãããåããå¿ è¦ããã
*6:試ãã«ãçµã¿åããã®æ°ãã®è¨ç®ã§ç®åºãã¦ã¿ãã¨ããã