The X-means method for estimating an appropriate number of clusters
Clustering with K-means requires fixing the number of clusters K in advance. HatenarMaps also uses K-means, and its cluster count had been hard-coded to 200 (for no particular reason).
I then learned that an extension of K-means called X-means has been proposed. With X-means, the optimal number of clusters can be estimated from the data.
K-means and X-means implementations
http://www-2.cs.cmu.edu/~dpelleg/download/xmeans.pdf
The idea behind X-means is to run K-means recursively with K=2: it compares the BIC (Bayesian information criterion) before and after splitting a cluster, and keeps splitting until the value stops improving.
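To make the recursion concrete, here is a minimal Python sketch of the idea. It is not the HatenarMaps or Weka code. The function names (`kmeans`, `bic`, `xmeans`), the `min_size` cutoff, and the exact BIC form (a spherical Gaussian with one shared per-dimension variance, where higher is better) are my own assumptions for illustration, and the paper's alternating parameter/structure steps are simplified into a plain recursive bisection.

```python
import math
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points (tuples)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean(points):
    """Coordinate-wise mean of a non-empty list of points."""
    n = len(points)
    return tuple(sum(col) / n for col in zip(*points))


def kmeans(points, k, rng, iters=50):
    """Plain Lloyd's algorithm with random initial centers."""
    centers = rng.sample(points, k)
    clusters = [points]
    for _ in range(iters):
        clusters = [[] for _ in range(k)]
        for p in points:
            nearest = min(range(k), key=lambda j: sq_dist(p, centers[j]))
            clusters[nearest].append(p)
        # An empty cluster keeps its previous center.
        new_centers = [mean(c) if c else centers[j]
                       for j, c in enumerate(clusters)]
        if new_centers == centers:
            break
        centers = new_centers
    return centers, clusters


def bic(clusters, centers):
    """BIC of a hard-assignment spherical Gaussian model (assumed form).

    Higher is better: log-likelihood minus (params / 2) * log(n).
    """
    n = sum(len(c) for c in clusters)
    k = len(clusters)
    dim = len(centers[0])
    if n <= k:
        return float("-inf")
    sse = sum(sq_dist(p, centers[j])
              for j, c in enumerate(clusters) for p in c)
    variance = max(sse / (dim * (n - k)), 1e-12)  # shared, per dimension
    loglik = -n * dim / 2 * math.log(2 * math.pi * variance)
    loglik -= sse / (2 * variance)
    loglik += sum(len(c) * math.log(len(c) / n) for c in clusters if c)
    n_params = (k - 1) + k * dim + 1  # mixing weights, centers, variance
    return loglik - n_params / 2 * math.log(n)


def xmeans(points, rng, min_size=4):
    """Bisect recursively with 2-means while the split improves BIC."""
    if len(points) < min_size:
        return [points]
    parent_score = bic([points], [mean(points)])
    centers, clusters = kmeans(points, 2, rng)
    if any(not c for c in clusters) or bic(clusters, centers) <= parent_score:
        return [points]  # splitting does not help; keep this cluster
    return (xmeans(clusters[0], rng, min_size)
            + xmeans(clusters[1], rng, min_size))
```

Calling `xmeans(points, random.Random(0))` on a list of coordinate tuples returns a list of clusters, and the length of that list is the estimated number of clusters. Because each split is judged only locally, the result depends on the BIC form chosen, which is exactly where implementations can diverge.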
Looking around, I found that Weka, the Java data mining tool, includes X-means code. However, its BIC calculation differs subtly from the one in the paper.
Using Weka as a reference, I implemented X-means, combined it with the K-means++ code I wrote earlier, and tried it out in HatenarMaps.
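For readers unfamiliar with K-means++, the sketch below shows its seeding step in Python: the first center is picked uniformly at random, and each later center is picked with probability proportional to its squared distance from the nearest center chosen so far. This is only an illustration, not the code used in HatenarMaps, and the function name `kmeans_pp_seeds` is hypothetical.

```python
import random


def sq_dist(a, b):
    """Squared Euclidean distance between two points (tuples)."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def kmeans_pp_seeds(points, k, rng):
    """Choose k initial centers using K-means++ style D^2 weighting."""
    centers = [rng.choice(points)]
    # d2[i] is the squared distance from points[i] to its nearest center.
    d2 = [sq_dist(p, centers[0]) for p in points]
    while len(centers) < k:
        if sum(d2) == 0:
            # Every point coincides with a chosen center; fall back to uniform.
            nxt = rng.choice(points)
        else:
            nxt = rng.choices(points, weights=d2, k=1)[0]
        centers.append(nxt)
        d2 = [min(d, sq_dist(p, nxt)) for d, p in zip(d2, points)]
    return centers
```

Seeds chosen this way tend to be spread out, which is useful for the K=2 runs inside X-means, where two initial centers that land in the same dense region could produce a poor split.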
At the moment, HatenarMaps clusters roughly the top 3,000 Hatena Diary users by bookmark count, and the number of clusters estimated by X-means came out at around 100.
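Below is the previous HatenarMaps, generated with K-means using a fixed number of clusters. Perhaps because the cluster count was not optimized, the tree produced by the hierarchical clustering run after K-means is poorly balanced, and the area circled in red has become deep and spiral-shaped compared with its surroundings.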
Below is the result after switching to X-means. Compared with the one above, the balance seems somewhat better.