å æ¥æ¸ãã機械学習における距離学習ã®ç¶ãã
kNN (k-nearest neighbour: k 近傍法)㯠Wikipedia ã®ã¨ã³ããªã«ãæ¸ãã¦ããéããæ師ããå¦ç¿ã®ä¸ã¤ã§ãããã¤ã³ã¹ã¿ã³ã¹ã®ã©ãã«ãå¨è¾º k åã®ã©ãã«ããæ¨å®ããææ³ãmemory-based learning ã¨å¼ã°ãããã¨ããããåç´ã«å¤æ°æ±ºãåãå ´åãããã°(åç¹ã解決ããå¿ è¦ãããã)ãè¿ãã¤ã³ã¹ã¿ã³ã¹ã®éã¿ã大ããããå ´åãããã®ã ãããããã«ããããªãå®è£ ã¯åç´ãªã®ã§ãä»ã®æ©æ¢°å¦ç¿ã¨ã®æ¯è¼(ãã¼ã¹ã©ã¤ã³)ã¨ãã¦ä½¿ããããã¨ãå¤ãã
ç°¡åãªã¢ã«ã´ãªãºã ã§ã¯ãããã1-NN ã®å ´åãã®ã¢ã«ã´ãªãºã ã®èª¤ãçã¯ãã¤ãºèª¤ãç(éæå¯è½ãªæå°èª¤ãç)ã®2å以ä¸ã¨ãªããã¨ã示ãããããçè«çã«ããããªãã«ã¯ãªã¢ã«ãªã£ã¦ãã¦ããã®ã§ã¯ãªããã¨æããã¾ããå¤ã¯ã©ã¹åé¡ãã¡ãã£ã¨ä¸æé㪠SVM (pairwise ã«ããã one-vs-rest ã«ããããããã[Crammer and Singer 1999] ãªã©ã®ææ³ãç¨ããã)ã¨éã£ã¦ãè¿ãã¨ããåã£ã¦ããã°ããã®ã§ãå¤ã¯ã©ã¹ã§ãå ¨ãåé¡ãªããã¨ããå©ç¹ãããã
ãã çµå± kNN ã¯ã¤ã³ã¹ã¿ã³ã¹å士ã®è·é¢ãã©ãé¸ã¶ãã¨ããã®ãä¸çªã®åé¡ã§ãç¹ã«ç´ æ§ãä½æ¬¡ã®å ´åã¯ããã®ã ããèªç¶è¨èªå¦çãªã©å ¸åçã«ã¯é«æ¬¡å ã¹ãã¼ã¹(ç´ æ§ã®æ°ãå¤ãããã»ã¨ãã©ã®ç´ æ§ã®å¤ã¯ã¼ã)ã§ãããããªåéã§ã¯ãã次å ã®åªããã®ããã«ããæ§è½ãåºãªããã¨ãå¤ãã(ãããåé¿ããããã«ãã¾ãç´ æ§ã PCA ãªã SVD ãªãã§æ¬¡å å§ç¸®ãã¦ããã¦ããã®ä¸ã§ kNN ããã¨ã¹ãã¼ã¹ãã¹ãåé¿ããã¦ããæ§è½ã«ãªããã¨ãç¥ããã¦ãã)ãã¾ããè¿ãã¤ã³ã¹ã¿ã³ã¹ãæ¢ããªãã¨ãããªãã®ã§ãå¦ç¿ã®æéã¯é¢ä¿ãªãããé«éãªè·é¢è¨ç®ãã§ããªãã¨åé¡æã«æéãããã£ã¦ãã¾ãã¨ããåé¡ç¹ããã(ãããããç¨ãã pruning ã Locality Sensitive Hashing ã使ãã¨ãããããæã¯ããã®ã ã)ã
è·é¢ã«ã¯ã¦ã¼ã¯ãªããè·é¢ãç¨ãããããã¨ãå¤ãã®ã ããããã¯ããããªè·é¢é¢æ°ããããã¦ãããã©ããã¹è·é¢ãç¨ããããããä»ã®å°ºåº¦ãç¨ããããããããã¨ããããããã§ãæ師ããå¦ç¿ã§(kNN ã®ããã®)ããã©ããã¹è·é¢ãå¦ç¿ããææ³ãæè¿åºã¦ãããç¹ã« kNN ã®è¨å®ã§ã¯ãç·å½¢å¤æããã ãã§ãããªãæ§è½ãä¸ãããã¨ã確èªããã¦ããã®ã§ããããããªææ³ãææ¡ããã¦ããã
ãã®ãã¡ Large Margin Nearest Neighbour (LMNN) (Weinberger et al. NIPS-2005; http://www.weinbergerweb.net/Downloads/LMNN.htmlコードもダウンロードできる) ã¨å¼ã°ããææ³ã¯ãæ大ãã¼ã¸ã³(large margne ã ããæ大ãããªãã¨æããä»ã«ããè¨èããªãã®ã§â¦â¦)åã«åºã¥ã㦠kNN ã®è·é¢ãå¦ç¿ããææ³ã§ãããSemidefinite programming ã¨ããå¶ç´ä»ãæé©ååé¡ã«è½ã¨ãè¾¼ããã¨ã§å¹ççã«ãã®åé¡ã解ããã¨ã«æåãã¦ããã
åãã©ãã«ãæã¤è¿åã®ã¤ã³ã¹ã¿ã³ã¹ã¯è¿ãã«ãéã«è¿ãã«ãã£ã¦éãã©ãã«ãæã¤ã¤ã³ã¹ã¿ã³ã¹ã¯é ãã«ãªãããã«è·é¢ãæ´æ°ããã®ããã½ã§ããã(è¿åã®å®ç¾©ã¨ k ãããã¤ã«ããã®ã㯠cross validation ããããã¦æ±ºãã)ãç¹ã«ãã°ã©ãã«åºã¥ãè·é¢å¦ç¿ãªã©ã§ã¯ã°ãã¼ãã«ã«ã©ãã«ã®å¶ç´ãæºããããã«å¦ç¿ããã®ã ãããã®ææ³ã§ã¯ãã¼ã«ã«ã«ã©ãã«ã®å¶ç´ãæºããã°ãããkNN ã§ã¯è¿ãã®ã¤ã³ã¹ã¿ã³ã¹ãæ£ããã©ãã«ã¤ããããã°(ã°ãã¼ãã«ã¯ã©ãã§ã)ããã®ã§ãã¡ããã© SVM ã§åé¢å¹³é¢è¿ãã®(ãµãã¼ããã¯ãã«ã¨ãªããããª)ã¤ã³ã¹ã¿ã³ã¹ã«ã ãéã¿ãã¤ãã®ã¨ããä¼¼ã¦ãããå®éãä»ã«ããããã SVM ã¨ä¼¼ã¦ããã¨ããããããè«ççã«ã¯ kNN çã® SVM ã§ããã¨è«æã«ãæ¸ããã¦ããã(æ§è½ã SVM ã¨åç¨åº¦ãSVM ãããããã¨ã)
ããããã¯ãåé¡ç¹ã¯ Semidefinite programming ã®é¨åã§ãããã®è¨ç®ã(æé©è§£ãå¹ççã«æ±ã¾ãã¨ã¯ãã)ããªãããã¼ãªããã§ããã¾ãã¾ãªå¹çåã®å·¥å¤«ããã¦ããããã ããããããæ°ä¸äºä¾(ç´ æ§ã®æ¬¡å 㯠PCA ã§å§ç¸®ãã¦æ°ç¾æ¬¡å ç¨åº¦ã«ãã)ãã使ããªããã¨ããã®ãåé¡ç¹ããªâ¦â¦ãSVM も最近は(線形分類に限定したりすると)速いã¨ãã話ãããã®ã§ãå®ç¨ä¸ã¯ãã¾ã使ããªãæ°ãããã
ã¡ãªã¿ã«é«é㪠SVM ã¨ãã¦ã¯ Core Vector Machine (ツール名は LibCVM) (ãããã¯ãã®çºå±ç³»ã® Ball Vector MachineãO 野原くんの日記にも2007年に言及があるã)ããããããã¯è²ããªã«ã¼ãã«ããµãã¼ããã¦ãã¦ããªããã¤å¦ç¿ãè¨ç·´ãã¼ã¿ã®ãµã¤ãºã«ä¾åããªã(!)ã¨ããå©ç¹ããããå®éè©ä¾¡ãè¦ã¦ã¿ãã¨å¦ç¿æéã¯ã»ã¨ãã©æ¨ªã°ãã§ãã»ã¼ä¸ç¬ã§è¨ç·´çµããããã§ãããã SVM ã¨åããããã®ç²¾åº¦ãªã®ã§ããããªã®ããã§ãããªããã¨æã£ãããããã
åæ師ããå¦ç¿ã§ã¯ Laplacian SVM (LapSVM) ã¨ããã®ãç¾å¨ã® state-of-the-art ã®ããã ããããã¯ã©ãã«ããã¤ã³ã¹ã¿ã³ã¹ã¨ã©ãã«ãªãã¤ã³ã¹ã¿ã³ã¹åããã¦æ°åäºä¾ãããããæ±ããªãããã§(ãåæ師ãããã¨è¬³ã£ã¦ããææ³ã§ããããããè«æãèªãã¨å ¨ç¶ã¹ã±ã¼ã«ããªãææ³ãã»ã¨ãã©ã§ããã®ãããã©ãã«ããäºä¾ãå°ãªãç¶æ³ã§å¹æãããããã©ãã«ãªãäºä¾ãå°ãããç¾å®çã«ã¯è¶³ããªããã¨ãå¤ãã®ã§ã使ããã¨èãã¦ãã人ã¯ã¹ã±ã¼ã©ããªãã£ã«æ³¨æããã»ãããã)ãããã®åæ師ããçã® Sparcified Laplacian CVM ã¨ããã®ã試ãã¦ã¿ãã(è«æ㯠NIPS-2006)ã®ã ããã¾ã ã³ã¼ãã«å ¥ã£ã¦ããªããããããã¨ãå¤ã¯ã©ã¹çãè«æã«ã¯æ¸ãã¦ããã®ã ããå®è£ ã¯ã¾ã ãªãªã¼ã¹ããã¦ããªãã¨ã®ãã¨ã(Windows çããæ£å¼ã«ãµãã¼ãããã¦ããªããããªã®ã§ FreeBSD ã§ã³ã³ãã¤ã«ã§ããããã«ä¿®æ£ãã¦ä½¿ã£ã¦ã¿ããã確ãã«å¦ç¿ã¯å¤§è¦æ¨¡ãã¼ã¿ã§ãç°æ§ã«æ©ãã¦ã³ã£ããããã¡ã¤ã«ãèªã¿æ¸ããã¦ããæéã®ã»ãããªããããããªããããâ¦â¦)
ã¨ããããLMNN ã«ã¤ãã¦è©³ããç¥ããã人ã¯彼の発表論文リスト(âåè«æããããã«è«æã®ä¸ããåã£ãç»åãè²¼ããã¦ãã¦ãããããç´¹ä»ã®ä»æ¹ãããã®ãï¼ãã¨ã³ã£ããããã®ã§ãä¸åº¦ã¯ãªãã¯ããããã¨ããè¦ããã)ãã辿ãã
- Weinberger and Saul. Distance Metric Learning for Large Margin Nearest Neighbor Classification (JMLR 09)
ãèªãã¨ãããããã¯å½¼ã® NIPS-2005 㨠ICML-2008 ã®è«æãã¾ã¨ããè«æãªã®ã§ãçãã»ããèªã¿ããã人ã¯åã ã®è«æãèªãã§ããããã¨ããã
ãã¦ãä¸çªæ¸ãããã£ãã®ã¯ããããæ¸ãã¦ãã Weinberger ã¯çµå± Upenn ãåæ¥ãããã¨ãä»ã¯ Yahoo! Research で働いているãããâ¦â¦ããã㦠Semi-supervised Learning æ¬

Semi-Supervised Learning (Adaptive Computation and Machine Learning series)
- ä½è : Olivier Chapelle,Bernhard Schoelkopf,Alexander Zien
- åºç社/ã¡ã¼ã«ã¼: The MIT Press
- çºå£²æ¥: 2006/09/22
- ã¡ãã£ã¢: ãã¼ãã«ãã¼
- ã¯ãªãã¯: 5å
- ãã®ååãå«ãããã° (3件) ãè¦ã
ãæ¸ãã Olivier Chapelle も現在は Yahoo! Research ã§åãã¦ããããã ããã¼ããã¾ããããã ãã
æ©æ¢°å¦ç¿ãã¿ã¯ããããæ¸ããããã¨ã¯ãã(=èªåãæ¸ããã¨ã§è¨æ¶ã«å®çãããã¡ã½ãã)ã®ã ãããªããªãè«æã¨ã®å ¼ãåãã§æ¸ãã®ãé£ãããè«æã«æ¸ããªã(æ¸ããªã)ã¨åãã£ããæ¥è¨ã«æ¸ããããã¦ããã®ã ãâ¦â¦ã