Latent Semantic Indexing
There is a study group called the "IIR reading group" that very well-known people such as id:naoya and たつを have been running for some time, and it seems that this time they read through Chapter 18, "Matrix decompositions and latent semantic indexing".
http://d.hatena.ne.jp/naoya/20090208
http://chalow.net/2009-02-08-2.html
Latent Semantic Indexing is commonly abbreviated to LSI, or called LSA (Latent Semantic Analysis). In Japanese it goes by the name 潜在的意味インデキシング.
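To put it simply:
Take a huge matrix (say, tens of thousands by tens of thousands) and compress it down to something like "a few hundred by tens of thousands", as if squashing it flat sideways. Surprisingly, the matrix that remains turns out to be a collection of vectors densely packed with meaning.
So it is a good idea to use those as "feature vectors".
Once the feature vectors are mapped into a high-dimensional space, you can compute things like similarity using distance calculations.
That is roughly the idea. It is used in fields such as natural language processing.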
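To make the "compute similarity between feature vectors" part a little more concrete, here is a minimal Python sketch. It assumes cosine similarity as the measure (the post only says "distance calculations", so this is one common choice, not necessarily the one used in practice here), and the vectors, the `doc_*` labels, and the function names are all made up for illustration.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors (0.0 if either is all zeros)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0 or norm_v == 0:
        return 0.0
    return dot / (norm_u * norm_v)


def most_similar(name, features):
    """Rank every other item by cosine similarity to `name`, best match first."""
    target = features[name]
    scores = [
        (other, cosine_similarity(target, vec))
        for other, vec in features.items()
        if other != name
    ]
    return sorted(scores, key=lambda pair: pair[1], reverse=True)


# Hypothetical 3-dimensional feature vectors (assumed to be already reduced).
features = {
    "doc_a": [0.9, 0.1, 0.0],
    "doc_b": [0.8, 0.2, 0.1],
    "doc_c": [0.0, 0.1, 0.9],
}

for other, score in most_similar("doc_a", features):
    print(other, round(score, 3))
```

With real data the vectors would have a few hundred dimensions rather than three, but the calculation is the same.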
Put into words like that it sounds rather silly, but this "dimensionality reduction" part is apparently quite advanced and difficult mathematically. Not that I really understand it.
As it happens, a few months ago I went through a phase of experimenting with LSI in perl, trying one thing after another, and in the heat of the moment I wrote a module and rather irresponsibly uploaded it to CPAN. This seems like a good opportunity, so let me introduce it.
http://search.cpan.org/~miki/Algorithm-DimReduction/
Algorithm::DimReduction - Dimension Reduction tool that relies on 'Octave'
It is nothing special: literally a perl module for "having Octave do the dimensionality reduction computation using SVD (singular value decomposition)".
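For anyone wondering what "dimensionality reduction using SVD" roughly amounts to, here is a small Python sketch. To be clear, this is not the code of Algorithm::DimReduction and it does not call Octave. To stay dependency-free it swaps in power iteration with deflation in place of a real SVD routine, and it assumes a small dense matrix with entries of ordinary scale. The function names (`top_singular_vectors`, `reduce_rows`) and the sample matrix are hypothetical.

```python
import math
import random


def dot(u, v):
    return sum(a * b for a, b in zip(u, v))


def matvec(A, x):
    return [dot(row, x) for row in A]


def _orthonormalize(v, basis):
    """Remove the components of v along `basis`, then scale to unit length.

    Returns None if (almost) nothing is left. The tolerance is absolute,
    which is an assumption that matrix entries are of ordinary scale.
    """
    for u in basis:
        c = dot(v, u)
        v = [vi - c * ui for vi, ui in zip(v, u)]
    norm = math.sqrt(dot(v, v))
    return [vi / norm for vi in v] if norm > 1e-12 else None


def top_singular_vectors(A, k, iters=500, seed=0):
    """Approximate the k largest singular values and right singular vectors of A.

    Uses power iteration on A^T A with deflation; assumes k <= number of columns.
    """
    rng = random.Random(seed)
    At = [list(col) for col in zip(*A)]
    n = len(A[0])
    sigmas, basis = [], []
    for _ in range(k):
        v = _orthonormalize([rng.uniform(-1.0, 1.0) for _ in range(n)], basis)
        if v is None:
            break
        for _ in range(iters):
            # One power-iteration step: v <- normalize((A^T A) v), kept
            # orthogonal to the vectors already found.
            w = _orthonormalize(matvec(At, matvec(A, v)), basis)
            if w is None:
                break
            v = w
        Av = matvec(A, v)
        sigmas.append(math.sqrt(dot(Av, Av)))
        basis.append(v)
    return sigmas, basis


def reduce_rows(A, basis):
    """Project each row of A onto the given right singular vectors."""
    return [[dot(row, v) for v in basis] for row in A]


# Hypothetical 4 x 5 matrix: each row is one item, each column one original dimension.
A = [
    [2.0, 1.0, 0.0, 0.0, 0.0],
    [1.0, 2.0, 0.0, 0.0, 0.0],
    [0.0, 0.0, 1.0, 2.0, 1.0],
    [0.0, 0.0, 2.0, 1.0, 1.0],
]

sigmas, basis = top_singular_vectors(A, k=2)
reduced = reduce_rows(A, basis)  # 4 rows, each now a 2-dimensional feature vector
for row in reduced:
    print([round(x, 3) for x in row])
```

Each row of `reduced` is the compressed feature vector for the corresponding row of `A`. For matrices of the size discussed in this post, a proper numerical package (Octave's `svd`, for example) is the realistic choice; this sketch is only meant to show the shape of the computation.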
As a result, the processing is fixed and formulaic, and it has no generality or extensibility whatsoever. As a module it is, frankly, in the "crap" category.
Still, I thought it might be of at least some use as a reference for people interested in LSI, so I uploaded it and put it out there.
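To touch briefly on the internals: I originally tried various things with PDL (Perl Data Language), but a matrix on the order of hundreds of thousands by hundreds of thousands did not finish no matter how many days I waited, so I gave up. (Maybe PDL cannot handle very large matrices? If anyone knows PDL well, please tell me!) So I abandoned PDL and tried Octave instead, and the results came out without any trouble. Octave is awesome!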
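A hardcore researcher would probably use something more sophisticated like Mathematica, but I am a humanities type who does not understand math, so I thought "if I let Octave do the work, maybe I can manage the rest in perl" and went ahead and wrote it. Sorry. It was just a passing impulse. Please go easy on me...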
So, it is a module I am not very confident in, but if you are interested, please grab it from CPAN and play around with it.
(It is a bit late to say this, but I would like to join the IIR reading group. I wonder if it would be all right to just show up out of the blue...)