2011-06-01ãã1ã¶æéã®è¨äºä¸è¦§
ä¸ã¤ã®é åã使ãã®ã¨ï¼vectorãçµã¿åãããã®ã¨boost::multi_arrayã使ãã®ã§ã¯ã©ããä¸çªä½¿ãããããã§ããããï¼
Visual Studio2010 ã«Boostã©ã¤ãã©ãªãã¤ã³ã¹ãã¼ã«ããã®ã§ã¡ã¢ï¼ Download Boost Library Here - BoostProããã¤ã³ã¹ãã¼ã©ããã¦ã³ãã¼ããã¦å®è¡ï¼ ããããã£ã®VC++ãã£ã¬ã¯ããªâã¤ã³ã¯ã«ã¼ããã£ã¬ã¯ããªã«C:\Program Files\boost\boost_1_46_1ï¼ã©ã¤â¦
"Deciphering Foreign Language" ãã©ã¬ã«ã³ã¼ãã¹ã対訳è¾æ¸ãªãã§æ©æ¢°ç¿»è¨³ãè¡ã£ã¦ããè«æï¼ ã¢ãã«ãä½ã£ã¦EMã¢ã«ã´ãªãºã ãã®ãã¹ãµã³ããªã³ã°ã§ãã©ã¡ã¼ã¿æ¨å®ï¼ ãã©ã¬ã«ã³ã¼ãã¹ã使ã£ãæ¹æ³ã¨comparableãªçµæã£ã¦æ¸ãã¦ãããã©ï¼æ°å¤ã«ã¯å¤§ããªå·®â¦
import unicodedata def countKanji(text): s = 0 for c in text: if (unicodedata.name(c)[0:3]) == 'CJK': s += 1 return s
mecab-dict-indexã¨è¾æ¸ã®å ´æããªããªãããããªãã£ãã®ã§ã¡ã¢ï¼ /usr/lib/mecab/mecab-dict-index /usr/share/mecab/dic/ipadic /usr/lib/mecab/mecab-dict-index -d /usr/share/mecab/dic/ipadic -u user.dic -f utf-8 -t utf-8 user.csv
以åä½æããpixivç¨ã®Greasemonkeyã¹ã¯ãªããpixiv-tag-suggestããã¼ã¸ã§ã³ã¢ããï¼ å°èª¬ã®æ¹ã§ã¯ã¹ã¯ãªãããåãã¦ããªãã£ãã®ãä¿®æ£ï¼ pixivã¯ã¤ã©ã¹ãã®æ¹ã¨å°èª¬ã®æ¹ã§å¾®å¦ã«HTMLã®æ§é ãç°ãªã£ã¦ãã®ãè¬ï¼
Pythonã§ã¯whileãforã«ã¼ãã«elseã使ããã¨ç¥ã£ã¦ã³ã£ããï¼ elseã¯breakãªã©ã§æããã«é常ã®æ¹æ³ã§ã«ã¼ããçµäºããã¨ãã«å®è¡ããããããï¼ ä»ã¾ã§ã¯éä¸ã§breakããã¨ãã¨æå¾ã¾ã§å®è¡ããã¨ããåºå¥ããããã«ï¼ãããããã©ã°å¤æ°ãä½ã£ã¦ããã®ã§â¦
ããã§ã¾ã Noviceã£ã¦ãããã ããä¸ä½ã¯é ãä¸çã ãªãâ¦â¦ï¼
urllib2ã§éãããã®ãèªåçã«closeãããã£ããã§ããï¼èª¿ã¹ããcontextlib.closing()ã使ãã°withæã§å¯¾å¿ã§ããã¿ããã§ãï¼ import contextlib import urllib2 with contextlib.closing(urllib2.urlopen('http://www.python.org')) as page: for line iâ¦
1171â1081ï¼å¤§å¹ å¾éï¼æè¿ã¯é 調ã ã£ãã ãã«æ®å¿µï¼ 250 æ°å¤ãåæã«ããã®ã«å¿ è¦ãªã³ã¹ãï¼ æ°å¤ã«1ãã¤è¶³ãï¼å¼ãï¼ãªããï¼æ°å¤ãæååã«ãã¦ååã¨å¾åã«åãã¦å転ãã¦æ¯è¼ããï¼ æ®éã«æ°å¤ã®ã¾ã¾ã§æ±ã£ãã»ããç°¡åï¼ 500 çµã¿åãããæ±ãã¦æ®éâ¦
Probabilistic Latent Semantic Analysisã¨Probabilistic Latent Semantic Indexingã®ã©ã£ã¡ã®ååã使ãã°ããã®ããããã¾ãããï¼æPLSAã®å®è£ ã«ææ¦ããã¨ãã®ã½ã¼ã¹ãåºã¦ããã®ã§æãã¨ãï¼ ã¡ããã¨åãã¦ããã©ããã¯ä¸æï¼Tempered EMã¢ã«ã´ãªãºã â¦
2010å¹´ã®è«æï¼ Twitterããããã¢ã«ã¦ã³ãã人éï¼botï¼ãããã¯Cyborgï¼æåã¨èªåã®ä¸¡æ¹ï¼ã®ãããããèå¥ï¼ 人éã¨botã«ã¤ãã¦ã¯9å²ä»¥ä¸ï¼Cyborgã«ã¤ãã¦ã¯8å²ç¨åº¦ã®æ£è§£çï¼ èå¥ã§å©ç¨ãã¦ããæ å ± æ稿æéã®ãã¿ã¼ã³ ãã¤ã¼ããspamçãã©ãã ã©ãâ¦
1158â1171ï¼ç¾ç¶ç¶æï¼ 250 hashCode()ã®æ¸ãå¿ãã«æ°ã¥ããã«ï¼æéã使ãéããï¼Setã使ããªãã§æåããé åã§æ¸ãã°ããã£ãâ¦â¦ãããã¯java.awt.Pointã®åå¨ãæãåºãã°ï¼ 500 æ¹éãç«ããªãã£ããã ãã©ï¼åç´ãªå¹ åªå æ¢ç´¢ã§ããã£ãã®ããªï¼
ãããã¯ã¢ãã«çãªã®ã¯ãèªãã§ããªããªãç解ã§ããªãã§ãã "Structural Topic Model for Latent Topical Structure Analysis" ä¸æãã¨ã«ãããã¯ãå²ãå½ã¦ã¦ããããã¯ã®é·ç§»ãèããï¼Sentence orderingãã§ããï¼ "Sequential Latent Dirichlet Alloâ¦