Entries for the one year starting 2011-01-01
My first study group ever. The topics were counting sort and radix sort. I had never written either of them, so I wrote them in Python, cutting every corner I could. https://gist.github.com/1395308 base = 10 def radix_sort(x, max): radix = 1 while radix < max: x = counting_sort(x, …
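The following is a minimal sketch of how the two sorts fit together, not the code from the gist. It assumes non-negative integers and an LSD (least significant digit first) radix sort. The names `BASE` and `counting_sort_by_digit` are made up for this illustration.

```python
BASE = 10  # mirrors `base = 10` in the excerpt


def counting_sort_by_digit(xs, radix):
    """Stable counting sort of non-negative ints, keyed on the digit at `radix`."""
    counts = [0] * BASE
    for x in xs:
        counts[(x // radix) % BASE] += 1

    # Prefix sums give the first output index of each digit bucket.
    starts = [0] * BASE
    for d in range(1, BASE):
        starts[d] = starts[d - 1] + counts[d - 1]

    out = [0] * len(xs)
    for x in xs:  # left-to-right scan keeps equal digits in input order
        d = (x // radix) % BASE
        out[starts[d]] = x
        starts[d] += 1
    return out


def radix_sort(xs):
    """LSD radix sort: one stable counting sort per digit, lowest digit first."""
    result = list(xs)
    if not result:
        return result
    limit = max(result)
    radix = 1
    while radix <= limit:
        result = counting_sort_by_digit(result, radix)
        radix *= BASE
    return result
```

Each pass has to be stable, because the ordering established by the lower digits must survive the pass on the next digit.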
Around last Thursday and Friday I was feeling a little down. As I wrote last time, the English translation of my variable-order CRF paper is finally more or less done, so I tried emailing the author of the prior work (http://books.nips.cc/papers/files/nips22/NIPS2009_0300.pdf). …
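For the past month or so I have been working on an English translation of my master's thesis (variable-order CRF). (If it goes well, I might submit it to an international conference or the like. I have never submitted to one, by the way.) I meant it to be a plain translation, but once I started I found various parts that were wrong and parts that were missing, so…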
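I implemented in Python the A* search from section 7.4 of アルゴリズムクイックリファレンス, which I wrote about last week. Each piece of the 8-puzzle is managed as an (X coordinate, Y coordinate) tuple with the top-left corner as the origin. The eight pieces as a whole are the Board class. It cuts corners in places. def CalcManhattanScoreCoords(…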
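GoodEvaluator, which is given as an example of an evaluation function for the 8-puzzle: P(n)+3*S(n). P(n) is the sum of the Manhattan distances of each tile from its "home". S(n) is a score assigned by checking each square in turn: 2 if the tile is not followed by its correct successor, 0 for the other tiles. However, a tile located in the center…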
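The formula P(n)+3*S(n) can be sketched roughly as below. This is an illustration under several assumptions and not the entry's actual code. The goal layout (tiles 1 to 8 clockwise around a blank center), the score of 1 for a tile in the center (the excerpt is cut off before that value; 1 is the value usually quoted for Nilsson's sequence score), the treatment of the blank, and all names are assumptions. The board is a dict from tile number to an (x, y) tuple with the top-left as origin, in the spirit of the previous entry.

```python
# Assumed goal layout (blank in the center):
#   1 2 3
#   8 . 4
#   7 6 5
GOAL = {1: (0, 0), 2: (1, 0), 3: (2, 0), 4: (2, 1),
        5: (2, 2), 6: (1, 2), 7: (0, 2), 8: (0, 1)}

# Perimeter squares in clockwise order, starting at the top-left corner.
RING = [(0, 0), (1, 0), (2, 0), (2, 1), (2, 2), (1, 2), (0, 2), (0, 1)]
CENTER = (1, 1)
CENTER_SCORE = 1  # assumption: not stated in the truncated excerpt


def p_score(board):
    """P(n): sum of Manhattan distances of every tile from its home square."""
    return sum(abs(x - GOAL[tile][0]) + abs(y - GOAL[tile][1])
               for tile, (x, y) in board.items())


def s_score(board):
    """S(n): sequence score over the perimeter, plus the center penalty."""
    tile_at = {pos: tile for tile, pos in board.items()}
    score = 0
    for i, pos in enumerate(RING):
        tile = tile_at.get(pos)
        if tile is None:  # the blank itself is not scored
            continue
        follower = tile_at.get(RING[(i + 1) % len(RING)])
        if follower != tile % 8 + 1:  # successor of 8 wraps around to 1
            score += 2
    if CENTER in tile_at:
        score += CENTER_SCORE
    return score


def good_evaluator(board):
    return p_score(board) + 3 * s_score(board)
```

With these assumptions the goal state scores 0. Sliding tile 8 into the center gives P = 1 and S = 3 (tile 7 is no longer followed by 8, plus the center penalty), for a total of 10.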
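I went to Korea and bought some comics and other things, which I am reading now. Here are notes on words and expressions I did not know. 찌질하다: pathetic. 콩나물 시루처럼: like a pot with bean sprouts growing in it. 안 봐도 비디오: obvious without even looking. 풍경: wind chime. 뜰채: dip net. 염장을 지르다: something is not going well…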
I will state this flatly on purpose. I am mainly talking about computational cost here, so cases such as building a demo in a scripting language are excluded. This entry uses C++. How logsumexp increases the amount of computation is explained in detail in unnonouno: logsumexpとスケーリング法. To quote a little…
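For reference, here is a minimal sketch of what logsumexp computes. The entry itself uses C++; Python is used here only for brevity, and the function below is an illustration and not code from the entry. It assumes a non-empty list of log-domain values.

```python
import math


def logsumexp(values):
    """Return log(sum(exp(v) for v in values)) without overflow or underflow."""
    m = max(values)
    if m == -math.inf:  # every term is log(0)
        return -math.inf
    # Shifting by the maximum keeps every exp() argument at or below zero.
    return m + math.log(sum(math.exp(v - m) for v in values))
```

Every addition in the log domain costs one `exp` call per term plus one `log`, where plain probabilities would need a single floating-point add. That extra work is the computational cost being discussed.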
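A small tip: how to do simple Japanese text processing in Perl. (These are examples showing that such things can be done easily; I do not explain what the individual options mean.) When you want to process some Japanese text quickly on the command line, Perl is quite useful. Treating one Japanese character as one character…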
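When studying a language, there is a method called shadowing (speaking along with the audio as you listen to it). There are also repeating (listening to the audio and repeating the content after it ends) and reading aloud. Either way, producing the sounds yourself is essential. On the train…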
A continuation of yesterday's entry (コーパスと実用のサイクル - アスペ日記). How, concretely, would one "build kanji-to-kana and kana-to-kanji conversion from a corpus and feed its errors back into the corpus"? As I see it, now that the Balanced Corpus of Contemporary Written Japanese (BCCWJ)…
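This does not matter to anyone else, but I set aside one hour every day as time for myself (for things other than entertainment, so reading and language study do not count). I mainly use it for studying mathematics, programming, writing blog entries, and so on. Just as work-life balance is important in life, thinking…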
Of the things needed for Japanese kana-kanji conversion, I will list the three main ones. (Strictly speaking, far more things than this are needed.) 1. N-gram language model: resolves ambiguity using the directly adjacent context before and after. This is the foundation. …
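About the problem from @http://twitter.com/neubig that was introduced at http://d.hatena.ne.jp/nokuno/20110802/1312236781: in addition to id:nokuno's explanation, I will try writing out a concrete answer. To quote the problem again: you have 27 bottles of precious wine, and one of them has been contaminated with poison…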
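About the algorithm of N-gram 漢字-かな変換 - アスペ日記, which I wrote about earlier. I expect this to be a rather long entry. Up to the middle, it covers things that are common in Japanese natural language processing in general. As an example, we decode a sentence written in hiragana (ending in 「…までまつ」) and the corresponding kanji…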
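For the past few days I have not done any hobby programming because of events and so on. My last three entries were negative ones. Thinking about it, even with a paper, even one about an algorithm, unless it shows performance in experiments and gets accepted at an international conference or the like, having the content of the algorithm read…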
仿¥ãå¼ãç¶ããã½ã¦ã«ã¸ã§ã ã®ç©¢ããã¾ãæ£ãããããªæ¥è¨ãæ¸ãã¦ã¿ãã ãªãè«æãæ¸ããããªããã ããã¯ãããããããã¯æ¸ããªãã¨ã ãã ãããã¨ç´å¾ã§ãã以ä¸ã®ãã¨ãæ¸ããªãã¨ãããªãããã ã ã¾ããå è¡ç ç©¶ã 修士ã®ãããè«æãæ¸ãã¦ãã¦â¦
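Updated N-gram かな漢字変換: the internals now use Unicode, and unknown characters are handled. Until now it returned no result when the input contained an unknown character, which was fatal for kanji-to-kana conversion in particular. With this change it may now be all right, although I have not tried it on kanji-to-kana yet. …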
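I updated the repository. Scores used to be looked up from N-gram IDs with cdb; I changed that to a memory-mapped file. In addition, each score is now stored in 1 byte. The size went from about 400MB to about 20MB, and the speed improved considerably as well. In total, the dictionary…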
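About the N-gram kana-kanji (and kanji-kana) conversion that I wrote (an entry) about yesterday. I wrote the program over roughly the past week. Last week I worked on it after work at a café, and during the long weekend I wrote at home as well. For the N of the N-gram, I first thought 3 would be enough and tried that. "Tr…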
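I wondered whether simple kanji->kana / kana->kanji conversion could be done with the data from @gologo13's language model distribution page, so I tried building it. SRILM is used to create the language model. The distributed data needs some processing before SRILM can handle it, so I also wrote a conversion script for that…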
Just as the title says. I had always developed on Windows, so I did my C++ in Visual Studio (VS). The good thing about VS is its debugging features. It can do things like conditional breakpoints and breaking when memory changes, it runs blazingly fast, and it stops exactly where it should…
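I will try writing about Bloom filters. For implementation examples there are entries such as Bloom filterのシンプルな実装 - 西尾泰和のはてなダイアリー, so here I focus on the intuition. Prerequisites: knowledge of hash functions and key-value stores. Note: along the way, for the sake of explanation, things that differ from an actual Bloom filter…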
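As a companion to the intuition, here is a minimal sketch of an ordinary Bloom filter. It is not taken from this entry or from the linked implementation. The class name, the default sizes, and the way the k hash functions are derived from salted SHA-256 are all assumptions made for the illustration.

```python
import hashlib


class BloomFilter:
    """A set-like structure that can answer "definitely not" or "maybe"."""

    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray((num_bits + 7) // 8)

    def _positions(self, key):
        # Derive num_hashes bit positions by salting one hash function.
        for i in range(self.num_hashes):
            digest = hashlib.sha256(f"{i}:{key}".encode("utf-8")).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos // 8] |= 1 << (pos % 8)

    def might_contain(self, key):
        # False means the key was never added; True may be a false positive.
        return all(self.bits[pos // 8] & (1 << (pos % 8))
                   for pos in self._positions(key))
```

Unlike a key-value store, the filter keeps only bits and never the keys themselves, which is why it stays small and why it cannot rule out false positives.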
A continuation of the previous entry. The case of traditional Chinese characters. use strict; use utf8; use Encode; my $enc_cp950 = find_encoding('cp950'); my $enc_cp932 = find_encoding('cp932'); my $level2_eucjp; $level2_eucjp .= chr(0xd0 + ($_ / 94)) . chr(0xa1 + ($_ % 94)) for 0..3389; # 3…
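On Twitter, id:showyou was saying that he wanted to do Japanese/Chinese language identification on log data, so here are some thoughts on that. First, as a premise: looking at the characters alone, it is not possible to distinguish Japanese from simplified Chinese with 100% accuracy. (Traditional Chinese is even more troublesome, but I will set it aside for now.) …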