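A memo on methods for counting the frequency of items (n-grams and the like) in large-scale data in one pass. Over the past few years, methods for using very large-scale n-gram statistics in a space- and time-efficient way have been proposed almost every year. A recent example is Storing the Web in Memory: Space Efficient Language Models with Constant Time Retrieval (EMNLP 2010). That paper carefully assembles a number of fine-grained techniques, such as minimal perfect hash functions and a compressed frequency representation that takes the power-law distribution into account. With the refinements now this detailed, my impression is that the line of research on compressing n-gram frequency information, which started around the log-frequency Bloom filter (ACL 2007), may be about to converge. Just before reading the paper, in Section 7 of this paper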