In information retrieval, tf–idf (also written TF*IDF, TFIDF, TF–IDF, or Tf–idf) is short for term frequency–inverse document frequency, a statistic (a numerical value) intended to reflect how important a word is within a corpus or collected set of documents.[1] tf–idf is also often used as a weighting factor in information retrieval, text mining, and user modeling. The tf–idf value of a word increases in proportion to the number of times the word appears in a document, and that increase is offset by the number of documents in the corpus that contain the word. This property helps adjust for the fact that some words simply appear more often than others in general. Today, tf–idf is the best-known term-weighting method. A study conducted in 2015
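The behavior described above, where a weight grows with the count of a word in a document and is offset by the number of documents containing that word, can be illustrated with a minimal sketch. The text does not fix a particular formula, so this example assumes one common variant: tf is the raw count of the term in the document, and idf is log(N / df), where N is the number of documents and df the number of documents containing the term. The function name `tf_idf` and the toy corpus are hypothetical and chosen only for illustration.

```python
import math
from collections import Counter


def tf_idf(docs):
    """Return one {term: weight} dict per tokenised document.

    Assumed variant (not specified by the text): tf is the raw count
    of the term in the document, and idf is log(N / df), with N
    documents in total and df of them containing the term.
    """
    n_docs = len(docs)
    # Document frequency: the number of documents each term appears in.
    df = Counter(term for doc in docs for term in set(doc))
    weights = []
    for doc in docs:
        counts = Counter(doc)  # raw term counts within this document
        weights.append(
            {term: count * math.log(n_docs / df[term])
             for term, count in counts.items()}
        )
    return weights


# Hypothetical toy corpus, already split into words.
corpus = [
    "the cat sat on the mat".split(),
    "the dog chased the cat".split(),
    "the bird sang".split(),
]
scores = tf_idf(corpus)
```

Under this variant, "the" receives a weight of zero in the first document even though it occurs there twice, because it appears in every document; "mat", which occurs in only one document, receives the largest weight. Other variants (for example, smoothed idf or length-normalized tf) would give different numbers but the same qualitative ordering.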