In May 2024, OpenAI announced its latest AI model, GPT-4o. It has been reported to deliver high performance, processing text, audio, and camera input at roughly the same speed as a human. Chinese-language users, however, have pointed out that "its training has a serious problem, and its token data is polluted."

Just wrote a script to further investigate how the corpus used to train the gpt4o tokenizer is polluted by Internet scams. The results are quite interesting... 🤦‍♂️🤦‍♂️🤦‍♂️ https://t.co/Fc2T4rSHix https://t.co/Q1Syh9amJn pic.twitter.com/lQ1u