ãã®è¨äºã§ã¯LLMããã«ãã¨ã¼ã¸ã§ã³ãã·ã¹ãã ã§ã©ã®ããã«å¿ç¨ããããããææ¡ããè«æãç´¹ä»ãã¾ãã
å¿ã®çè«ï¼Theory of Mindsï¼
人éã¯ãã¼ã ã¯ã¼ã¯ãããã¨ãããã¼ã ã¡ã¼ããã©ã®ãããªç¶æ³ã§ã©ããªè¡åãã©ã®ãããªæå³ã§è¡ãããæ¨æ¸¬ãã¾ããä¾ãã°ããµãã«ã¼é¸æã¯è¦æ¹ãã©ã®ãããªæå³ã§èµ°ã£ãããããªãã«ããã¦ããããå¯ç¥ãã¦ããã¯ãã§ããå³æ¹ã®æå³ã«åããã¦èªåã®è¡åï¼ã¹ãã¼ã¹ã«èµ°ã£ããããã¼ã«ãè¦æ±ãããï¼ã決ãã¾ããéã«ãã¹ãã¤ãªãããªãã£ãã¨ãã¯äºãã®æå³ãæ¨æ¸¬ããã®ã«å¤±æããã¨ãããã¨ã§ãããä»è
ãä½ãèãã¦ãããããç 究ããã®ã¯çºéå¿çå¦ãªã©ã§ãç 究ããã¦ãããã¨ã§ããï¼ä¾ï¼ããµãªã¼ã¨ã¢ã³èª²é¡ãï¼ãAIã¨ã¼ã¸ã§ã³ãã人éã®ããã«ä»è
ã®æèããã¾ãæ¨æ¸¬ã§ãããã¯é常ã«é¢ç½ããããã¯ã§ãããå¿ç¨ã¯å¤å²ã«ãããã¾ãã
ä»åã¯ãä»è
ã®å¿ã®ç¶æ
ãç®çãæå³ãç¥èã信念ãå¿åãç念ãæ¨æ¸¬ãªã©ãæ¨æ¸¬ããç´è¦³ã«ããå¿ã®æ©è½ã§ãããå¿ã®çè«ããLLMã¨ã¼ã¸ã§ã³ããç²å¾ã§ãããã«ã¤ãã¦ç 究ãã以ä¸ã®è«æãç´¹ä»ãã¾ãã
信念ç¶æ
å¿ã®çè«ã§ã¯ãããã¨ã¼ã¸ã§ã³ããèãã¦ãããã¨ãã信念ç¶æ
ãã¨ããã¾ãããã¨ãã°ããµãã«ã¼ã®ä¾ã ã¨ãããã®ã¹ãã¼ã¹ã«åãã£ã¦ããªãã«ããã°ãã£ã³ã¹ã«ãªããã¨ããããã¯ãã¹ãããæ¹ãç¸æãåãããã«ãªã£ã¦ã¹ãã¼ã¹ããã¾ããããããªã©ã§ãããµãã«ã¼ã«éãã人éã¯ä½ãããã«ãã¦ãä½ãããã®æå³ãäºæ¸¬ããã¦è¡åã決å®ãã¦ãã¾ã*1ã
è¨ãã¾ã§ããªãããã¼ã ã¯ã¼ã¯ãããã¨ãã¯ç¸æã®ä¿¡å¿µç¶æ
ããã¾ãäºæ¸¬ããªããã°ããã¾ããã
ãã¦ãLLMã¨ã¼ã¸ã§ã³ãã«ãã¼ã ã¯ã¼ã¯ããããã«ã¯ã©ãããã°ããã§ãããããæ¬è«æã§ã¯ä»¥ä¸ã®3ç¹ã«çç®ãã¦ãã¾ãã
- 0次ToM (Theory of Mind) æ¨è«LLMã¨ã¼ã¸ã§ã³ããèªèº«ã®ä¿¡å¿µç¶æ ãæ確ã«è¡¨ç¾ã§ããè½åãããã
- 1次ToM ã¨ã¼ã¸ã§ã³ããä»è ã®ä¿¡å¿µç¶æ ãæ¨å®ã§ãããã©ãã
- 2次ToM ä»è ãèªèº«ã®ä¿¡å¿µç¶æ ã«ã¤ãã¦ä½ãèãã¦ããããæ¨è«ã§ãããã©ãã
å®é¨è¨å®
æ¬è«æã§æ±ã£ã¦ããåé¡è¨å®ãå°ã説æãã¾ããèªã¿é£ã°ãã¦å¤§ä¸å¤«ã§ããé°å²æ°ã¨ãã¦ã¯ãã¨ã¼ã¸ã§ã³ãã3人ãã¦ãããããçå¼¾å¦çããã¾ããã¨ã¼ã¸ã§ã³ãã©ããã¯çå¼¾ããã¨ã©ããããã§ççºãããã§ããããããã®çå¼¾ã®ã¯ã¤ã¤ã¼ãåãããªã©ä½ãããã®è¡åããã¾ãã
ç´°ããåé¡è¨å®
3人ã®ã¨ã¼ã¸ã§ã³ãï¼AlphaãBravoãCharlieï¼ãæªç¥ã®ç°å¢ã«åæ£ãã¦ãã¾ããè²ã¤ãã®çå¼¾ã®ä½ç½®ãç¹å®ããå®å ¨ã«è§£é¤ãããã¨ãç®çã¨ãããã¼ã ã§ããåçå¼¾ã¯3è²ã®ããããã§ãããããã®è²ã¯çå¼¾ã®ãã§ã¼ãºã表ãã¾ãã解é¤ã«ã¯æ£ããé åºã®ã¯ã¤ã¤ã¼ã«ãã¿ã¼ãå¿ è¦ã§ãã ãã¼ã ã¡ã³ãã¼ã¯ããããç°ãªãè²ã®ã«ãã¿ã¼ãæã£ã¦ã²ã¼ã ãå§ãã¾ãã ç°å¢ã¯é£çµã°ã©ãã¨ãã¦æ¦å¿µåããã5åã®ãã¼ãã¯å»ä¸ï¼ã¨ãã¸ï¼ã§ã¤ãªãã£ã5åã®é¨å±ã表ãã¾ããåã©ã¦ã³ãã«ããã¦ãã¨ã¼ã¸ã§ã³ãã¯ä»¥ä¸ã®3ã¤ã®è¡åããä¸åé¸æãã¾ãã - 5åã®é¨å±ã®ãã¡ã®1ã¤ã«ç§»åãã - ç¾å¨ã®é¨å±ã«ããçå¼¾ã®ãã§ã¼ãºãæ¤æ»ãã - 3åã®ã¯ã¤ã¤ã¼ã«ãã¿ã¼ã®ãã¡ã®1ã¤ã使ç¨ãã
ã¨ã¼ã¸ã§ã³ãã®è¦³æ¸¬ã¯ãç¾å¨ã®é¨å±ã®ä¸èº«ã¨ã¨ã¼ã¸ã§ã³ãã®ã¹ãã¼ã¿ã¹ã«éå®ããã¾ãããã¼ã ã®ã¹ã³ã¢ãç¾å¨ã®é¨å±ã®ä¸èº«ããã¼ã ã¡ã¤ãã®ä½ç½®ãå©ç¨å¯è½ãªãã¼ã«ã«ã¤ãã¦ã¯ãå®æçã«æ´æ°ããã¾ãã段éã®çå¼¾ã解é¤ãããã¨ããã¼ã ã«ã¯
ãã¤ã³ããä¸ãããã¾ãã
LLM ã¨ã¼ã¸ã§ã³ã
æ¬è«æã§ã¯ããã¼ã ã¯ã¼ã¯ãè¡ãã«ããã£ã¦ã¨ã¼ã¸ã§ã³ãã信念ç¶æ
ãæ示çã«ä¿æããã®ãæã¾ããã¨ãã£ã¦ãã¾ããå³ï¼ã®ä¾ã§ã¯ãAlphaãCommunication Messageã¨ãã¦Bravoããåãåã£ãã¡ãã»ã¼ã¸ããã¨ã«èªåã®ä¿¡å¿µãæ´æ°ãã¦ãã¾ããããã§ä¿¡å¿µã¨ã¯ç°å¢ã«ã¤ãã¦ã®æ
å ±ã¨è¨ã£ã¦ããããããã¾ããã
ã²ã¼ã ã®å¾ç¹ã¯LLMã¨ã¼ã¸ã§ã³ãã信念ï¼Beliefï¼ç¶æ
ãæ示çã«ä¿æãã¦ããå ´åã®æ¹ãé«ãã§ãã
ã¡ãªã¿ã«MAPPOã¯ãã«ãã¨ã¼ã¸ã§ã³ã深層強åå¦ç¿ã®æåãªææ³ã§ãã
åµçºç¾è±¡ããã³0, 1, 2次ToM
ãã¼ã ã¯ã¼ã¯ãå¿ è¦ãªä»åã®çå¼¾å¦çã¿ã¹ã¯ã§ãããåµçºç¾è±¡ã¨ãã¨ããç¾è±¡ã確èªããã¦ãã¾ããå ·ä½çã«ã¯ãããä¸äººã®ã¨ã¼ã¸ã§ã³ãããªã¼ãã¼ã¨ãªããä»ã®äºäººã«æ示ãéãã¾ããä¸ã®å³ä¸é¨ã§ã¯ãAlphaãBravoã¨Charlieã«æ示ãéã, äºäººãæ示éãã«è¡åãã¦ããã®ãåããã¾ãã ã¾ãä¸ã®å³ä¸é¨ãè¦ãã¨ãLLMã¨ã¼ã¸ã§ã³ã(+信念ç¶æ )ã¯0, 1, 2次ToMãä¿æãã¦ããã¨ããããã§ãã
次åäºå
次åã¯ãªã¼ãã³ã½ã¼ã¹ã®Llama3.2 3B-Instractãç¨ãã¦å調è¡åãã§ããããæ¤è¨¼ãã¦ã¿ã¾ãã
ãã®ããã°ã¯æ ªå¼ä¼ç¤¾EfficiNet Xã®ããã¯ããã°ã§ãã
*1:é¨å観測ãã«ã³ã決å®éç¨ã§ã¯ãã信念ç¶æ ã¯å®éã®(çã®)ç¶æ ã«ä»ã©ã®ããããããã表ã確çãã®ãã¨ã§ãã