æ¦è¦ 深層å¦ç¿ãã¬ã¼ã ã¯ã¼ã¯Caffeã使ã£ã¦ï¼Deep Q-Networkã¨ãã深層強åå¦ç¿ã¢ã«ã´ãªãºã ãC++ã§å®è£ ãã¦ï¼Atari 2600ã®ã²ã¼ã ããã¬ã¤ããã¦ã¿ã¾ããï¼ Deep Q-Network Deep Q-Networkï¼ä»¥ä¸DQNï¼ã¯ï¼2013å¹´ã®NIPSã®Deep Learning Workshopã®"Playing Atari with Deep Reinforcement Learning"ã¨ããè«æã§ææ¡ãããã¢ã«ã´ãªãºã ã§ï¼è¡å価å¤é¢æ°Q(s,a)ã深層ãã¥ã¼ã©ã«ãããã¯ã¼ã¯ã«ããè¿ä¼¼ããã¨ããï¼è¿å¹´ã®æ·±å±¤å¦ç¿ã®ç 究ææãå¼·åå¦ç¿ã«æ´»ããããã®ã§ãï¼Atari 2600ã®ã²ã¼ã ã«é©ç¨ããï¼æ¢åææ³ãå§åããã¨ã¨ãã«ä¸é¨ã®ã²ã¼ã ã§ã¯äººéã®ã¨ãã¹ãã¼ããä¸åãã¹ã³ã¢ãéæãã¦ãã¾ãï¼è«æã®èè ãã¯ä»å¹´Googleã«è²·åãããDeepMindã®ç 究è ã§ãï¼ NIPS
{{#tags}}- {{label}}
{{/tags}}