ç±³Googleã®DeepMindãã¼ã ã¯2æ25æ¥ï¼ç¾å°æéï¼ã人工ç¥è½ï¼AIï¼ã¢ã«ã´ãªãºã ãdeep Q-networkï¼DQNï¼ãã«ã¤ãã¦ã®è«æãçºè¡¨ãããDQNã¯ã¼ãããã²ã¼ã ã®ã«ã¼ã«ãå¦ç¿ãããBreakoutãããPongãï¼ãããã¯å´©ãï¼ãªã©ã®ãAtari 2600ãã®2次å ãããªã²ã¼ã ã§æçµçã«ã¯äººéãããã¤ã¹ã³ã¢ãç²å¾ããã¾ã§ã«æé·ããã DQNã«ã¤ãã¦ã®ãHuman-level control through deep reinforcement learningï¼æ·±å±¤å¼·åå¦ç¿ã«ãã人éã¬ãã«ã®å¶å¾¡ï¼ãã¨é¡ããè«æãç§å¦éèªNatureã®ãµã¤ãã«æ²è¼ãããã DQNã¯ãç±³IBMã®Watsonã®ããã«ããã°ãã¼ã¿ãè§£æããçµæãæç¤ºããã®ã§ã¯ãªããã¼ãããå¦ç¿ãã¦é²åãã¦ãã人工ã¨ã¼ã¸ã§ã³ããâãã¯ã»ã«ã¨ã²ã¼ã ã¹ã³ã¢ãå ¥åããã ãã§âã²ã¼ã ã«ç¹°ãè¿ããã©ã¤ãã¦ãã¹ã¿ã¼ãã¦ã


{{#tags}}- {{label}}
{{/tags}}