ã¯ããã« å°ãæ代é ãããããã¾ããããå¼·åå¦ç¿ã®ææ³ã®ã²ã¨ã¤ã§ããDQNãDeepMindã®è«æMnih et al., 2015, Human-level control through deep reinforcement learningãåèã«ããªãããKerasã¨TensorFlowã¨OpenAI Gymã使ã£ã¦å®è£ ãã¾ãã ååã§ã¯è»½ãDQNã®ããããããã¾ãããå°ãã®å¼·åå¦ç¿ã®ç¥èãæã£ã¦ãããã¨ãåæã«ãã¦ãã¾ãã ãã§ã«ããã¤ãè¯è¨äºãåºã¦ããã®ã§ç´¹ä»ãããã¨æãã¾ããåããã¦èªãã¨ç解ã®å©ãã«ãªãã¨æãã®ã§ãæ¯éåèã«ãã¦ã¿ã¦ãã ããã DQNã®çãç«ã¡ãï¼ãDeep Q-NetworkãChainerã§æ¸ãã DQNãçã¾ããèæ¯ã«ã¤ãã¦èª¬æãã¦ããã¦ãã¾ããChainerã§ã®å®è£ ãããããã§ãã ã¼ãããDeepã¾ã§å¦ã¶å¼·åå¦ç¿ ã¿ã¤ãã«ã®éããã¼ãããDeepã¾
{{#tags}}- {{label}}
{{/tags}}