ゼロから始める深層強化学習(NLP2018講演資料)/ Introduction of Deep Reinforcement LearningPreferred Networks
Introduction of Deep Reinforcement Learning, which was presented at domestic NLP conference.
言語処理学会第24回年次大会(NLP2018) での講演資料です。
http://www.anlp.jp/nlp2018/#tutorial
ゼロから始める深層強化学習(NLP2018講演資料)/ Introduction of Deep Reinforcement LearningPreferred Networks
Introduction of Deep Reinforcement Learning, which was presented at domestic NLP conference.
言語処理学会第24回年次大会(NLP2018) での講演資料です。
http://www.anlp.jp/nlp2018/#tutorial
This document discusses using R and RStudio to simulate reinforcement learning models. It demonstrates simulating a Rescorla-Wagner model to update action values Q_A and Q_B based on payoffs from actions A and B over time. The model is expanded to select actions stochastically using a softmax function of the difference between Q_A and Q_B. Plots show the evolution of Q_A and Q_B over time for different learning rate and temperature parameters. The document provides an example code implementation of this reinforcement learning model in R.
Karl Fristonが提唱している「自由エネルギー原理(free-energy principle = FEP)」について、北大文学部の聴衆を対象にして、物理学や機械学習の知識の前提抜きにして、説明を行い、その意義を説明したものです。FEPの意識研究への応用に向けて、FEPとエナクション説の近接性について強調したものとなっております。
This document summarizes two lectures about consciousness and neuroscience:
1) It discusses theories of consciousness such as qualia and awareness, and the distinction between the dorsal and ventral visual pathways and their roles in vision for action vs perception. It also covers blindsight and the idea that a "feeling of something" without qualia may arise from saliency computation.
2) It discusses using bistable percepts like binocular rivalry to study neural correlates of awareness. It introduces the ideas of neurophenomenology and heterophenomenology to study first-person experience through intersubjective methods. It provides an example of neurophenomenology applied to the aura experience before epileptic seizures.
駒場学部講義2015 総合情報学特論III 「意識の神経科学:「気づき」と「サリエンシー」を手がかりに」Masatoshi Yoshida
1. The document summarizes a lecture about the neural basis of consciousness, focusing on awareness, attention, and the study of blindsight.
2. It discusses evidence that the dorsal visual pathway is involved in vision for action while the ventral pathway is involved in vision for perception.
3. In blindsight, there is a "feeling of something happening" in the blind field that can be explained by saliency computation and sensorimotor contingencies rather than conscious visual experience.
4. Friston K, Breakspear M, Deco G. Perception and self-
organized instability. Front Comput Neurosci. 2012 Jul 6;6:44.
FIg.1 (CC BY 3.0) http://www.frontiersin.org/files/Articles/
23035/fncom-06-00044-r4/image_m/fncom-06-00044-g001.jpg
自由エネルギー最小化原理
44. 自由エネルギー原理 (FEP)
x (or psi)とsは時間的なシークエンス
視覚サリエンスが高い
= そこを見ればqのエン
トロピーを最小化する
場所 => その場所を
control state u(x)のprior
として採用する。
Control state u(x)によって運動aを計画
実際のモデルはこの図のようにもっと複雑なのだけど、
エッセンスはこれでつかめたと思う。
Perceptions as hypotheses: saccades as
experiments. Friston K, Adams RA,
Perrinet L, Breakspear M. Front Psychol.
2012 May 28;3:151.
45. 参考文献
Friston K, Adams RA, Perrinet L, Breakspear M. (2012) Perceptions as hypotheses:
saccades as experiments. Front Psychol. 3:151. (視線移動のモデル及び反実仮想の概
念の初出)
McGregor, Simon; Baltieri, Manuel; Buckley, Christopher L. (2015) A Minimal Active
Inference Agent. arXiv:1503.04187 https://arxiv.org/abs/1503.04187 (このスライドの
説明の元ネタ。離散バージョンのFEPの説明などがあり、これがもっともわかりやす
い。)
Buckley, Christopher L.; Kim, Chang Sub; McGregor, Simon; Seth, Anil K. (2017) The
free energy principle for action and perception: A mathematical review. arXiv:
1705.09156 (省略無しで丁寧な説明。)
Friston, K. (2010). The free-energy principle: a unified brain theory? Nature Reviews
Neuroscience, 11(2), 127–138. (FEPの全体像について網羅的な記述)
Friston K, Rigoli F, Ognibene D, Mathys C, Fitzgerald T, Pezzulo G. (2015) Active
inference and epistemic value. Cogn Neurosci. 2015;6(4):187-214. (視線移動を
epistemic valueと捉えて、他のvalueと対置して扱っている)
Ryota Kanai (2017) Creating Consciousness. https://www.slideshare.net/ryotakanai/
creating-consciousness (FEPの反実仮想についての言及あり。)