This branch contains the code for the paper

Average-Reward Off-Policy Policy Evaluation with Function Approximation
Shangtong Zhang, Yi Wan, Richard S. Sutton, Shimon Whiteson (ICML 2021)
.
├── Dockerfile                          # Dependencies
├── requirements.txt                    # Dependencies
├── template_jobs.py                    # Entry point for the experiments (see the usage sketch below)
│   ├── linear_ope_boyans_chain         # Entry point for the Boyan's chain experiments
│   └── neural_ope                      # Entry point for the MuJoCo experiments
├── deep_rl/agent/LinearOPEAgent.py     # GradientDICE / Diff-GQ1 / Diff-GQ2 / Diff-SGQ for Boyan's chain
├── deep_rl/agent/NeuralOPEAgent.py     # GradientDICE / Diff-GQ1 / Diff-GQ2 / Diff-SGQ for MuJoCo
└── template_plot.py                    # Plotting
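As a quick orientation, here is a minimal launcher sketch. It assumes `linear_ope_boyans_chain` and `neural_ope` are module-level functions in `template_jobs.py`, as the tree above suggests; the exact invocation in this branch may differ.

```python
# A hypothetical usage sketch, not the exact commands from this branch.
# Assumes linear_ope_boyans_chain and neural_ope are module-level
# functions defined in template_jobs.py, as the tree above suggests.
from template_jobs import linear_ope_boyans_chain, neural_ope

if __name__ == '__main__':
    linear_ope_boyans_chain()  # run the Boyan's chain (linear) experiments
    neural_ope()               # run the MuJoCo (neural) experiments
```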
I can send the data used for plotting via email upon request.
This branch is based on the DeepRL codebase and has been left unchanged since I completed the paper. Algorithm implementations not used in the paper may be broken and should never be used. Rebasing onto or merging the master branch may take extra effort.