Different RL algorithms implemented from scratch for the Easy21 card game
This is based on the Easy21 assignment from David Silver's RL course.
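For readers unfamiliar with the game, here is a minimal sketch of the Easy21 environment, following the rules as stated in the assignment. It is not necessarily how this repository implements it: the names `draw_card`, `reset`, `step`, `HIT`, and `STICK` are hypothetical and chosen only for illustration.

```python
import random

# Hypothetical action encoding, chosen for this sketch only.
HIT, STICK = 0, 1


def draw_card(rng, force_black=False):
    """Draw from an infinite deck: value 1-10, black (add) w.p. 2/3, red (subtract) w.p. 1/3."""
    value = rng.randint(1, 10)
    if force_black or rng.random() < 2 / 3:
        return value
    return -value


def is_bust(total):
    """A hand goes bust if its sum exceeds 21 or drops below 1."""
    return total > 21 or total < 1


def reset(rng):
    """Both dealer and player start with one black card. State = (dealer_showing, player_sum)."""
    return (draw_card(rng, force_black=True), draw_card(rng, force_black=True))


def step(state, action, rng):
    """Apply one action and return (next_state, reward, done)."""
    dealer, player = state
    if action == HIT:
        player += draw_card(rng)
        if is_bust(player):
            return (dealer, player), -1, True
        return (dealer, player), 0, False

    # Player sticks: the dealer hits until reaching 17 or more, or going bust.
    while not is_bust(dealer) and dealer < 17:
        dealer += draw_card(rng)

    if is_bust(dealer) or player > dealer:
        reward = 1
    elif player < dealer:
        reward = -1
    else:
        reward = 0
    return (dealer, player), reward, True
```

An episode can be played by calling `reset(random.Random(0))` and then calling `step` repeatedly until `done` is true. Passing the random generator in explicitly keeps runs reproducible.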
For 10,000,000 episodes:
To use: run monte_carlo.py
For 10,000 episodes:
To use: run sarsa.py
For 10,000 episodes:
To use: run sarsa_linear.py
Dependencies: numpy, tqdm, matplotlib, pandas