A curated list of Diffusion Model in RL resources (continually updated)
-
Updated
Dec 15, 2025
A curated list of Diffusion Model in RL resources (continually updated)
Context-Aware Taxi Dispatching at City-Scale Using Deep Reinforcement Learning
Soft Actor-Critic with advanced features
Modular Single-file Reinfocement Learning Algorithms Library
Using Reinforcement Learning on S&P500 dataset to predict the future stock prices. The implementation uses deep Q-learning model along with time series modeling to achieve the goal state.
Proximal Policy Optimization(PPO) with Keras Implementation
[IROS 2023] Value-Informed Skill Chaining for Policy Learning of Long-Horizon Tasks with Surgical Robot
Training a drone for altitude control using Reinforcement Learning, ROS2, PX4, and Gazebo Harmonic
Context & Guide For Reinforcement Learning with Verifiable Rewards with Large Language Models
Complete implementation of the AlphaZero algorithm
[AAAI25] Scaling Combinatorial Optimization Neural Improvement Heuristics with Search and Online Adaptation
Reinfocement Learning Approach to solve Shortest Path Problem.
Code and Report relating to the University Group Assigned Practical Task
Python code that runs a full PinBall experience, with 3 different ways to control the PinBall machine. This includes manual, with a bot, and with AI.
A modular, extensible, entity-component-system (ECS) gridworld environment
A reinforcement learning project that trains an agent to play the classic Snake game using Proximal Policy Optimization (PPO) from Stable-Baselines3.
Some algorithms of reinforcement learning.
Official implementation of our research paper. DOI: 10.1109/JIOT.2024.3360882
This is a reinforcement learning based project where multiple agents are created and trained to learn the play the flappy bird game.
Add a description, image, and links to the reinfocement-learning topic page so that developers can more easily learn about it.
To associate your repository with the reinfocement-learning topic, visit your repo's landing page and select "manage topics."