├── .gitattributes
├── .gitignore
├── .python-version
├── 01_simple_rl.ipynb
├── 02_q_learning.ipynb
├── 03_sarsa.ipynb
├── 04_expected_sarsa.ipynb
├── 05_dyna_q.ipynb
├── 06_reinforce.ipynb
├── 07_ppo.ipynb
├── 08_a2c.ipynb
├── 09_a3c.ipynb
├── 10_ddpg.ipynb
├── 11_sac.ipynb
├── 12_trpo.ipynb
├── 13_dqn.ipynb
├── 14_maddpg.ipynb
├── 15_qmix.ipynb
├── 16_hac.ipynb
├── 17_mcts.ipynb
├── 18_planet.ipynb
├── LICENSE
├── README.md
├── a3c_training.py
├── cheatsheet.md
├── main.py
├── pyproject.toml
├── requirements.txt
└── uv.lock

/.gitattributes:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/.gitattributes
--------------------------------------------------------------------------------
/.gitignore:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/.gitignore
--------------------------------------------------------------------------------
/.python-version:
--------------------------------------------------------------------------------
3.13
--------------------------------------------------------------------------------
/01_simple_rl.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/01_simple_rl.ipynb
--------------------------------------------------------------------------------
/02_q_learning.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/02_q_learning.ipynb
--------------------------------------------------------------------------------
/03_sarsa.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/03_sarsa.ipynb
--------------------------------------------------------------------------------
/04_expected_sarsa.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/04_expected_sarsa.ipynb
--------------------------------------------------------------------------------
/05_dyna_q.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/05_dyna_q.ipynb
--------------------------------------------------------------------------------
/06_reinforce.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/06_reinforce.ipynb
--------------------------------------------------------------------------------
/07_ppo.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/07_ppo.ipynb
--------------------------------------------------------------------------------
/08_a2c.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/08_a2c.ipynb
--------------------------------------------------------------------------------
/09_a3c.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/09_a3c.ipynb
--------------------------------------------------------------------------------
/10_ddpg.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/10_ddpg.ipynb
--------------------------------------------------------------------------------
/11_sac.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/11_sac.ipynb
--------------------------------------------------------------------------------
/12_trpo.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/12_trpo.ipynb
--------------------------------------------------------------------------------
/13_dqn.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/13_dqn.ipynb
--------------------------------------------------------------------------------
/14_maddpg.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/14_maddpg.ipynb
--------------------------------------------------------------------------------
/15_qmix.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/15_qmix.ipynb
--------------------------------------------------------------------------------
/16_hac.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/16_hac.ipynb
--------------------------------------------------------------------------------
/17_mcts.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/17_mcts.ipynb
--------------------------------------------------------------------------------
/18_planet.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/18_planet.ipynb
--------------------------------------------------------------------------------
/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/README.md
--------------------------------------------------------------------------------
/a3c_training.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/a3c_training.py
--------------------------------------------------------------------------------
/cheatsheet.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/cheatsheet.md
--------------------------------------------------------------------------------
/main.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/main.py
--------------------------------------------------------------------------------
/pyproject.toml:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/pyproject.toml
--------------------------------------------------------------------------------
/requirements.txt:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/requirements.txt
--------------------------------------------------------------------------------
/uv.lock:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/FareedKhan-dev/all-rl-algorithms/HEAD/uv.lock
--------------------------------------------------------------------------------