├── Chapter02 └── Code.ipynb ├── Chapter03 ├── frozenlake8x8_policyiteration.py └── frozenlake8x8_valueiteration.py ├── Chapter04 └── SARSA Q_learning Taxi-v2.py ├── Chapter05 ├── .ipynb_checkpoints │ └── Untitled-checkpoint.ipynb ├── DQN_Atari.py ├── DQN_variations_Atari.py ├── Untitled.ipynb ├── atari_wrappers.py └── untitled ├── Chapter06 ├── AC.py ├── REINFORCE.py └── REINFORCE_baseline.py ├── Chapter07 ├── PPO.py └── TRPO.py ├── Chapter08 ├── DDPG.py └── TD3.py ├── Chapter09 └── ME-TRPO.py ├── Chapter10 ├── DAgger.py └── expert │ ├── checkpoint │ ├── model.ckpt.data-00000-of-00001 │ ├── model.ckpt.index │ └── model.ckpt.meta ├── Chapter11 └── ES.py ├── Chapter12 └── ESBAS.py └── README.md /Chapter02/Code.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter02/Code.ipynb -------------------------------------------------------------------------------- /Chapter03/frozenlake8x8_policyiteration.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter03/frozenlake8x8_policyiteration.py -------------------------------------------------------------------------------- /Chapter03/frozenlake8x8_valueiteration.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter03/frozenlake8x8_valueiteration.py -------------------------------------------------------------------------------- /Chapter04/SARSA Q_learning Taxi-v2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter04/SARSA Q_learning Taxi-v2.py -------------------------------------------------------------------------------- /Chapter05/.ipynb_checkpoints/Untitled-checkpoint.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter05/.ipynb_checkpoints/Untitled-checkpoint.ipynb -------------------------------------------------------------------------------- /Chapter05/DQN_Atari.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter05/DQN_Atari.py -------------------------------------------------------------------------------- /Chapter05/DQN_variations_Atari.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter05/DQN_variations_Atari.py -------------------------------------------------------------------------------- /Chapter05/Untitled.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter05/Untitled.ipynb -------------------------------------------------------------------------------- /Chapter05/atari_wrappers.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter05/atari_wrappers.py -------------------------------------------------------------------------------- /Chapter05/untitled: -------------------------------------------------------------------------------- 1 | -------------------------------------------------------------------------------- /Chapter06/AC.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter06/AC.py -------------------------------------------------------------------------------- /Chapter06/REINFORCE.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter06/REINFORCE.py -------------------------------------------------------------------------------- /Chapter06/REINFORCE_baseline.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter06/REINFORCE_baseline.py -------------------------------------------------------------------------------- /Chapter07/PPO.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter07/PPO.py -------------------------------------------------------------------------------- /Chapter07/TRPO.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter07/TRPO.py -------------------------------------------------------------------------------- /Chapter08/DDPG.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter08/DDPG.py -------------------------------------------------------------------------------- /Chapter08/TD3.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter08/TD3.py -------------------------------------------------------------------------------- /Chapter09/ME-TRPO.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter09/ME-TRPO.py -------------------------------------------------------------------------------- /Chapter10/DAgger.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter10/DAgger.py -------------------------------------------------------------------------------- /Chapter10/expert/checkpoint: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter10/expert/checkpoint -------------------------------------------------------------------------------- /Chapter10/expert/model.ckpt.data-00000-of-00001: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter10/expert/model.ckpt.data-00000-of-00001 -------------------------------------------------------------------------------- /Chapter10/expert/model.ckpt.index: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter10/expert/model.ckpt.index -------------------------------------------------------------------------------- /Chapter10/expert/model.ckpt.meta: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter10/expert/model.ckpt.meta -------------------------------------------------------------------------------- /Chapter11/ES.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter11/ES.py -------------------------------------------------------------------------------- /Chapter12/ESBAS.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/Chapter12/ESBAS.py -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/PacktPublishing/Reinforcement-Learning-Algorithms-with-Python/HEAD/README.md --------------------------------------------------------------------------------