├── 01_CartPole-reinforcement-learning ├── Cartpole_DQN.py ├── IMAGES │ ├── CartPole_test.gif │ ├── image.png │ ├── math.PNG │ ├── testing_model.PNG │ └── training_model.PNG ├── cartpole-dqn.h5 └── cartpole_random.py ├── 02_CartPole-reinforcement-learning_DDQN ├── Cartpole_DDQN.py ├── Cartpole_DDQN_TF2.py └── IMAGES │ ├── DDQN_CartPole-v1.png │ ├── DDQN_CartPole-v1_soft.png │ └── DQN_CartPole-v1.png ├── 03_CartPole-reinforcement-learning_Dueling_DDQN ├── Cartpole_Double_DDQN.py ├── Cartpole_Double_DDQN_TF2.py └── IMAGES │ ├── DDQN_CartPole-v1.png │ ├── DDQN_CartPole-v1_Dueling.png │ └── DQN_CartPole-v1_Dueling.png ├── 04_CartPole-reinforcement-learning_e_greedy_D3QN ├── Cartpole_e_greedy_D3QN.py ├── Cartpole_e_greedy_D3QN_TF2.py └── IMAGES │ └── DDQN_CartPole-v1_Dueling_Greedy.png ├── 05_CartPole-reinforcement-learning_PER_D3QN ├── Cartpole_PER_D3QN.py ├── Cartpole_PER_D3QN_TF2.py ├── IMAGES │ ├── DDQN_CartPole-v1_Dueling.png │ ├── DDQN_CartPole-v1_Dueling_PER.png │ ├── Replay_buffer.png │ └── SumTree.png └── PER.py ├── 06_CartPole-reinforcement-learning_PER_D3QN_CNN ├── Cartpole_PER_D3QN_CNN.py ├── Cartpole_PER_D3QN_CNN_TF2.py ├── PER.py └── random_game.py ├── 07_Pong-reinforcement-learning_DQN_CNN ├── IMAGES │ ├── DDQN_Pong-v0_CNN.png │ ├── DDQN_Pong-v0_Dueling_CNN.png │ ├── DDQN_Pong-v0_Dueling_PER_CNN.png │ └── DQN_Pong-v0_CNN.png ├── Models │ ├── Pong-v0_DDQN_CNN.h5 │ ├── Pong-v0_DDQN_Dueling_CNN.h5 │ ├── Pong-v0_DDQN_Dueling_PER_CNN.h5 │ └── Pong-v0_DQN_CNN.h5 ├── PER.py ├── Pong-v0_DQN_CNN.py └── Pong-v0_DQN_CNN_TF2.py ├── 08_Pong-v0_Policy_gradient ├── IMAGES │ ├── Pong-v0_PG_2.5e-05.png │ └── PongDeterministic-v4_PG_0.0001.png ├── Pong-v0_PG.py └── Pong-v0_PG_TF2.py ├── 09_Pong-v0_A2C ├── IMAGES │ ├── Pong-v0_A2C_2.5e-05.png │ └── PongDeterministic-v4_A2C_2.5e-05.png ├── Pong-v0_A2C.py └── Pong-v0_A2C_TF2.py ├── 10_Pong-v0_A3C ├── Pong-v0_A3C.py ├── Pong-v0_A3C_TF2.py └── PongDeterministic-v4_A3C_2.5e-05.png ├── 11_Pong-v0_PPO ├── Models │ └── Pong-v0_APPO_0.0001_Actor_CNN.h5 ├── Pong-v0_APPO_0.0001_CNN.png ├── Pong-v0_APPO_0.0001_RMSprop.png ├── Pong-v0_PPO.py ├── Pong-v0_PPO_TF2.py ├── Pong-v0_PPO_gif.py ├── PongDeterministic-v4_APPO_0.0001.png ├── gameplay.gif └── gameplay_CNN.gif ├── BipedalWalker-v3_PPO ├── BipedalWalker-v3_PPO.py ├── BipedalWalker-v3_PPO_Actor.h5 ├── BipedalWalker-v3_PPO_Critic.h5 ├── BipedalWalker-v3_training.png └── gameplay.gif ├── LICENSE.md ├── LunarLander-v2_PPO ├── LunarLander-v2.png ├── LunarLander-v2_PPO.py ├── LunarLander-v2_PPO_Actor.h5 ├── LunarLander-v2_PPO_Critic.h5 └── gameplay.gif ├── README.md └── requirements.txt /01_CartPole-reinforcement-learning/Cartpole_DQN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/Cartpole_DQN.py -------------------------------------------------------------------------------- /01_CartPole-reinforcement-learning/IMAGES/CartPole_test.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/IMAGES/CartPole_test.gif -------------------------------------------------------------------------------- /01_CartPole-reinforcement-learning/IMAGES/image.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/IMAGES/image.png -------------------------------------------------------------------------------- /01_CartPole-reinforcement-learning/IMAGES/math.PNG: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/IMAGES/math.PNG -------------------------------------------------------------------------------- /01_CartPole-reinforcement-learning/IMAGES/testing_model.PNG: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/IMAGES/testing_model.PNG -------------------------------------------------------------------------------- /01_CartPole-reinforcement-learning/IMAGES/training_model.PNG: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/IMAGES/training_model.PNG -------------------------------------------------------------------------------- /01_CartPole-reinforcement-learning/cartpole-dqn.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/cartpole-dqn.h5 -------------------------------------------------------------------------------- /01_CartPole-reinforcement-learning/cartpole_random.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/01_CartPole-reinforcement-learning/cartpole_random.py -------------------------------------------------------------------------------- /02_CartPole-reinforcement-learning_DDQN/Cartpole_DDQN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/02_CartPole-reinforcement-learning_DDQN/Cartpole_DDQN.py -------------------------------------------------------------------------------- /02_CartPole-reinforcement-learning_DDQN/Cartpole_DDQN_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/02_CartPole-reinforcement-learning_DDQN/Cartpole_DDQN_TF2.py -------------------------------------------------------------------------------- /02_CartPole-reinforcement-learning_DDQN/IMAGES/DDQN_CartPole-v1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/02_CartPole-reinforcement-learning_DDQN/IMAGES/DDQN_CartPole-v1.png -------------------------------------------------------------------------------- /02_CartPole-reinforcement-learning_DDQN/IMAGES/DDQN_CartPole-v1_soft.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/02_CartPole-reinforcement-learning_DDQN/IMAGES/DDQN_CartPole-v1_soft.png -------------------------------------------------------------------------------- /02_CartPole-reinforcement-learning_DDQN/IMAGES/DQN_CartPole-v1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/02_CartPole-reinforcement-learning_DDQN/IMAGES/DQN_CartPole-v1.png -------------------------------------------------------------------------------- /03_CartPole-reinforcement-learning_Dueling_DDQN/Cartpole_Double_DDQN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/03_CartPole-reinforcement-learning_Dueling_DDQN/Cartpole_Double_DDQN.py -------------------------------------------------------------------------------- /03_CartPole-reinforcement-learning_Dueling_DDQN/Cartpole_Double_DDQN_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/03_CartPole-reinforcement-learning_Dueling_DDQN/Cartpole_Double_DDQN_TF2.py -------------------------------------------------------------------------------- /03_CartPole-reinforcement-learning_Dueling_DDQN/IMAGES/DDQN_CartPole-v1.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/03_CartPole-reinforcement-learning_Dueling_DDQN/IMAGES/DDQN_CartPole-v1.png -------------------------------------------------------------------------------- /03_CartPole-reinforcement-learning_Dueling_DDQN/IMAGES/DDQN_CartPole-v1_Dueling.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/03_CartPole-reinforcement-learning_Dueling_DDQN/IMAGES/DDQN_CartPole-v1_Dueling.png -------------------------------------------------------------------------------- /03_CartPole-reinforcement-learning_Dueling_DDQN/IMAGES/DQN_CartPole-v1_Dueling.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/03_CartPole-reinforcement-learning_Dueling_DDQN/IMAGES/DQN_CartPole-v1_Dueling.png -------------------------------------------------------------------------------- /04_CartPole-reinforcement-learning_e_greedy_D3QN/Cartpole_e_greedy_D3QN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/04_CartPole-reinforcement-learning_e_greedy_D3QN/Cartpole_e_greedy_D3QN.py -------------------------------------------------------------------------------- /04_CartPole-reinforcement-learning_e_greedy_D3QN/Cartpole_e_greedy_D3QN_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/04_CartPole-reinforcement-learning_e_greedy_D3QN/Cartpole_e_greedy_D3QN_TF2.py -------------------------------------------------------------------------------- /04_CartPole-reinforcement-learning_e_greedy_D3QN/IMAGES/DDQN_CartPole-v1_Dueling_Greedy.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/04_CartPole-reinforcement-learning_e_greedy_D3QN/IMAGES/DDQN_CartPole-v1_Dueling_Greedy.png -------------------------------------------------------------------------------- /05_CartPole-reinforcement-learning_PER_D3QN/Cartpole_PER_D3QN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/05_CartPole-reinforcement-learning_PER_D3QN/Cartpole_PER_D3QN.py -------------------------------------------------------------------------------- /05_CartPole-reinforcement-learning_PER_D3QN/Cartpole_PER_D3QN_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/05_CartPole-reinforcement-learning_PER_D3QN/Cartpole_PER_D3QN_TF2.py -------------------------------------------------------------------------------- /05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/DDQN_CartPole-v1_Dueling.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/DDQN_CartPole-v1_Dueling.png -------------------------------------------------------------------------------- /05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/DDQN_CartPole-v1_Dueling_PER.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/DDQN_CartPole-v1_Dueling_PER.png -------------------------------------------------------------------------------- /05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/Replay_buffer.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/Replay_buffer.png -------------------------------------------------------------------------------- /05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/SumTree.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/05_CartPole-reinforcement-learning_PER_D3QN/IMAGES/SumTree.png -------------------------------------------------------------------------------- /05_CartPole-reinforcement-learning_PER_D3QN/PER.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/05_CartPole-reinforcement-learning_PER_D3QN/PER.py -------------------------------------------------------------------------------- /06_CartPole-reinforcement-learning_PER_D3QN_CNN/Cartpole_PER_D3QN_CNN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/06_CartPole-reinforcement-learning_PER_D3QN_CNN/Cartpole_PER_D3QN_CNN.py -------------------------------------------------------------------------------- /06_CartPole-reinforcement-learning_PER_D3QN_CNN/Cartpole_PER_D3QN_CNN_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/06_CartPole-reinforcement-learning_PER_D3QN_CNN/Cartpole_PER_D3QN_CNN_TF2.py -------------------------------------------------------------------------------- /06_CartPole-reinforcement-learning_PER_D3QN_CNN/PER.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/06_CartPole-reinforcement-learning_PER_D3QN_CNN/PER.py -------------------------------------------------------------------------------- /06_CartPole-reinforcement-learning_PER_D3QN_CNN/random_game.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/06_CartPole-reinforcement-learning_PER_D3QN_CNN/random_game.py -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DDQN_Pong-v0_CNN.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DDQN_Pong-v0_CNN.png -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DDQN_Pong-v0_Dueling_CNN.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DDQN_Pong-v0_Dueling_CNN.png -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DDQN_Pong-v0_Dueling_PER_CNN.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DDQN_Pong-v0_Dueling_PER_CNN.png -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DQN_Pong-v0_CNN.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/IMAGES/DQN_Pong-v0_CNN.png -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DDQN_CNN.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DDQN_CNN.h5 -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DDQN_Dueling_CNN.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DDQN_Dueling_CNN.h5 -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DDQN_Dueling_PER_CNN.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DDQN_Dueling_PER_CNN.h5 -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DQN_CNN.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/Models/Pong-v0_DQN_CNN.h5 -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/PER.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/PER.py -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/Pong-v0_DQN_CNN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/Pong-v0_DQN_CNN.py -------------------------------------------------------------------------------- /07_Pong-reinforcement-learning_DQN_CNN/Pong-v0_DQN_CNN_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/07_Pong-reinforcement-learning_DQN_CNN/Pong-v0_DQN_CNN_TF2.py -------------------------------------------------------------------------------- /08_Pong-v0_Policy_gradient/IMAGES/Pong-v0_PG_2.5e-05.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/08_Pong-v0_Policy_gradient/IMAGES/Pong-v0_PG_2.5e-05.png -------------------------------------------------------------------------------- /08_Pong-v0_Policy_gradient/IMAGES/PongDeterministic-v4_PG_0.0001.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/08_Pong-v0_Policy_gradient/IMAGES/PongDeterministic-v4_PG_0.0001.png -------------------------------------------------------------------------------- /08_Pong-v0_Policy_gradient/Pong-v0_PG.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/08_Pong-v0_Policy_gradient/Pong-v0_PG.py -------------------------------------------------------------------------------- /08_Pong-v0_Policy_gradient/Pong-v0_PG_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/08_Pong-v0_Policy_gradient/Pong-v0_PG_TF2.py -------------------------------------------------------------------------------- /09_Pong-v0_A2C/IMAGES/Pong-v0_A2C_2.5e-05.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/09_Pong-v0_A2C/IMAGES/Pong-v0_A2C_2.5e-05.png -------------------------------------------------------------------------------- /09_Pong-v0_A2C/IMAGES/PongDeterministic-v4_A2C_2.5e-05.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/09_Pong-v0_A2C/IMAGES/PongDeterministic-v4_A2C_2.5e-05.png -------------------------------------------------------------------------------- /09_Pong-v0_A2C/Pong-v0_A2C.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/09_Pong-v0_A2C/Pong-v0_A2C.py -------------------------------------------------------------------------------- /09_Pong-v0_A2C/Pong-v0_A2C_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/09_Pong-v0_A2C/Pong-v0_A2C_TF2.py -------------------------------------------------------------------------------- /10_Pong-v0_A3C/Pong-v0_A3C.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/10_Pong-v0_A3C/Pong-v0_A3C.py -------------------------------------------------------------------------------- /10_Pong-v0_A3C/Pong-v0_A3C_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/10_Pong-v0_A3C/Pong-v0_A3C_TF2.py -------------------------------------------------------------------------------- /10_Pong-v0_A3C/PongDeterministic-v4_A3C_2.5e-05.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/10_Pong-v0_A3C/PongDeterministic-v4_A3C_2.5e-05.png -------------------------------------------------------------------------------- /11_Pong-v0_PPO/Models/Pong-v0_APPO_0.0001_Actor_CNN.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/Models/Pong-v0_APPO_0.0001_Actor_CNN.h5 -------------------------------------------------------------------------------- /11_Pong-v0_PPO/Pong-v0_APPO_0.0001_CNN.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/Pong-v0_APPO_0.0001_CNN.png -------------------------------------------------------------------------------- /11_Pong-v0_PPO/Pong-v0_APPO_0.0001_RMSprop.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/Pong-v0_APPO_0.0001_RMSprop.png -------------------------------------------------------------------------------- /11_Pong-v0_PPO/Pong-v0_PPO.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/Pong-v0_PPO.py -------------------------------------------------------------------------------- /11_Pong-v0_PPO/Pong-v0_PPO_TF2.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/Pong-v0_PPO_TF2.py -------------------------------------------------------------------------------- /11_Pong-v0_PPO/Pong-v0_PPO_gif.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/Pong-v0_PPO_gif.py -------------------------------------------------------------------------------- /11_Pong-v0_PPO/PongDeterministic-v4_APPO_0.0001.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/PongDeterministic-v4_APPO_0.0001.png -------------------------------------------------------------------------------- /11_Pong-v0_PPO/gameplay.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/gameplay.gif -------------------------------------------------------------------------------- /11_Pong-v0_PPO/gameplay_CNN.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/11_Pong-v0_PPO/gameplay_CNN.gif -------------------------------------------------------------------------------- /BipedalWalker-v3_PPO/BipedalWalker-v3_PPO.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/BipedalWalker-v3_PPO/BipedalWalker-v3_PPO.py -------------------------------------------------------------------------------- /BipedalWalker-v3_PPO/BipedalWalker-v3_PPO_Actor.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/BipedalWalker-v3_PPO/BipedalWalker-v3_PPO_Actor.h5 -------------------------------------------------------------------------------- /BipedalWalker-v3_PPO/BipedalWalker-v3_PPO_Critic.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/BipedalWalker-v3_PPO/BipedalWalker-v3_PPO_Critic.h5 -------------------------------------------------------------------------------- /BipedalWalker-v3_PPO/BipedalWalker-v3_training.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/BipedalWalker-v3_PPO/BipedalWalker-v3_training.png -------------------------------------------------------------------------------- /BipedalWalker-v3_PPO/gameplay.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/BipedalWalker-v3_PPO/gameplay.gif -------------------------------------------------------------------------------- /LICENSE.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/LICENSE.md -------------------------------------------------------------------------------- /LunarLander-v2_PPO/LunarLander-v2.png: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/LunarLander-v2_PPO/LunarLander-v2.png -------------------------------------------------------------------------------- /LunarLander-v2_PPO/LunarLander-v2_PPO.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/LunarLander-v2_PPO/LunarLander-v2_PPO.py -------------------------------------------------------------------------------- /LunarLander-v2_PPO/LunarLander-v2_PPO_Actor.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/LunarLander-v2_PPO/LunarLander-v2_PPO_Actor.h5 -------------------------------------------------------------------------------- /LunarLander-v2_PPO/LunarLander-v2_PPO_Critic.h5: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/LunarLander-v2_PPO/LunarLander-v2_PPO_Critic.h5 -------------------------------------------------------------------------------- /LunarLander-v2_PPO/gameplay.gif: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/LunarLander-v2_PPO/gameplay.gif -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/README.md -------------------------------------------------------------------------------- /requirements.txt: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/pythonlessons/Reinforcement_Learning/HEAD/requirements.txt --------------------------------------------------------------------------------