├── README.md └── algorithm ├── policy gradient ├── A2C.py ├── A3C │ ├── main.py │ ├── model.py │ └── utils.py ├── Actor_Critic.py ├── DDPG │ ├── experience_replay.py │ ├── main.py │ └── model.py ├── DDPG_discrete │ ├── experience_replay.py │ ├── gumbel_softmax.py │ ├── main.py │ └── model.py ├── REINFORCE.py ├── SAC │ ├── experience_replay.py │ ├── main.py │ └── model.py ├── TD3 │ ├── experience_replay.py │ ├── main.py │ └── model.py ├── TRPO │ ├── main.py │ └── model.py └── baseline_REINFORCE.py └── value-based ├── DQN.py ├── DoubleDQN.py ├── DuelingDQN.py └── Sarsa.py /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/README.md -------------------------------------------------------------------------------- /algorithm/policy gradient/A2C.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/A2C.py -------------------------------------------------------------------------------- /algorithm/policy gradient/A3C/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/A3C/main.py -------------------------------------------------------------------------------- /algorithm/policy gradient/A3C/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/A3C/model.py -------------------------------------------------------------------------------- /algorithm/policy gradient/A3C/utils.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/A3C/utils.py -------------------------------------------------------------------------------- /algorithm/policy gradient/Actor_Critic.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/Actor_Critic.py -------------------------------------------------------------------------------- /algorithm/policy gradient/DDPG/experience_replay.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/DDPG/experience_replay.py -------------------------------------------------------------------------------- /algorithm/policy gradient/DDPG/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/DDPG/main.py -------------------------------------------------------------------------------- /algorithm/policy gradient/DDPG/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/DDPG/model.py -------------------------------------------------------------------------------- /algorithm/policy gradient/DDPG_discrete/experience_replay.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/DDPG_discrete/experience_replay.py -------------------------------------------------------------------------------- /algorithm/policy gradient/DDPG_discrete/gumbel_softmax.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/DDPG_discrete/gumbel_softmax.py -------------------------------------------------------------------------------- /algorithm/policy gradient/DDPG_discrete/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/DDPG_discrete/main.py -------------------------------------------------------------------------------- /algorithm/policy gradient/DDPG_discrete/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/DDPG_discrete/model.py -------------------------------------------------------------------------------- /algorithm/policy gradient/REINFORCE.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/REINFORCE.py -------------------------------------------------------------------------------- /algorithm/policy gradient/SAC/experience_replay.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/SAC/experience_replay.py -------------------------------------------------------------------------------- /algorithm/policy gradient/SAC/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/SAC/main.py -------------------------------------------------------------------------------- /algorithm/policy gradient/SAC/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/SAC/model.py -------------------------------------------------------------------------------- /algorithm/policy gradient/TD3/experience_replay.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/TD3/experience_replay.py -------------------------------------------------------------------------------- /algorithm/policy gradient/TD3/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/TD3/main.py -------------------------------------------------------------------------------- /algorithm/policy gradient/TD3/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/TD3/model.py -------------------------------------------------------------------------------- /algorithm/policy gradient/TRPO/main.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/TRPO/main.py -------------------------------------------------------------------------------- /algorithm/policy gradient/TRPO/model.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/TRPO/model.py -------------------------------------------------------------------------------- /algorithm/policy gradient/baseline_REINFORCE.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/policy gradient/baseline_REINFORCE.py -------------------------------------------------------------------------------- /algorithm/value-based/DQN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/value-based/DQN.py -------------------------------------------------------------------------------- /algorithm/value-based/DoubleDQN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/value-based/DoubleDQN.py -------------------------------------------------------------------------------- /algorithm/value-based/DuelingDQN.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/value-based/DuelingDQN.py -------------------------------------------------------------------------------- /algorithm/value-based/Sarsa.py: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/LxzGordon/Deep-Reinforcement-Learning-with-pytorch/HEAD/algorithm/value-based/Sarsa.py --------------------------------------------------------------------------------