├── .vscode
    ├── .ropeproject
    │   ├── config.py
    │   └── objectdb
    └── settings.json
├── A2C
    └── advantage_actor_critic.py
├── A3C
    ├── SharedAdam.py
    ├── __pycache__
    │   ├── SharedAdam.cpython-37.pyc
    │   └── utils.cpython-37.pyc
    ├── a3c_cartpole.py
    └── utils.py
├── AC
    └── actor_critic.py
├── ACER
    └── acer_cartpole.py
├── DDPG
    └── ddpg.py
├── DSAC
    └── distributional_sac_discrete.py
├── ICM_PPO
    └── icm.py
├── PPO_CLIP
    ├── gae_ppo_cartpole.py
    ├── ppo_cartpole.py
    └── ppo_pendulum.py
├── REINFORCE
    └── reinforce.py
├── RND_PPO
    └── rnd.py
├── Readme.md
├── SAC
    ├── sac.py
    └── sac_discrete.py
├── TD3
    └── td3.py
└── TRPO
    └── trpo_gae.py


/.vscode/.ropeproject/config.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/.vscode/.ropeproject/config.py
--------------------------------------------------------------------------------
/.vscode/.ropeproject/objectdb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/.vscode/.ropeproject/objectdb
--------------------------------------------------------------------------------
/.vscode/settings.json:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/.vscode/settings.json
--------------------------------------------------------------------------------
/A2C/advantage_actor_critic.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/A2C/advantage_actor_critic.py
--------------------------------------------------------------------------------
/A3C/SharedAdam.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/A3C/SharedAdam.py
--------------------------------------------------------------------------------
/A3C/__pycache__/SharedAdam.cpython-37.pyc:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/A3C/__pycache__/SharedAdam.cpython-37.pyc
--------------------------------------------------------------------------------
/A3C/__pycache__/utils.cpython-37.pyc:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/A3C/__pycache__/utils.cpython-37.pyc
--------------------------------------------------------------------------------
/A3C/a3c_cartpole.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/A3C/a3c_cartpole.py
--------------------------------------------------------------------------------
/A3C/utils.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/A3C/utils.py
--------------------------------------------------------------------------------
/AC/actor_critic.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/AC/actor_critic.py
--------------------------------------------------------------------------------
/ACER/acer_cartpole.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/ACER/acer_cartpole.py
--------------------------------------------------------------------------------
/DDPG/ddpg.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/DDPG/ddpg.py
--------------------------------------------------------------------------------
/DSAC/distributional_sac_discrete.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/DSAC/distributional_sac_discrete.py
--------------------------------------------------------------------------------
/ICM_PPO/icm.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/ICM_PPO/icm.py
--------------------------------------------------------------------------------
/PPO_CLIP/gae_ppo_cartpole.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/PPO_CLIP/gae_ppo_cartpole.py
--------------------------------------------------------------------------------
/PPO_CLIP/ppo_cartpole.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/PPO_CLIP/ppo_cartpole.py
--------------------------------------------------------------------------------
/PPO_CLIP/ppo_pendulum.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/PPO_CLIP/ppo_pendulum.py
--------------------------------------------------------------------------------
/REINFORCE/reinforce.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/REINFORCE/reinforce.py
--------------------------------------------------------------------------------
/RND_PPO/rnd.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/RND_PPO/rnd.py
--------------------------------------------------------------------------------
/Readme.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/Readme.md
--------------------------------------------------------------------------------
/SAC/sac.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/SAC/sac.py
--------------------------------------------------------------------------------
/SAC/sac_discrete.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/SAC/sac_discrete.py
--------------------------------------------------------------------------------
/TD3/td3.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/TD3/td3.py
--------------------------------------------------------------------------------
/TRPO/trpo_gae.py:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/deligentfool/policy_based_RL/HEAD/TRPO/trpo_gae.py
--------------------------------------------------------------------------------