├── Open-Source.md ├── README.md ├── Reinforcement-Learning-Papers.md └── papers ├── Action-Conditional Video Prediction using Deep Networks in Atari Games.md ├── Continuous Deep Q-Learning with Model-based Acceleration.md ├── Deep Successor Reinforcement Learning.md ├── Generalizing Skills with Semi-Supervised Reinforcement Learning.md ├── High-Dimensional Continuous Control Using Generalized Advantage Estimation.md ├── Human-level control through deep reinforcement learning.md ├── Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution.md ├── Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer.md ├── Learning Tetris Using the Noisy Cross-Entropy Method.md ├── Mastering the game of Go with deep neural networks and tree search.md ├── Noisy Networks for Exploration.md ├── One-Shot Imitation Learning.md ├── Policy Distillation.md ├── Stochastic Neural Network For Hierarchical Reinforcement Learning.md ├── Towards Deep Symbolic Reinforcement Learning.md ├── Unsupervised Perceptual Rewards for Imitation Learning.md └── Value Iteration Networks.md /Open-Source.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/Open-Source.md -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/README.md -------------------------------------------------------------------------------- /Reinforcement-Learning-Papers.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/Reinforcement-Learning-Papers.md -------------------------------------------------------------------------------- /papers/Action-Conditional Video Prediction using Deep Networks in Atari Games.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Action-Conditional Video Prediction using Deep Networks in Atari Games.md -------------------------------------------------------------------------------- /papers/Continuous Deep Q-Learning with Model-based Acceleration.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Continuous Deep Q-Learning with Model-based Acceleration.md -------------------------------------------------------------------------------- /papers/Deep Successor Reinforcement Learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Deep Successor Reinforcement Learning.md -------------------------------------------------------------------------------- /papers/Generalizing Skills with Semi-Supervised Reinforcement Learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Generalizing Skills with Semi-Supervised Reinforcement Learning.md -------------------------------------------------------------------------------- /papers/High-Dimensional Continuous Control Using Generalized Advantage Estimation.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/High-Dimensional Continuous Control Using Generalized Advantage Estimation.md -------------------------------------------------------------------------------- /papers/Human-level control through deep reinforcement learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Human-level control through deep reinforcement learning.md -------------------------------------------------------------------------------- /papers/Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Improving Stochastic Policy Gradients in Continuous Control with Deep Reinforcement Learning using the Beta Distribution.md -------------------------------------------------------------------------------- /papers/Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer.md -------------------------------------------------------------------------------- /papers/Learning Tetris Using the Noisy Cross-Entropy Method.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Learning Tetris Using the Noisy Cross-Entropy Method.md -------------------------------------------------------------------------------- /papers/Mastering the game of Go with deep neural networks and tree search.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Mastering the game of Go with deep neural networks and tree search.md -------------------------------------------------------------------------------- /papers/Noisy Networks for Exploration.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Noisy Networks for Exploration.md -------------------------------------------------------------------------------- /papers/One-Shot Imitation Learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/One-Shot Imitation Learning.md -------------------------------------------------------------------------------- /papers/Policy Distillation.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Policy Distillation.md -------------------------------------------------------------------------------- /papers/Stochastic Neural Network For Hierarchical Reinforcement Learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Stochastic Neural Network For Hierarchical Reinforcement Learning.md -------------------------------------------------------------------------------- /papers/Towards Deep Symbolic Reinforcement Learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Towards Deep Symbolic Reinforcement Learning.md -------------------------------------------------------------------------------- /papers/Unsupervised Perceptual Rewards for Imitation Learning.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Unsupervised Perceptual Rewards for Imitation Learning.md -------------------------------------------------------------------------------- /papers/Value Iteration Networks.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/andrewliao11/Deep-Reinforcement-Learning-Survey/HEAD/papers/Value Iteration Networks.md --------------------------------------------------------------------------------