├── README.md └── slides ├── All Lectures.pdf ├── Lecture 1 - introduction.pdf ├── Lecture 10- Approximate Dynamic Programming.pdf ├── Lecture 11- Off-policy and multi-step.pdf ├── Lecture 12- Deep RL 1 .pdf ├── Lecture 13 - Deep RL 2.pdf ├── Lecture 2- Exploration and control_slides.pdf ├── Lecture 3 - MDPs and Dynamic Programming.pdf ├── Lecture 4 - Theoretical Fundamentals of DP Algorithms.pdf ├── Lecture 5 - ModelFreePrediction.pdf ├── Lecture 6 - Model-free control.pdf ├── Lecture 7- Function approximation in reinforcement learning .pdf ├── Lecture 8 - Model Based Reinforcement Learning.pdf └── Lecture 9- Policy gradients and actor critics.pdf /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/README.md -------------------------------------------------------------------------------- /slides/All Lectures.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/All Lectures.pdf -------------------------------------------------------------------------------- /slides/Lecture 1 - introduction.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 1 - introduction.pdf -------------------------------------------------------------------------------- /slides/Lecture 10- Approximate Dynamic Programming.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 10- Approximate Dynamic Programming.pdf -------------------------------------------------------------------------------- /slides/Lecture 11- Off-policy and multi-step.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 11- Off-policy and multi-step.pdf -------------------------------------------------------------------------------- /slides/Lecture 12- Deep RL 1 .pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 12- Deep RL 1 .pdf -------------------------------------------------------------------------------- /slides/Lecture 13 - Deep RL 2.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 13 - Deep RL 2.pdf -------------------------------------------------------------------------------- /slides/Lecture 2- Exploration and control_slides.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 2- Exploration and control_slides.pdf -------------------------------------------------------------------------------- /slides/Lecture 3 - MDPs and Dynamic Programming.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 3 - MDPs and Dynamic Programming.pdf -------------------------------------------------------------------------------- /slides/Lecture 4 - Theoretical Fundamentals of DP Algorithms.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 4 - Theoretical Fundamentals of DP Algorithms.pdf -------------------------------------------------------------------------------- /slides/Lecture 5 - ModelFreePrediction.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 5 - ModelFreePrediction.pdf -------------------------------------------------------------------------------- /slides/Lecture 6 - Model-free control.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 6 - Model-free control.pdf -------------------------------------------------------------------------------- /slides/Lecture 7- Function approximation in reinforcement learning .pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 7- Function approximation in reinforcement learning .pdf -------------------------------------------------------------------------------- /slides/Lecture 8 - Model Based Reinforcement Learning.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 8 - Model Based Reinforcement Learning.pdf -------------------------------------------------------------------------------- /slides/Lecture 9- Policy gradients and actor critics.pdf: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/yjavaherian/deepmind-x-ucl-rl/HEAD/slides/Lecture 9- Policy gradients and actor critics.pdf --------------------------------------------------------------------------------