├── Average Reward Softmax Actor-Critic.ipynb ├── Bandits & Exploration Exploitation-Assignment 1.ipynb ├── Completing the Parameter Study.ipynb ├── Dyna-Q and Dyna-Q+ - assignment 5.ipynb ├── Function Approximation and Control.ipynb ├── Implement your agent.ipynb ├── MoonShot Technologies.ipynb ├── Optimal Policies with Dynamic Programming - Assignment2.ipynb ├── Policy Evaluation in Cliff Walking Environment-assignment 3.ipynb ├── Q-Learning and Expected Sarsa - assignment 4.ipynb ├── README.md └── Semi-gradient TD with a Neural Network.ipynb /Average Reward Softmax Actor-Critic.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Average Reward Softmax Actor-Critic.ipynb -------------------------------------------------------------------------------- /Bandits & Exploration Exploitation-Assignment 1.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Bandits & Exploration Exploitation-Assignment 1.ipynb -------------------------------------------------------------------------------- /Completing the Parameter Study.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Completing the Parameter Study.ipynb -------------------------------------------------------------------------------- /Dyna-Q and Dyna-Q+ - assignment 5.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Dyna-Q and Dyna-Q+ - assignment 5.ipynb -------------------------------------------------------------------------------- /Function Approximation and Control.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Function Approximation and Control.ipynb -------------------------------------------------------------------------------- /Implement your agent.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Implement your agent.ipynb -------------------------------------------------------------------------------- /MoonShot Technologies.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/MoonShot Technologies.ipynb -------------------------------------------------------------------------------- /Optimal Policies with Dynamic Programming - Assignment2.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Optimal Policies with Dynamic Programming - Assignment2.ipynb -------------------------------------------------------------------------------- /Policy Evaluation in Cliff Walking Environment-assignment 3.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Policy Evaluation in Cliff Walking Environment-assignment 3.ipynb -------------------------------------------------------------------------------- /Q-Learning and Expected Sarsa - assignment 4.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Q-Learning and Expected Sarsa - assignment 4.ipynb -------------------------------------------------------------------------------- /README.md: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/README.md -------------------------------------------------------------------------------- /Semi-gradient TD with a Neural Network.ipynb: -------------------------------------------------------------------------------- https://raw.githubusercontent.com/KhashayarRahimi/Reinforcement-Learning-Specialization/HEAD/Semi-gradient TD with a Neural Network.ipynb --------------------------------------------------------------------------------