├── LICENSE
├── README.md
├── Section 1
    ├── Reinforcement Learning with TensorFlow & TRFL -- Q Learning.ipynb
    ├── Reinforcement Learning with TensorFlow & TRFL -- SARSA & SARSE.ipynb
    └── Reinforcement Learning with TensorFlow & TRFL -- TD Learning.ipynb
├── Section 2
    ├── Deep Q Network and Target Network Updating with TRFL.ipynb
    ├── Distributional Q Learning.ipynb
    └── Persistent Q Learning and Huber Loss.ipynb
├── Section 3
    ├── Actor-Critic.ipynb
    ├── Deterministic Policy Gradients.ipynb
    ├── Policy Entropy Loss.ipynb
    └── Policy Gradient Loss.ipynb
├── Section 4
    ├── Generalized Lambda Returns.ipynb
    ├── Multi-step Forward View.ipynb
    ├── Reinforcement Learning with TensorFlow & TRFL -- Q(λ).ipynb
    └── Reinforcement Learning with TensorFlow & TRFL -- TD(λ).ipynb
└── Section 5
    ├── Reinforcement Learning with TensorFlow & TRFL -- Pixel Control.ipynb
    ├── Reinforcement Learning with TensorFlow & TRFL -- Retrace(λ).ipynb
    └── Reinforcement Learning with TensorFlow & TRFL -- V-Trace.ipynb

/LICENSE:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/LICENSE
--------------------------------------------------------------------------------
/README.md:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/README.md
--------------------------------------------------------------------------------
/Section 1/Reinforcement Learning with TensorFlow & TRFL -- Q Learning.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 1/Reinforcement Learning with TensorFlow & TRFL -- Q Learning.ipynb
--------------------------------------------------------------------------------
/Section 1/Reinforcement Learning with TensorFlow & TRFL -- SARSA & SARSE.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 1/Reinforcement Learning with TensorFlow & TRFL -- SARSA & SARSE.ipynb
--------------------------------------------------------------------------------
/Section 1/Reinforcement Learning with TensorFlow & TRFL -- TD Learning.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 1/Reinforcement Learning with TensorFlow & TRFL -- TD Learning.ipynb
--------------------------------------------------------------------------------
/Section 2/Deep Q Network and Target Network Updating with TRFL.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 2/Deep Q Network and Target Network Updating with TRFL.ipynb
--------------------------------------------------------------------------------
/Section 2/Distributional Q Learning.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 2/Distributional Q Learning.ipynb
--------------------------------------------------------------------------------
/Section 2/Persistent Q Learning and Huber Loss.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 2/Persistent Q Learning and Huber Loss.ipynb
--------------------------------------------------------------------------------
/Section 3/Actor-Critic.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 3/Actor-Critic.ipynb
--------------------------------------------------------------------------------
/Section 3/Deterministic Policy Gradients.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 3/Deterministic Policy Gradients.ipynb
--------------------------------------------------------------------------------
/Section 3/Policy Entropy Loss.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 3/Policy Entropy Loss.ipynb
--------------------------------------------------------------------------------
/Section 3/Policy Gradient Loss.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 3/Policy Gradient Loss.ipynb
--------------------------------------------------------------------------------
/Section 4/Generalized Lambda Returns.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 4/Generalized Lambda Returns.ipynb
--------------------------------------------------------------------------------
/Section 4/Multi-step Forward View.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 4/Multi-step Forward View.ipynb
--------------------------------------------------------------------------------
/Section 4/Reinforcement Learning with TensorFlow & TRFL -- Q(λ).ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 4/Reinforcement Learning with TensorFlow & TRFL -- Q(λ).ipynb
--------------------------------------------------------------------------------
/Section 4/Reinforcement Learning with TensorFlow & TRFL -- TD(λ).ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 4/Reinforcement Learning with TensorFlow & TRFL -- TD(λ).ipynb
--------------------------------------------------------------------------------
/Section 5/Reinforcement Learning with TensorFlow & TRFL -- Pixel Control.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 5/Reinforcement Learning with TensorFlow & TRFL -- Pixel Control.ipynb
--------------------------------------------------------------------------------
/Section 5/Reinforcement Learning with TensorFlow & TRFL -- Retrace(λ).ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 5/Reinforcement Learning with TensorFlow & TRFL -- Retrace(λ).ipynb
--------------------------------------------------------------------------------
/Section 5/Reinforcement Learning with TensorFlow & TRFL -- V-Trace.ipynb:
--------------------------------------------------------------------------------
https://raw.githubusercontent.com/PacktPublishing/Hands-On-Reinforcement-Learning-with-TensorFlow-TRFL/HEAD/Section 5/Reinforcement Learning with TensorFlow & TRFL -- V-Trace.ipynb
--------------------------------------------------------------------------------