-
Introduction
-
Setting up your Reinforcement Learning Environment
-
Markov Decision Processes
-
Introduction to the OpenAI Gym Interface
-
Q-learning
-
Gym Wrappers
-
Function Approximation and TensorFlow
-
Q-learning with TensorFlow
-
Deep Q-learning
-
Rainbow - Improvements to Deep Q-learning
-
Policy Gradients
-
Advantage Actor-Critic (A2C)
-
Generalized Advantage Estimation (GAE)
-
Trust Region Policy Optimization (TRPO)
-
Proximal Policy Optimization (PPO)
-
Entropy
-
KL-Divergence
-
List of Important Papers
-
Neural Network Design