My implementations of some policy gradient algorithms
- All implementations are written in pytorch
- The algorithms are implemented as classes
- All implementations are encapsulated in a single file
- The algorithms take OpenAI gym envs and Pybullet envs as inputs
- Vanilla Policy Gradient (VPG)
- Deep Deterministic Policy Gradient (DDPG)
- Twin Delayed DDPG (TD3)
- Soft Actor Critic
- Proximal Policy Optimization
- Policy Gradient Methods for Reinforcement Learning with Function Approximation
- Continuous control with deep reinforcement learning
- Addressing Function Approximation Error in Actor-Critic Methods
- Soft Actor-Critic: Off-Policy Maximum Entropy Deep Reinforcement Learning with a Stochastic Actor
- Proximal Policy Optimization Algorithms