A UMass undergraduate honors thesis project that trains a mini-cheetah agent to walk in a simulated environment using deep reinforcement learning.
Video Examples:
The model and algorithm used for this project are heavily based on the Concurrent Training of a Control Policy paper, while the design of the code was inspired by https://github.com/leggedrobotics/legged_gym