Soft-Actor-Critic-PyTorch

An implementation of Soft Actor-Critic (SAC) in PyTorch. It does not include automatic temperature optimization, which is left as future work. The agent was trained for 1000 games (about 1 million steps) on the Gymnasium MuJoCo environment HalfCheetah-v5.
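Without automatic temperature optimization, the entropy coefficient stays constant in the critic target. The following is a minimal sketch of that target for a single transition, written in plain Python so it runs without dependencies. It is not taken from sac.py: the names `soft_q_target`, `alpha`, and `gamma`, the default values, and the use of two target critics are assumptions for illustration, and the repository may structure this differently (for example with a separate value network).

```python
def soft_q_target(reward, done, q1_next, q2_next, log_prob_next,
                  alpha=0.2, gamma=0.99):
    """Soft Bellman target for one transition with a fixed temperature.

    y = r + gamma * (1 - done) * (min(Q1', Q2') - alpha * log pi(a'|s'))
    """
    # Clipped double-Q: use the smaller of the two target critic estimates.
    min_q_next = min(q1_next, q2_next)
    # Entropy bonus: alpha is a constant here (assumed value, no tuning).
    soft_value = min_q_next - alpha * log_prob_next
    # Terminal transitions do not bootstrap from the next state.
    return reward + gamma * (1.0 - float(done)) * soft_value


# Example: one non-terminal transition with hypothetical numbers.
target = soft_q_target(reward=1.0, done=False, q1_next=10.0,
                       q2_next=9.0, log_prob_next=-2.0)
print(target)
```

In a PyTorch training loop the same expression would typically be applied elementwise to batched tensors, with the target computed under `torch.no_grad()`.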

Running Average Plot for Half-Cheetah

The average return curve has not yet saturated, so the agent would likely improve further with a few thousand more games of training.
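A running average of per-episode returns, the quantity this plot tracks, can be computed as sketched here. The function name `running_average` and the 100-episode default window are assumptions; the repository does not state which window it uses.

```python
def running_average(returns, window=100):
    """Mean of the most recent `window` returns at each episode.

    Early episodes average over however many returns exist so far.
    """
    averages = []
    total = 0.0
    for i, r in enumerate(returns):
        total += r
        if i >= window:
            # Drop the return that just fell out of the window.
            total -= returns[i - window]
        count = min(i + 1, window)
        averages.append(total / count)
    return averages


# Example with a small window so the effect is easy to follow.
smoothed = running_average([1.0, 2.0, 3.0, 4.0], window=2)
print(smoothed)
```

A curve like this that is still rising at the end of training is what suggests the agent has not yet converged.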

Repository files:

saved
README.md
sac.py
test.py
utils.py

Simulation GIF
