Hi @xbpeng , I have been training AMP and ADD on some of my hand-animated reference motions (dynamically infeasible) for Go2 and Go2w. The AMP and ADD returns stay at zero throughout training. My guess is that because the discriminator is trained on a single cycle of reference motion, it overfits and classifies every state transition generated by the policy as fake. However, when I run the policies trained with ADD, they do roughly follow the motions, just noticeably worse than a DeepMimic-trained policy. Do you have any suggestions or thoughts on this?
This is a log from training AMP on one cycle of a hand-animated trot motion.
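To sanity-check the overfitting hypothesis, one quick diagnostic is to log the discriminator's outputs on reference transitions versus policy transitions during training. A minimal sketch, assuming a PyTorch-style discriminator network `disc` and transition-feature batches `real_batch` / `fake_batch` (all names hypothetical, not from any specific codebase):

```python
import torch

@torch.no_grad()
def log_disc_stats(disc, real_batch, fake_batch):
    # `disc` maps (s, s') transition features to a scalar logit.
    real_logits = disc(real_batch)  # transitions sampled from the reference motion
    fake_logits = disc(fake_batch)  # transitions sampled from policy rollouts
    # If the discriminator has memorized the single reference cycle, the real
    # logits saturate high and the fake logits saturate low with near-zero
    # overlap, which drives the AMP style reward toward zero for every
    # policy-generated sample.
    print(f"real logits: mean={real_logits.mean():.3f}, std={real_logits.std():.3f}")
    print(f"fake logits: mean={fake_logits.mean():.3f}, std={fake_logits.std():.3f}")
```

If the two distributions are fully separated, that would support the overfitting explanation.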
