Hi! Sorry to bother you again!
I have a question regarding the motion representation:
In configs/model/basic_clip.yaml, I see that the dataset uses the following input_feats:
input_feats:
- "body_transl_delta_pelv"
- "body_orient_xy"
- "z_orient_delta"
- "body_pose"
- "body_joints_local_wo_z_rot"
However, during training, I noticed that:
batch["target_motion"] != torch.cat([
batch["body_transl_delta_pelv_target"],
batch["body_orient_xy_target"],
batch["z_orient_delta_target"],
batch["body_pose_target"],
batch["body_joints_local_wo_z_rot_target"]
], dim=-1)
(and similarly for batch["source_motion"])
Does this mean that batch["target_motion"]/["source_motion"] is not simply a concatenation of input_feats? If so, could you clarify how motion_target is constructed from the features?
By the way, I have a question regarding the motion features. Specifically, I would like to know the joint order for:
"body_pose" (21 joints*6)
"body_joints_local_wo_z_rot"(22 joints*3)
I assume these correspond to the SMPL-X model, but could you please confirm the exact joint ordering for these features?

Thank you very much for your help!
Hi! Sorry to bother you again!
I have a question regarding the motion representation:
In
configs/model/basic_clip.yaml, I see that the dataset uses the following input_feats:However, during training, I noticed that:
(and similarly for batch["source_motion"])
Does this mean that batch["target_motion"]/["source_motion"] is not simply a concatenation of input_feats? If so, could you clarify how motion_target is constructed from the features?
By the way, I have a question regarding the motion features. Specifically, I would like to know the joint order for:

"body_pose" (21 joints*6)"body_joints_local_wo_z_rot"(22 joints*3)I assume these correspond to the SMPL-X model, but could you please confirm the exact joint ordering for these features?
Thank you very much for your help!