Questions about the output of the VLM #3

Thanks for open-sourcing such great work!

I have two questions about the work:

1. Is the target position output by the VLM in the same form as the action output by the VLA, namely a trajectory composed of multiple points?

2. If so, why is this trajectory used to steer the VLA? Why not feed the trajectory output by the VLM directly to the robot for execution?

I would greatly appreciate it if you could help answer my questions!
