Skip to content

Questions about the output of VLM #3

@w4nanch1

Description

@w4nanch1

Thanks for opening such good work!

I have two questions about the work:

  1. Is the target position output by the VLM the same as the action output by the VLA—namely, a trajectory composed of multiple points?

  2. If so, why do we need to use this trajectory to steer the VLA? Why not directly feed the trajectory output by the VLM to the robot for execution?

I would greatly appreciate it if you could help answer my questions!

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions