
Added RoboReward model#3354

Open
int-smart wants to merge 29 commits into huggingface:main from int-smart:feat/add-robo-reward

Conversation

@int-smart

Title

Added RoboReward, as described in the paper arXiv:2601.00675

Type / Scope

  • Type: Feature

Summary / Motivation

  • One-paragraph description of what changes and why.
  • Why this change is needed and any trade-offs or design notes.

Related issues

What changed

  • Short, concrete bullets of the modifications (files/behaviour).
  • Short note if this introduces breaking changes and migration steps.

How was this tested (or how to run locally)

  • Tests added: test_robo_reward.py

Example:

  • Ran the relevant tests:

    pytest tests/rewards/test_robo_reward.py -v

Checklist (required before merge)

  • Linting/formatting run (pre-commit run -a)
  • All tests pass locally (pytest)
  • Documentation updated
  • CI is green

Reviewer notes

  • Anything the reviewer should focus on (performance, edge-cases, specific files) or general notes.
  • Anyone in the community is free to review the PR.

@github-actions github-actions bot added the documentation, policies, tests, configuration, processor, and examples labels Apr 11, 2026
@xianglunkai

Very nice!
Could you give some examples of VLA or RL demos?

@s1lent4gnt s1lent4gnt self-assigned this Apr 12, 2026
@int-smart
Author

@xianglunkai Could you explain more about what you mean by VLA demos?

@philipmit
Collaborator

Hi @int-smart, thanks for your work on this! The following tests are failing. I think the first failure is caused by the __post_init__ function in configuration_robo_reward.py (where input_features is updated to include the provided image_key). The other two failures appear to come from issues in the _make_mock_vlm_and_processor helper used in the test code.

FAILED tests/rewards/test_robo_reward.py::test_robo_reward_config_validate_features_missing_key - Failed: DID NOT RAISE <class 'ValueError'>
FAILED tests/rewards/test_robo_reward.py::test_compute_reward_shape_single_frame - StopIteration
FAILED tests/rewards/test_robo_reward.py::test_compute_reward_score_mapping - StopIteration
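For the first failure: if `__post_init__` silently inserts the provided `image_key` into `input_features`, a config constructed with a missing key is "repaired" before validation can run, so the `ValueError` the test expects is never raised. A minimal sketch of the validate-instead-of-patch approach, using hypothetical field names based on the comment above (the real `configuration_robo_reward.py` may differ):

```python
from dataclasses import dataclass, field

@dataclass
class RoboRewardConfig:
    # Hypothetical fields inferred from the discussion; the real config may differ.
    image_key: str = "observation.images.top"
    input_features: dict = field(default_factory=dict)

    def __post_init__(self):
        # Adding image_key to input_features here would mask a missing feature,
        # so test_robo_reward_config_validate_features_missing_key never sees a
        # ValueError. Validate the provided features instead of patching them.
        if self.image_key not in self.input_features:
            raise ValueError(
                f"image_key '{self.image_key}' not found in input_features: "
                f"{sorted(self.input_features)}"
            )

# A config missing the image key now fails fast:
try:
    RoboRewardConfig(input_features={"observation.state": "float32"})
except ValueError as err:
    print(f"raised: {err}")
```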

…ard calculation. Corrected the exhaustion of the iterator.
@philipmit
Collaborator

@int-smart Great, all tests are passing now! New issue: I get the error below when trying your quick-start code in robo_reward.mdx. The error doesn't happen when testing a batch with just one image or one video.

batch = {
    "observation.images.top": torch.rand(2, 3, 480, 640),   # (B, C, H, W)
    "observation.language_instruction": [
        "pick up the red cube",
        "place the block on the tray",
    ],
}
rewards = model.compute_reward(batch)  # tensor([0.75, 0.50])

Error

ValueError: expected sequence of length 489 at dim 1 (got 490)
...
ValueError: Unable to create tensor, you should probably activate padding with 'padding=True' to have batched tensors with the same length.
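The error suggests the two instructions tokenize to different lengths (489 vs. 490 tokens) and the processor is asked to build a single rectangular tensor without padding. A minimal, library-independent illustration of the failure mode and the usual fix (the `pad_batch` helper is purely illustrative):

```python
# Two tokenized instructions of unequal length cannot be stacked into one
# rectangular batch; this is what triggers the "activate padding" ValueError.
seqs = [[1] * 489, [2] * 490]  # stand-ins for token-id lists of length 489 / 490

def pad_batch(sequences, pad_id=0):
    """Right-pad every sequence to the batch maximum; return ids + attention mask."""
    max_len = max(len(s) for s in sequences)
    ids = [s + [pad_id] * (max_len - len(s)) for s in sequences]
    mask = [[1] * len(s) + [0] * (max_len - len(s)) for s in sequences]
    return ids, mask

ids, mask = pad_batch(seqs)
assert all(len(row) == 490 for row in ids)  # now rectangular
```

In practice the fix is probably to pass `padding=True` to the processor call inside `compute_reward` (e.g. `processor(text=instructions, images=images, padding=True, return_tensors="pt")`), so batches with differing instruction lengths are padded and masked automatically.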


Development

Successfully merging this pull request may close these issues.

Reward Models: call for contributions

5 participants