Skip to content

CRUX M6: Alignment — 6 missing stories #923

@noahgift

Description

@noahgift

Parent: #917

Scope

RLHF / DPO / PPO-style alignment primitives in aprender-train.

Contracts in scope

  • contracts/crux-D-{03,04,17,18,19,20}-v1.yaml

Exit criteria

  • 6 contracts promoted missingsupported
  • entrenar exposes alignment training loop with falsification tests

Dependencies

Metadata

Metadata

Assignees

No one assigned

    Labels

    cruxCRUX competitive-research-UX specepicEpic — multi-story umbrellaphase-3CRUX phase_3_missing — implement new stories

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions