Skip to content

✨ add step-wise intermediate rewards #670

@flowerthrower

Description

@flowerthrower

The RL agent currently only learns from terminal rewards. Intermediate rewards lead to more efficient policies.

Mostly implemented in #526

Metadata

Metadata

Assignees

No one assigned

    Labels

    No labels
    No labels

    Type

    No type
    No fields configured for issues without a type.

    Projects

    No projects

    Milestone

    No milestone

    Relationships

    None yet

    Development

    No branches or pull requests

    Issue actions