Problems Encountered in Result Reproduction #43

@newwindhxx

First of all, thank you for your work. I noticed that a large number of training sets are defined in both the SFT and RL code, but some of them are commented out. Which training data should be used to reproduce the results reported in your paper?
In addition, the cosine decay schedule in the SFT code appears to have been removed. Do I need to re-implement it to obtain the reported results?
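
To make the second question concrete, here is a minimal sketch of the kind of schedule I mean: optional linear warmup followed by cosine decay of the learning rate. The function name, arguments, and example values are my own assumptions for illustration and are not taken from the repository or the paper.

```python
import math


def cosine_decay_lr(step, total_steps, base_lr, warmup_steps=0, min_lr=0.0):
    """Learning rate at `step`: linear warmup, then cosine decay to `min_lr`.

    Hypothetical helper for illustration only; not the repository's code.
    """
    if total_steps <= 0:
        raise ValueError("total_steps must be positive")
    if warmup_steps > 0 and step < warmup_steps:
        # Linear warmup: ramps from base_lr / warmup_steps up to base_lr.
        return base_lr * (step + 1) / warmup_steps
    # Fraction of the decay phase completed, clamped to [0, 1].
    decay_steps = max(1, total_steps - warmup_steps)
    progress = min(1.0, max(0.0, (step - warmup_steps) / decay_steps))
    # Half-cosine from base_lr (progress = 0) down to min_lr (progress = 1).
    return min_lr + 0.5 * (base_lr - min_lr) * (1.0 + math.cos(math.pi * progress))


if __name__ == "__main__":
    # Example values are placeholders, not the paper's hyperparameters.
    for s in (0, 25, 50, 75, 100):
        print(s, cosine_decay_lr(s, total_steps=100, base_lr=1e-5))
```

If the reported results do depend on such a schedule, it would help to know the warmup length and the minimum learning rate that were used.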
