Problems Encountered in Result Reproduction

First of all, thank you for your work. I noticed that a large number of training sets are defined for both SFT and RL, but some of them are commented out. Which training sets should be used to reproduce the results reported in your paper?
In addition, the code for the cosine decay schedule in SFT has been removed. Do I need to re-implement it to reproduce the reported results?
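
To make the second question concrete, here is a minimal sketch of what I understand by a cosine decay schedule. It is not the code that was removed from the repository: the function name `cosine_decay_lr`, its parameters, and the optional linear warmup are my own assumptions for illustration, and the values used in the paper may differ.

```python
import math


def cosine_decay_lr(step, total_steps, base_lr, min_lr=0.0, warmup_steps=0):
    """Learning rate at `step` under a generic cosine decay schedule.

    Hypothetical helper for illustration only; not taken from the repository.
    """
    # Optional linear warmup (an assumption; the original schedule may not use one).
    if warmup_steps > 0 and step < warmup_steps:
        return base_lr * (step + 1) / warmup_steps

    # Fraction of the decay phase completed, clamped to [0, 1].
    decay_steps = max(1, total_steps - warmup_steps)
    progress = min(1.0, (step - warmup_steps) / decay_steps)

    # Cosine factor goes from 1 (start of decay) to 0 (end of decay).
    cosine = 0.5 * (1.0 + math.cos(math.pi * progress))
    return min_lr + (base_lr - min_lr) * cosine
```

If the intended schedule differs from this (for example in warmup length or minimum learning rate), please let me know which settings were used.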

Problems Encountered in Result Reproduction #43
