Official repo for *Learning to Reason for Long-Form Story Generation*, available on arXiv.
This work presents a new RL-reward paradigm, Verifiable Rewards via Completion Likelihood Improvement (VR-CLI), which is then used to train a model to predict useful plans for the next chapter of a story.
This repo contains five parts:
- `setup_data`: Compile a Next-Chapter Prediction (NCP) dataset, used for training and story generation
- `rl_training`: Train a model using our VR-CLI reward paradigm, on either the NCP task or another task of your choosing
- `sft_training`: Train a model using supervised finetuning on the NCP task
- `story_generation`: Generate reasoning and story continuations using either a pretrained model or a model you have trained yourself
- `evaluation`: Replicate our evaluations of the story generation models using human annotations and automated metrics
Consult the `instructions.md` file in each directory for more details.
If you find this work useful, please cite it as follows:
```bibtex
@misc{gurung2025learningreasonlongformstory,
  title={Learning to Reason for Long-Form Story Generation},
  author={Alexander Gurung and Mirella Lapata},
  year={2025},
  eprint={2503.22828},
  archivePrefix={arXiv},
  primaryClass={cs.CL},
  url={https://arxiv.org/abs/2503.22828},
}
```