natural-language-feedback

Here is 1 public repository matching this topic...

RajatDandekar / RL2F

Replicating 'Reinforcement Learning from Language Feedback' (Klissarov et al., 2026) — Gemma 3 12B, GRPO, multi-turn teacher-student training on Omni-MATH

reinforcement-learning gemma llm paper-replication grpo natural-language-feedback

Updated Apr 5, 2026
Python

Improve this page

Add a description, image, and links to the natural-language-feedback topic page so that developers can more easily learn about it.

Curate this topic

Add this topic to your repo

To associate your repository with the natural-language-feedback topic, visit your repo's landing page and select "manage topics."

Learn more

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

natural-language-feedback

Here is 1 public repository matching this topic...

RajatDandekar / RL2F

Improve this page

Add this topic to your repo