Skip to content
Change the repository type filter

All

    Repositories list

    • The official Python SDK for Eval Protocol
      Python
      MIT License
      16100039Updated Mar 31, 2026Mar 31, 2026
    • Eval Protocol (EP) is an open solution for doing reinforcement learning fine-tuning on existing agents — across any language, container, or framework.
      MDX
      MIT License
      83503Updated Mar 6, 2026Mar 6, 2026
    • Python
      0100Updated Feb 23, 2026Feb 23, 2026
    • geval

      Public
      Native G-Eval benchmark for Eval Protocol
      Python
      MIT License
      0100Updated Feb 22, 2026Feb 22, 2026
    • SWEBench Integration with eval-protocol
      Python
      0101Updated Feb 17, 2026Feb 17, 2026
    • ifbench

      Public
      Add IFBench, source: https://github.com/allenai/IFBench
      Python
      0100Updated Feb 8, 2026Feb 8, 2026
    • Python
      0000Updated Jan 15, 2026Jan 15, 2026
    • quickstart

      Public template
      Quickstart for Eval Protocol walking through an end to end example on how to fine-tune an SVG image generating agent.
      Python
      3510Updated Jan 6, 2026Jan 6, 2026
    • The eval-protocol integration with lilac for dataset subset selection. Example usage repository.
      Python
      0000Updated Jan 2, 2026Jan 2, 2026
    • Text to SQL Quickstart with GEPA and RFT
      Python
      MIT License
      1300Updated Dec 23, 2025Dec 23, 2025
    • Python
      Apache License 2.0
      0000Updated Dec 10, 2025Dec 10, 2025
    • openai-rft-quickstart

      Public template
      Python
      0000Updated Nov 19, 2025Nov 19, 2025
    • trl-final

      Public
      trl integration with eval protocol
      Python
      1000Updated Nov 18, 2025Nov 18, 2025
    • .github

      Public
      0000Updated Nov 12, 2025Nov 12, 2025
    • quickstart-gsm8k

      Public template
      Python
      0000Updated Nov 10, 2025Nov 10, 2025
    • Python
      15100Updated Oct 23, 2025Oct 23, 2025
    • TypeScript
      5200Updated Oct 23, 2025Oct 23, 2025
    • verl

      Public
      verl: Volcano Engine Reinforcement Learning for LLMs
      Python
      Apache License 2.0
      3.6k000Updated Oct 13, 2025Oct 13, 2025
    • arena-hard-auto-judge

      Public template
      Quickstart for eval-protocol and Langfuse
      Python
      16100Updated Oct 2, 2025Oct 2, 2025
    • Python
      0000Updated Aug 25, 2025Aug 25, 2025
    • Mock digitial store app based on chinooks database
      Python
      Apache License 2.0
      0100Updated Aug 14, 2025Aug 14, 2025
    ProTip! When viewing an organization's repositories, you can use the props. filter to filter by custom property.