Skip to content

add algorithmic for frontiercs eval#23

Open
jane-jhu wants to merge 7 commits intoace-agent:mainfrom
jane-jhu:feature/add-frontier-cs-eval
Open

add algorithmic for frontiercs eval#23
jane-jhu wants to merge 7 commits intoace-agent:mainfrom
jane-jhu:feature/add-frontier-cs-eval

Conversation

@jane-jhu
Copy link
Copy Markdown

@jane-jhu jane-jhu commented Feb 8, 2026

Expands the evaluation framework by integrating algorithmic tasks from the Frontier-CS dataset.

  • Added a specialized DataProcessor to handle C++ code validation.

  • Updated the prompt generator to support both structured JSON reasoning and raw code generation modes.

  • Introduced utility functions for reliable C++ code extraction and added the underlying dataset files to the repository.

@Alex-q-z Alex-q-z added the dataset New datasets or benchmarks label Feb 23, 2026
@jane-jhu jane-jhu marked this pull request as ready for review March 7, 2026 09:09
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

dataset New datasets or benchmarks

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants