Popular repositories Loading
Repositories
Showing 8 of 8 repositories
- IDE-Bench Public
Comprehensive framework for evaluating AI IDE agents on real-world, cross-stack SWE tasks
AfterQuery/IDE-Bench’s past year of commit activity - anvil Public
AfterQuery/anvil’s past year of commit activity - FullStackBoilerplate Public
AfterQuery/FullStackBoilerplate’s past year of commit activity - FinanceQA Public
FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities in Large Language Models
AfterQuery/FinanceQA’s past year of commit activity
People
This organization has no public members. You must be a member to see who’s a part of this organization.
Top languages
Loading…
Most used topics
Loading…