Skip to content

Popular repositories Loading

  1. vader vader Public

    Java 10 2

  2. FinanceQA FinanceQA Public

    FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities in Large Language Models

    6

  3. IDE-Bench IDE-Bench Public

    Comprehensive framework for evaluating AI IDE agents on real-world, cross-stack SWE tasks

    Python 2 7

  4. anvil anvil Public

    Python 1 1

  5. FullStackBoilerplate FullStackBoilerplate Public

    Python 7

  6. appbench.ai-docs appbench.ai-docs Public

    public documentation for appbench.ai

Repositories

Showing 8 of 8 repositories

People

This organization has no public members. You must be a member to see who’s a part of this organization.

Top languages

Loading…

Most used topics

Loading…