-
Notifications
You must be signed in to change notification settings - Fork 27
Pull requests: weval-org/configs
Author
Label
Projects
Milestones
Reviews
Assignee
Sort
Pull requests list
feat(blueprints): Add emergenz-biosecurity-gemini-news-classification-accuracy.yml
#26
opened May 27, 2026 by
emergenz-mm
Loading…
Remove outdated public comment and redaction tasks
#25
opened May 20, 2026 by
joalcip
Loading…
7 tasks
feat(blueprints): Add v5.1-evaluating-ai-performance-in-women-peace-security-scenarios.yml
#24
opened Apr 6, 2026 by
frantj
Loading…
feat(blueprints): Add v5-for-weval-evaluating-ai-performance-in-women-peace-security-scenarios.yml
#22
opened Mar 31, 2026 by
frantj
Loading…
feat(blueprints): Add v4-evaluating-ai-performance-in-women-peace-security-scenarios.yml
#21
opened Mar 31, 2026 by
frantj
Loading…
POMS-SSI-SSDI: LLM Benchmark (open end questions) for US SSI/SSDI eligibility
#19
opened Dec 25, 2025 by
NewJerseyStyle
•
Draft
POMS-SSI-SSDI: LLM Benchmark (Multiple Choice) for US SSI/SSDI eligibility
#18
opened Dec 25, 2025 by
NewJerseyStyle
Loading…
POMS-SSI-SSDI: LLM Benchmark (yes/no questions) for US SSI/SSDI eligibility
#17
opened Dec 25, 2025 by
NewJerseyStyle
Loading…
ProTip!
Type g i on any issue or pull request to go back to the issue listing page.