feat(edge-agentic): add BFCL v4 edge-agentic workload to edge benchma…#331

Draft

Palanivelg wants to merge 6 commits into

mlcommons:masterfrom

Palanivelg:feat/edge-agentic-rules

Palanivelg commented Jun 8, 2026

Summary

Adds the Edge Agentic workload (BFCL v4 function-calling accuracy) to the
MLPerf Inference edge benchmarks and required-scenarios tables.

Changes

Edge benchmarks table: new row — Language | Agentic Function Calling | Qwen3.6-27B | gorilla-llm/gorilla-eval-set (BFCL v4)
Edge required-scenarios table: new row — Language | Agentic Function Calling | Offline (accuracy-only)

TBD (blocking merge)

QSL size — pending WG agreement
Multi-turn overall accuracy threshold — pending full-dataset run
Accuracy thresholds (99% of reference) — pending WG review

Related

Reference implementation: feat(bfcl): add BFCL v4 edge-agentic accuracy + performance integration endpoints#346
Inference catalog entry: mlcommons/inference language/edge-agentic


          feat(edge-agentic): add BFCL v4 edge-agentic workload to edge benchma…

de542d1

…rks table

github-actions Bot commented Jun 8, 2026 •

edited

Loading

MLCommons CLA bot All contributors have signed the MLCommons CLA ✍️ ✅

Palanivelg added 5 commits

June 8, 2026 11:04


          fix(edge-agentic): move rows to correct edge tables

606b490


          Update inference_rules.adoc

1a3bb17


          Update inference_rules.adoc

82713a7


          Update inference_rules.adoc

c0d438e


          feat(edge-agentic): finalize single-turn gate (995 samples, 3% one-si…

58e10e6

…ded 0.97x band)

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet