feat(datadog): support per-request ml_app override via metadata #25684

Open
liranddd wants to merge 1 commit into BerriAI:main from liranddd:feat/per-request-ml-app-override

Conversation


@liranddd liranddd commented Apr 14, 2026

Summary

  • Allow callers to pass ml_app in request metadata to control the Application column in Datadog LLM Observability
  • Spans are grouped by ml_app at flush time and sent as separate batches (Datadog intake API requires ml_app at batch level)
  • Spans without an override fall back to DD_SERVICE — fully backwards compatible
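The flush-time grouping described above can be sketched roughly as follows (a minimal illustration; the helper name `group_spans_by_ml_app` is hypothetical, though the `_dd_ml_app` hint field is the one the PR adds):

```python
from collections import defaultdict


def group_spans_by_ml_app(log_queue, default_ml_app):
    """Group queued span payloads by their per-request ml_app override.

    Spans carry an internal "_dd_ml_app" hint; spans without one fall
    into the default group. The hint is stripped from the copies that
    are sent, so queue entries are never mutated.
    """
    groups = defaultdict(list)
    for payload in log_queue:
        ml_app = payload.get("_dd_ml_app") or default_ml_app
        # Build a clean copy without the internal routing hint.
        clean = {k: v for k, v in payload.items() if k != "_dd_ml_app"}
        groups[ml_app].append(clean)
    return dict(groups)
```

Each resulting group would then be POSTed as its own batch, since the Datadog intake API takes ml_app at the batch level.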

Closes #20701

Motivation

When multiple services share a single LiteLLM proxy, all LLM traces appear under the same application in Datadog LLM Observability. There is currently no way to distinguish which service made the call. This PR lets callers tag their requests so they appear as distinct applications.

Usage

response = litellm.completion(
    model="gpt-4",
    messages=[{"role": "user", "content": "Hello"}],
    metadata={"ml_app": "my-service-name"}
)

Result in Datadog LLM Obs

Service          Application
litellm-server   my-service-name
litellm-server   other-service
litellm-server   litellm-server (default, no override)

Changes

  • litellm/integrations/datadog/datadog_llm_obs.py — read ml_app from metadata, group batches by it
  • litellm/types/integrations/datadog_llm_obs.py — add internal _dd_ml_app field (stripped before send)
  • tests/test_litellm/integrations/datadog/test_per_request_ml_app.py — 5 new tests

@vercel
vercel bot commented Apr 14, 2026

litellm preview deployment: Ready (Apr 14, 2026 9:30am UTC)


CLAassistant commented Apr 14, 2026

CLA assistant check
All committers have signed the CLA.


codspeed-hq bot commented Apr 14, 2026

Merging this PR will not alter performance

✅ 16 untouched benchmarks


Comparing liranddd:feat/per-request-ml-app-override (4a55027) with main (e64d98f)



greptile-apps bot commented Apr 14, 2026

Greptile Summary

This PR adds per-request ml_app override support for Datadog LLM Observability, allowing multiple services sharing one LiteLLM proxy to appear as distinct applications. Spans are grouped by ml_app at flush time and sent as separate batches; spans without an override fall back to DD_LLMOBS_ML_APP / DD_SERVICE.

  • Two imports (get_datadog_ml_app, safe_dumps) are placed inline inside async_send_batch instead of at module level — a minor style violation per the project's CLAUDE.md guidelines.
  • With multi-group flushing, a partial-success scenario is now reachable: if group A's POST succeeds but group B's raises, log_queue.clear() is skipped and group A's spans will be re-sent on the next flush, producing duplicates in Datadog.
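One way to close that partial-success gap (a sketch of a possible fix, not what the PR implements) is to retain only the groups whose POST failed, instead of keeping or clearing the whole queue:

```python
import asyncio


async def flush_groups(groups, post_batch):
    """Send one batch per ml_app; keep only failed groups for retry.

    `post_batch` is an async callable (ml_app, spans) -> None that
    raises on HTTP errors (an illustrative signature, not LiteLLM's
    actual internal API).
    """
    failed = {}
    for ml_app, spans in groups.items():
        try:
            await post_batch(ml_app, spans)
        except Exception:
            # Retain this group for the next flush; successfully
            # sent groups are dropped, so no duplicates are re-sent.
            failed[ml_app] = spans
    return failed
```

With this shape, a transient error on group B no longer causes group A's already-accepted spans to be re-sent on the next flush.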

Confidence Score: 5/5

  • Safe to merge; all findings are P2 style/quality suggestions that do not block correct operation.
  • The core grouping logic is correct, the previous reviewer's mutation concern is addressed with clean copies, tests are thorough and mock-only, and the change is fully backwards-compatible. Remaining findings are a style violation (inline imports) and a non-critical partial-success edge case that only produces duplicate data under transient errors with ≥2 ml_app values.
  • litellm/integrations/datadog/datadog_llm_obs.py — async_send_batch multi-group partial-success and inline imports.

Important Files Changed

  • litellm/integrations/datadog/datadog_llm_obs.py — Core change: groups log_queue by _dd_ml_app and sends a separate batch per ml_app. The clean-copy approach correctly avoids mutating queue entries. Two inline imports violate project style; multi-group partial success can cause duplicate spans on retry.
  • litellm/integrations/datadog/datadog_handler.py — Adds a get_datadog_ml_app() helper that reads DD_LLMOBS_ML_APP and falls back to get_datadog_service(). Clean, backwards-compatible addition.
  • litellm/types/integrations/datadog_llm_obs.py — Adds an optional _dd_ml_app field to the LLMObsPayload TypedDict as an internal routing hint, documented as stripped before sending. Clean change.
  • tests/test_litellm/integrations/datadog/test_per_request_ml_app.py — 5 new mock-only tests covering the payload field, absence without an override, the env-var default, multi-app grouping, failure-keeps-queue, and success-clears-queue. All mock the HTTP client; no real network calls. Good coverage.

Sequence Diagram

sequenceDiagram
    participant Caller
    participant LiteLLM
    participant DDLLMObsLogger
    participant DatadogAPI

    Caller->>LiteLLM: "completion(model, messages, metadata={ml_app: 'svc-a'})"
    LiteLLM->>DDLLMObsLogger: async_log_success_event(kwargs)
    DDLLMObsLogger->>DDLLMObsLogger: "create_llm_obs_payload()<br/>reads metadata.ml_app → stores _dd_ml_app='svc-a'"
    DDLLMObsLogger->>DDLLMObsLogger: log_queue.append(payload)

    Note over DDLLMObsLogger: On flush (batch_size or periodic)

    DDLLMObsLogger->>DDLLMObsLogger: "async_send_batch()<br/>group spans by _dd_ml_app"

    DDLLMObsLogger->>DatadogAPI: "POST /api/intake/llm-obs/v1/trace/spans<br/>ml_app='svc-a', spans=[…stripped of _dd_ml_app]"
    DatadogAPI-->>DDLLMObsLogger: 202 Accepted

    DDLLMObsLogger->>DatadogAPI: "POST /api/intake/llm-obs/v1/trace/spans<br/>ml_app='svc-b', spans=[…]"
    DatadogAPI-->>DDLLMObsLogger: 202 Accepted

    DDLLMObsLogger->>DatadogAPI: "POST /api/intake/llm-obs/v1/trace/spans<br/>ml_app=DD_LLMOBS_ML_APP (default), spans=[…]"
    DatadogAPI-->>DDLLMObsLogger: 202 Accepted

    DDLLMObsLogger->>DDLLMObsLogger: log_queue.clear()



codecov bot commented Apr 14, 2026

Codecov Report

❌ Patch coverage is 92.30769% with 2 lines in your changes missing coverage. Please review.

Files with missing lines Patch % Lines
litellm/integrations/datadog/datadog_llm_obs.py 91.30% 2 Missing ⚠️


Commit message (4a55027):
Allow callers to pass ml_app in request metadata to control the
Application column in Datadog LLM Observability. Also adds support
for the DD_LLMOBS_ML_APP env var.

Fallback chain: metadata.ml_app → DD_LLMOBS_ML_APP → DD_SERVICE.

Closes BerriAI#20701
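The fallback chain in the commit message can be expressed as a small resolver (a sketch under the assumption that the env vars are read directly; the function name `resolve_ml_app` is illustrative — the PR's actual helper is `get_datadog_ml_app()` in datadog_handler.py):

```python
import os


def resolve_ml_app(metadata, default="litellm-server"):
    """Resolve the ml_app for a span.

    Order: request metadata.ml_app, then the DD_LLMOBS_ML_APP env
    var, then DD_SERVICE, then a hardcoded default (illustrative).
    """
    return (
        (metadata or {}).get("ml_app")
        or os.environ.get("DD_LLMOBS_ML_APP")
        or os.environ.get("DD_SERVICE")
        or default
    )
```

A request carrying `metadata={"ml_app": "svc-a"}` wins over both env vars, which matches the "fully backwards compatible" claim: deployments that set neither metadata nor DD_LLMOBS_ML_APP keep their current DD_SERVICE-based grouping.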


Development

Successfully merging this pull request may close these issues.

[Feature]: Support DD_LLMOBS_ML_APP env var for Datadog LLM Observability

3 participants