[bot] No Cohere usage extraction for chat, embeddings, or rerank responses

## Summary

The SDK provides usage extractors for OpenAI (`extract_openai_usage`) and Anthropic (`extract_anthropic_usage`) but has no equivalent for Cohere. Cohere uses a structurally different usage format (`usage.billed_units` and `usage.tokens` with nested objects, plus `cached_tokens`) that neither existing extractor can parse. Braintrust documents Cohere as a supported provider with chat, embeddings, and rerank tracing in other SDKs.

## What is missing

Cohere's v2 Chat API returns a `usage` object with a different structure than OpenAI or Anthropic:

```json
{
  "usage": {
    "billed_units": {
      "input_tokens": 50,
      "output_tokens": 25,
      "search_units": 3,
      "classifications": 0
    },
    "tokens": {
      "input_tokens": 62,
      "output_tokens": 30
    },
    "cached_tokens": 15
  }
}
```

Key differences from OpenAI/Anthropic formats:
1. **Nested structure**: Token counts are inside `usage.tokens` and `usage.billed_units` sub-objects, not flat at `usage` level
2. **Billing-specific fields**: `billed_units.input_tokens` differs from `tokens.input_tokens` because Cohere adds internal tokens users aren't charged for
3. **Search and classification units**: `search_units` and `classifications` track non-token billing dimensions (relevant for RAG and classify endpoints)
4. **Cache field**: `cached_tokens` is a flat integer at the `usage` level, not nested in a details sub-object

Passing a Cohere response through `extract_openai_usage()` would return empty metrics because the function looks for `usage.prompt_tokens` / `usage.completion_tokens` at the top level, which don't exist in Cohere's format.

Cohere also has dedicated **Embed API** and **Rerank API** responses with their own usage structures that would need extraction support.

## Braintrust docs status

**supported** — Braintrust's Cohere integration page documents: "instruments the native Cohere Python SDK so you can inspect prompts, responses, streaming behavior, embeddings, and rerank calls in Braintrust." Other Braintrust SDKs (Python, TypeScript) provide `wrap_cohere()` / `wrapCohere()` that capture token usage from all Cohere API surfaces.

- https://www.braintrust.dev/docs/integrations/ai-providers/cohere — documents Cohere chat, embeddings, and rerank tracing
- https://www.braintrust.dev/docs/instrument/trace-llm-calls — lists Cohere as a supported provider

## Upstream sources

- Cohere v2 Chat API reference (response structure with `usage`): https://docs.cohere.com/v2/reference/chat
- Cohere pricing and billed_units explanation: https://docs.cohere.com/docs/how-does-cohere-pricing-work
- Cohere Embed API: https://docs.cohere.com/v2/reference/embed
- Cohere Rerank API: https://docs.cohere.com/v2/reference/rerank

## Relationship to existing issues

- **Distinct from #34** (Google Gemini usage extraction): #34 covers Gemini's `usageMetadata` with camelCase fields. This covers Cohere's nested `usage.tokens` / `usage.billed_units` structure — a different provider with a different response schema.
- **Distinct from #43** (embeddings instrumentation parity): #43 is about missing embeddings span helpers and types generally. This issue is specifically about the Cohere provider's usage extraction across all its API surfaces (chat, embed, rerank).

## Local files inspected

- `src/extractors.rs` — only `extract_openai_usage()` and `extract_anthropic_usage()` exist; no Cohere extractor
- `src/types.rs` — `UsageMetrics` struct could represent Cohere token data if mapped, but no mapping exists; no fields for `search_units` or `classifications`
- `src/stream.rs` — stream aggregator only parses OpenAI Chat Completions chunk format
- `src/lib.rs` — public API exports; no Cohere references
- Full codebase grep for "cohere", "billed_units", "search_units" — zero results

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[bot] No Cohere usage extraction for chat, embeddings, or rerank responses #49

Summary

What is missing

Braintrust docs status

Upstream sources

Relationship to existing issues

Local files inspected

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

[bot] No Cohere usage extraction for chat, embeddings, or rerank responses #49

Description

Summary

What is missing

Braintrust docs status

Upstream sources

Relationship to existing issues

Local files inspected

Metadata

Metadata

Assignees

Labels

Type

Projects

Milestone

Relationships

Development

Issue actions