feat: add Avian as LLM provider#494

Open
avianion wants to merge 2 commits into CodebuffAI:main from avianion:feat/add-avian-provider

Conversation


@avianion avianion commented Apr 4, 2026

Summary

  • Add Avian as a new OpenAI-compatible LLM provider (https://api.avian.io/v1)
  • New provider module (web/src/llm-api/avian.ts) following the same pattern as Fireworks/SiliconFlow, with streaming + non-streaming support, per-model pricing, usage/billing tracking, TTFT measurement, and error handling
  • Route avian/* model IDs through the Avian provider in the chat completions API
  • Add AVIAN_API_KEY to server env schema (optional, like other third-party provider keys)
  • Register Avian models in model-config.ts constants, providerModelNames, providerDomains, and all three copies of the ModelName type
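The routing described above hinges on the `isAvianModel()` check named in the test plan. A minimal sketch of that pattern (the function name comes from this PR, but the body and the `toAvianModelName` helper are assumptions about the implementation, not the actual code):

```typescript
// Guard that routes avian/* model IDs to the Avian provider.
const AVIAN_PREFIX = 'avian/'

function isAvianModel(modelId: string): boolean {
  return modelId.startsWith(AVIAN_PREFIX)
}

// OpenAI-compatible upstreams typically expect the bare model name,
// so the provider would strip the routing prefix before forwarding.
function toAvianModelName(modelId: string): string {
  return modelId.slice(AVIAN_PREFIX.length)
}
```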

Models available via Avian

| Model | Context | Input | Output |
| --- | --- | --- | --- |
| avian/deepseek-v3.2 | 164K | $0.14/M | $0.28/M |
| avian/kimi-k2.5 | 128K | $0.14/M | $0.28/M |
| avian/glm-5 | 128K | $0.25/M | $0.50/M |
| avian/minimax-m2.5 | 1M | $0.15/M | $0.30/M |
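For reference, per-model pricing like the table above usually drives billing via a simple lookup. A hedged sketch, assuming a pricing-map shape similar to the other providers (the `AVIAN_PRICING` and `costUsd` names are illustrative, not the PR's identifiers; prices are USD per million tokens):

```typescript
interface ModelPricing {
  inputPerM: number   // USD per 1M input tokens
  outputPerM: number  // USD per 1M output tokens
}

const AVIAN_PRICING: Record<string, ModelPricing> = {
  'avian/deepseek-v3.2': { inputPerM: 0.14, outputPerM: 0.28 },
  'avian/kimi-k2.5':     { inputPerM: 0.14, outputPerM: 0.28 },
  'avian/glm-5':         { inputPerM: 0.25, outputPerM: 0.50 },
  'avian/minimax-m2.5':  { inputPerM: 0.15, outputPerM: 0.30 },
}

function costUsd(model: string, inputTokens: number, outputTokens: number): number {
  const p = AVIAN_PRICING[model]
  if (!p) throw new Error(`Unknown Avian model: ${model}`)
  return (inputTokens * p.inputPerM + outputTokens * p.outputPerM) / 1_000_000
}
```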

Test plan

  • Verify isAvianModel() correctly matches only avian/* model IDs
  • Test streaming and non-streaming requests route to Avian when model starts with avian/
  • Confirm billing/credit consumption works correctly with Avian pricing
  • Verify AvianError is caught and returned with proper status codes
  • Check that non-Avian models still route to their existing providers (no regression)
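For the `AvianError` item, the expected behavior is forwarding the upstream status rather than a blanket 500. A sketch under assumptions (the `AvianError` name comes from this PR; the class shape and the `toHttpResponse` helper are hypothetical):

```typescript
// Error carrying the upstream HTTP status from the Avian API.
class AvianError extends Error {
  constructor(message: string, public readonly status: number) {
    super(message)
    this.name = 'AvianError'
  }
}

// Map a caught error to an HTTP response, preserving Avian's status
// code (e.g. 401, 429) and falling back to 500 for anything else.
function toHttpResponse(err: unknown): { status: number; body: string } {
  if (err instanceof AvianError) {
    return { status: err.status, body: err.message }
  }
  return { status: 500, body: 'Internal error' }
}
```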

Add Avian (api.avian.io) as a new OpenAI-compatible inference provider with
four models: DeepSeek V3.2, Kimi K2.5, GLM-5, and MiniMax M2.5.

Changes:
- New provider module (web/src/llm-api/avian.ts) with streaming and
  non-streaming support, per-model pricing, usage tracking, and billing
- Route avian/* models through the Avian provider in chat completions API
- Add AVIAN_API_KEY to server env schema
- Register avian models in model-config constants and agent type definitions
Contributor

hiSandog commented Apr 9, 2026

One thing I'd double-check before merging: web/src/llm-api/avian.ts currently bills as soon as any streamed chunk contains usage. In this repo, the newer provider implementations in web/src/llm-api/canopywave.ts and web/src/llm-api/siliconflow.ts added a billedAlready + isFinalChunk(...) guard because some providers emit cumulative usage on non-final chunks or can repeat a terminal usage frame.

Without that guard here, Avian could insert duplicate BigQuery rows and consume credits more than once for the same response. I'd be inclined to copy the CanopyWave/SiliconFlow pattern and strip usage from non-final chunks before forwarding them to the SDK.
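To make the suggestion concrete, here is a minimal sketch of that guard, assuming OpenAI-style streamed chunks; `billedAlready` and `isFinalChunk` are the names used in canopywave.ts/siliconflow.ts per the comment, but this standalone `guardUsage` shape is illustrative, not the repo's code:

```typescript
interface StreamChunk {
  choices: { finish_reason: string | null }[]
  usage?: { prompt_tokens: number; completion_tokens: number }
}

// A chunk is final once any choice reports a finish_reason.
function isFinalChunk(chunk: StreamChunk): boolean {
  return chunk.choices.some((c) => c.finish_reason !== null)
}

// Strip usage from non-final chunks and bill exactly once on the
// final chunk, even if the provider repeats a terminal usage frame.
function guardUsage(
  chunks: StreamChunk[],
  bill: (usage: NonNullable<StreamChunk['usage']>) => void,
): StreamChunk[] {
  let billedAlready = false
  return chunks.map((chunk) => {
    if (chunk.usage && !isFinalChunk(chunk)) {
      // Some providers emit cumulative usage mid-stream; drop it so
      // downstream consumers never see a partial total.
      return { ...chunk, usage: undefined }
    }
    if (chunk.usage && isFinalChunk(chunk) && !billedAlready) {
      billedAlready = true
      bill(chunk.usage)
    }
    return chunk
  })
}
```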

Author

avianion commented Apr 9, 2026

Addressed feedback: added billedAlready + isFinalChunk() guard to prevent duplicate BigQuery rows and credit charges. Follows the same pattern used in canopywave.ts and siliconflow.ts — usage is now stripped from non-final chunks and billing only triggers once on the final chunk with finish_reason.

