The AI-Native IDE with a Built-In Provider Control Center.
Connect 169+ AI providers. Route across free & paid models. Zero config.
Quick Start • Providers • Combos • Cost Savings • OmniProxy • Screenshots • Docs
The Promise • Why • What Sets Apart • Security • Architecture • Contributing
One IDE. 169+ providers. Every AI model you need, and a free coding agent that never stops.
OmniCode is an intelligent development environment that centralizes and streamlines every AI interaction directly inside your editor. Stop juggling browser dashboards, dead API keys, and surprise bills. OmniCode brings it all home.
| Never hit limits | $0 to start | Full cost visibility |
|---|---|---|
| Auto-fallback across providers. Quota out? The next model takes over with zero downtime. | Free-tier OAuth providers (Kiro, Qoder, GitHub Copilot, Gemini CLI). No card needed. | Real-time dashboards for token usage, per-provider costs, and model-level spend. |

| Every model works | One unified picker | Production-grade |
|---|---|---|
| 169+ providers (Claude, GPT, Gemini, DeepSeek, Grok, Qwen) through one interface. | All models synced into the native editor picker. No separate UI or fragmented experience. | Semantic caching, combo routing, budget controls, PII guards, encryption at rest. |
Stop juggling 10 dashboards, dead API keys, and surprise bills.
| The daily pain | How OmniCode fixes it |
|---|---|
| Subscription quota expires unused every month | Maximize subscriptions: track quota and use every token before reset |
| Rate limits stop you mid-coding | Auto-fallback combos: route to the next provider in milliseconds |
| Expensive APIs ($20–50/mo per provider) | Free coding agent: route routine tasks to free-tier models at $0 |
| Each AI tool wants its own setup | One IDE, one picker: every model in one unified dropdown |
| API keys scattered across .env files | Encrypted secret storage: AES-256-GCM at rest, never in source control |
| No visibility into what you're spending | Cost & analytics dashboards: per-request, per-model, per-provider breakdowns |
┌──────────────────────────────────────────────────────────┐
│     OmniCode IDE (Native Chat & AI Agent Interface)      │
└────────────────────────────┬─────────────────────────────┘
                             │ Embedded OmniProxy Runtime
                             ▼
┌──────────────────────────────────────────────────────────┐
│             OmniProxy: Smart Router & Cache              │
│    Semantic caching · Combo routing · Budget controls    │
│     PII guards · Encrypted secrets · Cost analytics      │
└────────────────────────────┬─────────────────────────────┘
      ┌───────────────┬──────┴────────┬───────────────┐
      ▼ Tier 1        ▼ Tier 2        ▼ Tier 3        ▼ Tier 4
    SUBSCRIPTION    API KEY         FREE TIER       CUSTOM
    Claude Code,    OpenAI,         Kiro, Qoder,    Local LLMs,
    Codex, Copilot  DeepSeek        Gemini CLI      Ollama
    quota out? ──▶  budget hit? ─▶  always on ───▶  your server
A Combo is a chain of models that OmniCode routes across automatically. When quota runs out, a provider fails, or costs spike, the combo silently falls through to the next model. This is what keeps OmniCode working when an individual provider stops.
Configure fallback chains in the OmniProxy dashboard. OmniCode evaluates your rules in real time:
Combo: "always-on-coding" Strategy: priority
1. cc/claude-opus-4-7 → OAuth subscription (use it fully)
2. cx/gpt-5.5 → Codex OAuth (second subscription)
3. kr/claude-sonnet-4.6 → Kiro FREE (unlimited Claude)
4. qoder/deepseek-v3.2 → Qoder FREE (always available)
Result: 4 layers of fallback = zero downtime
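The priority strategy above can be sketched as a loop that tries each model in order and moves to the next one when a call fails. This is a minimal illustration under stated assumptions, not OmniProxy's actual router: `ComboStep`, `runCombo`, and the `send` callback are hypothetical names introduced here.

```typescript
// Hypothetical types for illustration; not OmniProxy's real API.
interface ComboStep {
  model: string; // e.g. "cc/claude-opus-4-7"
}

// Caller-supplied function that performs the actual provider call.
type Send = (model: string, prompt: string) => Promise<string>;

interface ComboResult {
  model: string;    // the step that finally answered
  output: string;
  failed: string[]; // steps skipped because the call threw
}

// Try each step in priority order; fall through to the next step on any
// error (quota exhausted, rate limit, provider outage).
async function runCombo(
  steps: ComboStep[],
  prompt: string,
  send: Send,
): Promise<ComboResult> {
  const failed: string[] = [];
  for (const step of steps) {
    try {
      const output = await send(step.model, prompt);
      return { model: step.model, output, failed };
    } catch {
      failed.push(step.model);
    }
  }
  throw new Error(`All ${steps.length} combo steps failed: ${failed.join(", ")}`);
}
```

With the four-step chain shown above, a request would only reach `qoder/deepseek-v3.2` after the first three calls had each thrown.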
| Goal | Strategy |
|---|---|
| Drain my subscription before paying | Priority / Fill-first |
| Spread load across accounts | Round-robin |
| Always cheapest viable model | Cost-optimized |
| Best model for complex reasoning | Premium escalation |
| Free-first for routine tasks | Free agent routing |
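As a rough picture of the round-robin row, the selector below hands out accounts in rotation so that consecutive requests land on different keys. The `RoundRobin` class is a hypothetical sketch, not a class from the OmniProxy codebase.

```typescript
// Hypothetical round-robin selector over connected accounts or keys.
class RoundRobin<T> {
  private next = 0;

  constructor(private readonly items: readonly T[]) {
    if (items.length === 0) {
      throw new Error("RoundRobin needs at least one item");
    }
  }

  // Return the current item and advance, wrapping at the end of the list.
  pick(): T {
    const item = this.items[this.next];
    this.next = (this.next + 1) % this.items.length;
    return item;
  }
}
```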
| Layer | Scope | What it does |
|---|---|---|
| Circuit breaker | Whole provider | Stops hammering a failing provider; auto-probes to recover |
| Connection cooldown | One account/key | Skips a rate-limited key while others keep serving |
| Semantic cache | Identical prompts | Returns cached response instantly (cost: $0, latency: ~0ms) |
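The circuit-breaker layer follows a well-known pattern: after repeated failures the provider is skipped entirely, and after a cooldown a probe request is let through to test recovery. The sketch below shows that pattern in general terms; the class name, thresholds, and method names are assumptions, not OmniProxy's implementation.

```typescript
// Hypothetical circuit breaker; names and thresholds are illustrative only.
type BreakerState = "closed" | "open" | "half-open";

class CircuitBreaker {
  private failures = 0;
  private openedAt = 0;
  private state: BreakerState = "closed";

  constructor(
    private readonly failureThreshold: number, // consecutive failures before opening
    private readonly cooldownMs: number,       // wait before probing again
    private readonly now: () => number = () => Date.now(),
  ) {}

  // Should the router send a request to this provider right now?
  canRequest(): boolean {
    if (this.state === "open" && this.now() - this.openedAt >= this.cooldownMs) {
      // Cooldown elapsed: let probe requests through until a result is recorded.
      this.state = "half-open";
    }
    return this.state !== "open";
  }

  recordSuccess(): void {
    this.failures = 0;
    this.state = "closed";
  }

  recordFailure(): void {
    this.failures += 1;
    // A failed probe reopens immediately; otherwise open at the threshold.
    if (this.state === "half-open" || this.failures >= this.failureThreshold) {
      this.state = "open";
      this.openedAt = this.now();
    }
  }

  getState(): BreakerState {
    return this.state;
  }
}
```

Injecting `now` keeps the class testable with a fake clock; a per-key cooldown could reuse the same idea with one breaker per account.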
| Feature | OmniCode | Traditional IDEs | Other AI Tools |
|---|---|---|---|
| Built-in AI providers | 169+ (OAuth + API key) | 1–2 | External only |
| Free-tier routing | ✅ Automatic | ❌ | ❌ |
| Multi-model combos | ✅ Configurable chains | ❌ | Rare |
| Cost & usage dashboards | ✅ Native | ❌ | Separate tool |
| Semantic caching | ✅ Built-in | ❌ | Rare |
| Unified model picker | ✅ All models, one dropdown | 1 provider | N/A |
| Custom endpoints | ✅ Any OpenAI-compatible | ❌ | Limited |
| PII & injection guards | ✅ Built-in | ❌ | Rare |
| Encrypted secret storage | ✅ AES-256-GCM | Plaintext .env | Varies |
| Full IDE experience | ✅ VS Code-based editor | ✅ | ❌ CLI only |
| Batch testing | ✅ Cross-model evaluation | ❌ | ❌ |
| Image generation | ✅ Built-in media tools | ❌ | Separate |
The most comprehensive provider ecosystem of any AI-native IDE. Connect via OAuth (one-click) or API key, and all models appear instantly in the native editor picker.
| Provider | Auth Flow | Models | Status |
|---|---|---|---|
| Claude (Anthropic) | Authorization Code + PKCE | Opus 4.7, Sonnet 4.6, Haiku 4.5 | ✅ |
| Codex (OpenAI) | Authorization Code + PKCE | GPT-5.5, GPT-5.4, GPT-5.3 Codex | ✅ |
| Gemini (Google) | Standard OAuth2 | Synced via API (all models) | ✅ |
| Gemini CLI | Standard OAuth2 | Gemini 3.1 Pro, 3 Flash, 3.1 Flash Lite | ✅ |
| GitHub Copilot | Device Code Flow | GPT-5.5, Claude Opus 4.7, Gemini 3.1 Pro | ✅ |
| Qwen (Alibaba) | Device Code + PKCE | Qwen3 Coder Plus/Flash, Vision | ✅ |
| Kimi Coding (Moonshot) | Device Code Flow | Kimi K2.6, K2.6 Thinking | ✅ |
| Antigravity (Google Cloud) | Standard OAuth2 | Passthrough models | ✅ |
| Kiro (AWS) | SSO OIDC / Social Login | Claude Opus 4.7, Sonnet 4.6, Haiku 4.5 | ✅ |
| Cursor | Token Import | 70+ models (GPT, Claude, Gemini, Grok) | ✅ |
| Cline | Local Callback Flow | Claude, GPT, Gemini, DeepSeek, Kimi | ✅ |
| KiloCode | Custom Device Auth | OpenRouter models, GPT, Claude, Gemini | ✅ |
| GitLab Duo | Authorization Code + PKCE | Enterprise models | ✅ |
| Amazon Q | AWS Builder ID | AWS models | ✅ |
| Qoder AI | API Key / OAuth | Qwen3, Kimi K2, DeepSeek, ROME | ✅ |
| Provider | Highlights |
|---|---|
| OpenAI | GPT-5.5, GPT-5.4, GPT-5.4 Mini/Nano, O3 |
| Anthropic | Claude Opus 4.7, Sonnet 4.6, Haiku 4.5 |
| DeepSeek | V4 Pro, V4 Flash |
| Groq | Llama 3.3 70B, Llama 4 Maverick, Qwen3 32B |
| xAI | Grok 4.3, Grok 4.20 (Reasoning, Multi Agent) |
| Mistral | Large 3, Medium 3.5, Small 4, Devstral 2 |
| Together | Llama 3.3 70B (free), DeepSeek R1 (free), Qwen3 235B |
| Fireworks | Kimi K2.6, MiniMax M2.7, Qwen3.6 Plus |
| Cerebras | GLM 4.7, GPT OSS 120B |
| Perplexity | Sonar Deep Research, Reasoning Pro, Pro |
| Cohere | Command A Reasoning, Vision |
| NVIDIA NIM | GLM 5.1, Gemma 4, Nemotron 3 Super, DeepSeek V4 |
| SiliconFlow | DeepSeek V3.2, Qwen3 Coder 480B |
| OpenRouter | Auto (Best Available), with access to 200+ models |
| HuggingFace | Open-source model catalog |
| Ollama Cloud | DeepSeek V4, Kimi K2.6, GLM 5.1 |
| MiniMax | M2.7, M2.5, Highspeed variants |
| GLM / ZAI | GLM 5.1, GLM 5, GLM 5 Turbo |
| Provider | What's Free | Limits |
|---|---|---|
| Kiro AI | Claude Opus 4.7, Sonnet 4.6, Haiku 4.5 | OAuth, generous limits |
| Qoder AI | Qwen3, DeepSeek, Kimi K2 | Unlimited for supported models |
| Gemini CLI | Gemini 3.1 Pro, 3 Flash | 180K requests/mo free |
| GitHub Copilot | GPT-5 Mini, Claude Haiku 4.5 | Free tier available |
| Together AI | Llama 3.3 70B, DeepSeek R1 Distill, Llama Vision | Free tier models |
| NVIDIA NIM | 13+ models | ~40 RPM free |
| Cerebras | GLM 4.7, GPT OSS 120B | Free tier with limits |
| OpenCode | GLM 5.1, Kimi K2.6, DeepSeek V4, MiMo V2.5 | Free tier available |
Integrate any OpenAI-compatible endpoint in 3 steps:
- Provide a group name
- Supply the API key
- Enter the Base URL
OmniCode automatically queries the /models endpoint and populates available models into the unified picker.
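The discovery step can be sketched as a single authenticated GET against `{baseUrl}/models`. The sketch assumes the endpoint returns the usual OpenAI-style list shape (`{ data: [{ id }] }`); the `CustomProvider` type, the `discoverModels` function, and the `group/modelId` naming are hypothetical, and the HTTP call is injected so the function can run against a stub.

```typescript
// Minimal shapes for the parts of the fetch API this sketch relies on,
// so it can be exercised with a stub instead of a live server.
interface HttpResponse {
  ok: boolean;
  status: number;
  json(): Promise<unknown>;
}
type HttpGet = (
  url: string,
  init: { headers: Record<string, string> },
) => Promise<HttpResponse>;

// Hypothetical record of the three values entered in the steps above.
interface CustomProvider {
  group: string;   // step 1: group name
  apiKey: string;  // step 2: API key
  baseUrl: string; // step 3: base URL, e.g. "https://example.test/v1"
}

// Query GET {baseUrl}/models and return picker entries as "group/modelId".
async function discoverModels(p: CustomProvider, httpGet: HttpGet): Promise<string[]> {
  const url = `${p.baseUrl.replace(/\/+$/, "")}/models`; // tolerate trailing slash
  const res = await httpGet(url, {
    headers: { Authorization: `Bearer ${p.apiKey}` },
  });
  if (!res.ok) {
    throw new Error(`Model discovery failed for ${p.group}: HTTP ${res.status}`);
  }
  const body = (await res.json()) as { data?: Array<{ id?: unknown }> };
  return (body.data ?? [])
    .map((m) => m.id)
    .filter((id): id is string => typeof id === "string") // drop malformed entries
    .map((id) => `${p.group}/${id}`);
}
```

On Node.js 22 the global `fetch` should be usable as the `httpGet` argument, since its response object exposes `ok`, `status`, and `json()`.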
OmniCode is built to dramatically reduce API spend through intelligent request handling:
flowchart LR
A[Incoming Request] --> B{Is it Cached?}
B -- Yes --> C["Return Cached Response<br/><b>Cost: $0</b>"]
B -- No --> D{Requires Premium AI?}
D -- No --> E["Route to Free Agent/Tier<br/><b>Cost: $0</b>"]
D -- Yes --> F["Route to Premium Model<br/><b>Cost: Standard API Rate</b>"]
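The flowchart's two decisions can be written as one small function. This is an illustrative sketch only: an exact-match `Map` stands in for the semantic cache, and `needsPremium` is a hypothetical caller-supplied predicate, since the source does not say how OmniProxy classifies a request as premium.

```typescript
// Outcome of the routing decision, mirroring the three leaves of the flowchart.
type Route =
  | { kind: "cache"; response: string; cost: 0 }
  | { kind: "free"; cost: 0 }
  | { kind: "premium" }; // billed at the provider's standard API rate

function routeRequest(
  prompt: string,
  cache: Map<string, string>,               // stand-in for the semantic cache
  needsPremium: (prompt: string) => boolean, // hypothetical classifier
): Route {
  // 1. Cached? Return the stored response at no cost.
  const cached = cache.get(prompt);
  if (cached !== undefined) {
    return { kind: "cache", response: cached, cost: 0 };
  }
  // 2. Otherwise choose between the free tier and a premium model.
  return needsPremium(prompt) ? { kind: "premium" } : { kind: "free", cost: 0 };
}
```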
| Strategy | Savings | How it works |
|---|---|---|
| Semantic Caching | 100% on cache hits | Duplicate or semantically identical prompts served instantly from local cache |
| Free Agent Routing | 100% on routine tasks | Standard coding queries routed to free-tier models or local endpoints |
| Strategic Fallbacks | 50–90% | Premium models reserved for complex reasoning; routine tasks use cheap providers |
| Quota Maximization | Subscription ROI ↑ | Drain your paid subscriptions fully before falling back to API-key providers |
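One common way to serve "semantically identical" prompts from a cache is to compare embedding vectors and treat anything above a similarity threshold as a hit. The sketch below shows that general technique; the source does not describe OmniProxy's actual matching method, so the `SemanticCache` class, the 0.95 threshold, and the caller-supplied embeddings are all assumptions.

```typescript
// Hypothetical semantic cache: entries are matched by embedding similarity
// rather than exact text. Embeddings are supplied by the caller.
interface CacheEntry {
  embedding: number[];
  response: string;
}

function cosineSimilarity(a: number[], b: number[]): number {
  if (a.length !== b.length || a.length === 0) return 0;
  let dot = 0;
  let normA = 0;
  let normB = 0;
  for (let i = 0; i < a.length; i++) {
    dot += a[i] * b[i];
    normA += a[i] * a[i];
    normB += b[i] * b[i];
  }
  const denom = Math.sqrt(normA) * Math.sqrt(normB);
  return denom === 0 ? 0 : dot / denom;
}

class SemanticCache {
  private entries: CacheEntry[] = [];

  constructor(private readonly threshold = 0.95) {} // assumed cutoff

  put(embedding: number[], response: string): void {
    this.entries.push({ embedding, response });
  }

  // Return the best match at or above the threshold, or undefined on a miss.
  get(embedding: number[]): string | undefined {
    let best: CacheEntry | undefined;
    let bestScore = this.threshold;
    for (const entry of this.entries) {
      const score = cosineSimilarity(embedding, entry.embedding);
      if (score >= bestScore) {
        best = entry;
        bestScore = score;
      }
    }
    return best?.response;
  }
}
```

A linear scan is fine for a sketch; a cache with many entries would more likely use a vector index.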
| Scenario | Without OmniCode | With OmniCode | Savings |
|---|---|---|---|
| 100 routine code completions/day | ~$2.50/day (paid API) | $0 (free agent) | ~$75/mo |
| 50 repeated similar queries/day | ~$1.25/day (paid API) | $0 (semantic cache) | ~$37.50/mo |
| Subscription quota overflow | New API charges | Auto-fallback to free | Variable |
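The monthly figures in the table appear to follow from multiplying the daily difference by a 30-day month. The source does not state the month length, so `DAYS_PER_MONTH` below is an assumption, as is the helper name.

```typescript
const DAYS_PER_MONTH = 30; // assumption: the table does not state a month length

// Monthly savings = (daily cost without OmniCode - daily cost with it) * days.
function monthlySavings(dailyCostWithout: number, dailyCostWith: number): number {
  return (dailyCostWithout - dailyCostWith) * DAYS_PER_MONTH;
}

const completionSavings = monthlySavings(2.5, 0); // 75
const repeatedSavings = monthlySavings(1.25, 0);  // 37.5
```

Both paid-API rows also imply the same unit price: $2.50 for 100 requests and $1.25 for 50 requests each come to $0.025 per request.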
A comprehensive, built-in management workspace embedded directly in the editor. No browser dashboards, no external tools: everything is native.
| Section | Description |
|---|---|
| Home | Global runtime status, provider overview, model sync state, and proxy health |
| Providers | Connect, test, and manage integrations for 169+ AI providers with live status cards |
| Combos | Configure multi-model routing chains and automated fallback strategies |
| Batch Testing | Evaluate and test prompts across multiple models simultaneously |
| Costs | Detailed tracking of token usage and financial spend per provider and model |
| Analytics | Deep dive into request patterns, latency metrics, and usage trends |
| Cache | Advanced semantic and prompt cache controls for optimized performance |
| Limits & Quotas | Set rate limits, monitor quotas, and enforce budget controls |
| Media | Manage image generation and rich media assets |
OmniCode's architecture is designed for speed, security, and seamless integration:
flowchart TD
User([User Prompt]) --> ChatUI[Native Chat Interface]
ChatUI --> OmniProxy{OmniProxy Router}
OmniProxy -->|Check Cache| Cache[(Semantic Cache)]
Cache -- Cache Hit --> ChatUI
OmniProxy -- Cache Miss --> Evaluator{Combo / Fallback Rules}
Evaluator -->|Free Tier| Free[Free Agent / Models]
Evaluator -->|Premium Tier| Premium[Premium API Providers]
Evaluator -->|Local/Custom| Custom[Custom Endpoints]
Free --> ChatUI
Premium --> ChatUI
Custom --> ChatUI
- User Interaction: You submit a prompt via the native Chat interface or AI agent in the editor.
- Model Selection: The editor retrieves available models from `chatLanguageModels.json`, continuously synced by the OmniProxy extension.
- Smart Routing: The request passes through the OmniRoute Extension Bridge, which evaluates active routing rules, combos, and fallbacks.
- Provider Execution: The Embedded Runtime securely authenticates using locally stored credentials (never committed to code) and dispatches to the appropriate provider.
- Response & Caching: The response streams back in real-time, with optional semantic caching applied to reduce costs and latency on future requests.
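To make the streaming step concrete, the sketch below pulls text deltas out of a Server-Sent Events payload. It assumes the OpenAI-style streaming convention (one JSON object per `data:` line, text at `choices[0].delta.content`, and a final `data: [DONE]`); whether OmniProxy normalizes every provider to this shape is not stated in the source, and `parseSseDeltas` is a hypothetical helper.

```typescript
// Extract text deltas from an SSE payload in the OpenAI-style streaming format.
function parseSseDeltas(raw: string): { text: string; done: boolean } {
  let text = "";
  let done = false;
  for (const line of raw.split(/\r?\n/)) {
    // Skip blank separators, comments (": ..."), and other SSE fields.
    if (!line.startsWith("data:")) continue;
    const payload = line.slice("data:".length).trim();
    if (payload === "[DONE]") {
      done = true;
      break;
    }
    try {
      const event = JSON.parse(payload) as {
        choices?: Array<{ delta?: { content?: string } }>;
      };
      text += event.choices?.[0]?.delta?.content ?? "";
    } catch {
      // Ignore malformed or partial JSON; a real client would buffer
      // incomplete lines until the rest of the chunk arrives.
    }
  }
  return { text, done };
}
```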
| Area | Path | Description |
|---|---|---|
| Product metadata | `product.json` | Application identifiers, naming, and configuration |
| Resources | `resources/` | Icon assets for macOS, Windows, Linux, and web |
| OmniProxy Dashboard | `src/vs/workbench/contrib/chat/browser/omniProxyManagement/` | Native management dashboard source code |
| OmniProxy Runtime | `omniproxy-runtime/` | Embedded Node.js backend for AI routing |
| Provider Registry | `omniproxy-runtime/open-sse/config/providerRegistry.ts` | Single source of truth for all 169+ providers |
| Extension Bridge | `extensions/omniroute/` | Connects workbench UI to OmniProxy runtime |
| Model Discovery | `src/vs/workbench/contrib/chat/common/languageModels.ts` | Core logic for model discovery and endpoint integration |
Tailor your AI assistant to your exact workflow. Through Combos configuration in OmniProxy, you define:
- Custom system prompts: personalize the AI's behavior and tone
- Specialized context boundaries: scope the AI to your project's architecture
- Preferred model fallbacks: set your ideal model chain
- Hot-swap the brain: switch from local open-source to state-of-the-art paid models without changing your editing habits
The result: a deeply personalized coding agent that adapts to your coding style, no matter which backend AI provider is currently active.
The dashboard provides an at-a-glance view of your runtime status, connected providers, synced models, and proxy configuration.
| Requirement | Version |
|---|---|
| macOS / Linux / Windows | Latest stable |
| Node.js | 22.x LTS (recommended) |
| npm | 10.x |
git clone https://github.com/cyborgateuk-arch/OmniCode.git
cd OmniCode
npm install

npm run gulp compile
node build/next/index.ts bundle --out out --target desktop

# macOS
open -na '.build/electron/OmniCode.app' --args '.'
# Linux
.build/electron/OmniCode --no-sandbox .
# Windows
.build\electron\OmniCode.exe .

The OmniProxy runtime handles all AI requests locally and securely. On first run:
# Navigate to the runtime directory
cd omniproxy-runtime
# Copy the example environment file
cp .env.example .env
# Generate security secrets
openssl rand -base64 48 # JWT secret
openssl rand -hex 32 # API key encryption secret

Enter your provider API keys or OAuth credentials in the `.env` file. The runtime automatically initializes when you open the OmniProxy dashboard in the editor.
Open the OmniProxy Dashboard → Providers → Connect Kiro AI (free Claude) or Qoder AI (free Qwen/DeepSeek) → done.
All connected models appear instantly in the native model picker. Select a model and start chatting.
Your keys, your machine, your data. OmniCode is designed with a security-first architecture.
| 100% local | Encrypted at rest |
|---|---|
| Runs entirely on your hardware. No OmniCode cloud in the request path. | API keys & OAuth tokens sealed with AES-256-GCM. |

| Zero telemetry by default | Hardened gateway |
|---|---|
| Your prompts go only to the providers you choose, nowhere else. | PII sanitization, prompt-injection guards, loopback-only routes. |

| MIT licensed | No secrets in source |
|---|---|
| Fully open source: audit every line, self-host forever. | API keys, OAuth tokens, and secrets strictly excluded from version control. |
Security highlights:
- Sensitive values managed at runtime via secure secret storage or a local `.env` file
- `.env` and generated artifacts (logs, databases) strictly ignored by version control
- PII sanitization and prompt injection guards built into the proxy layer
- Database encryption at rest supported
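As a rough idea of what a PII sanitization pass in a proxy layer can look like, the sketch below replaces matches of a few patterns with labeled placeholders before a prompt leaves the machine. The rules and the `redactPii` name are hypothetical and deliberately narrow; they are not OmniProxy's actual guards, and real PII detection needs far broader coverage.

```typescript
// Hypothetical redaction rules for illustration only.
const REDACTION_RULES: Array<{ label: string; pattern: RegExp }> = [
  // Simple email shape: local part, "@", domain with a dotted TLD.
  { label: "EMAIL", pattern: /[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Za-z]{2,}/g },
  // Tokens that look like "sk-" style API keys (assumed format).
  { label: "API_KEY", pattern: /\bsk-[A-Za-z0-9_-]{16,}\b/g },
];

// Replace every match with "[LABEL]" and count how many were redacted.
function redactPii(prompt: string): { text: string; redactions: number } {
  let redactions = 0;
  let text = prompt;
  for (const { label, pattern } of REDACTION_RULES) {
    text = text.replace(pattern, () => {
      redactions += 1;
      return `[${label}]`;
    });
  }
  return { text, redactions };
}
```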
See SECURITY.md for the full security policy.
| Platform | Install | Highlights |
|---|---|---|
| macOS | `npm install && npm run gulp compile` | Native .app bundle with system tray |
| Linux | Same build pipeline | AppImage / deb / rpm packages |
| Windows | Same build pipeline | Native .exe with installer |
| Docker | `docker build` | Headless server mode for remote development |
| From source | `git clone && npm install` | Full development setup |
| Component | Technology |
|---|---|
| Runtime | Node.js 22.x LTS |
| Language | TypeScript 6.0, with 100% TypeScript across src/ and omniproxy-runtime/ |
| Framework | Electron 39.x + VS Code workbench architecture |
| OmniProxy Runtime | Next.js 16 + React 19 + Tailwind CSS 4 |
| Database | better-sqlite3 (SQLite) + LowDB (JSON) for routing decisions, cache, and analytics |
| Auth | OAuth 2.0 (PKCE, Device Code, SSO OIDC) + API Keys + Encrypted Secret Storage |
| Streaming | Server-Sent Events (SSE) for real-time model responses |
| Security | AES-256-GCM encryption, PII sanitization, prompt injection guards |
| Testing | Mocha + Playwright (unit, integration, E2E) |
| CI/CD | GitHub Actions (automated builds and releases) |
| License | MIT |
| Document | Description |
|---|---|
| Full OmniCode Documentation | Architecture, detailed flows, and build instructions |
| Quick Start | 6-step install → connect → code |
| Copilot Instructions | Project architecture, coding guidelines, and validation steps |
| Document | Description |
|---|---|
| Security Policy | Vulnerability reporting and security practices |
| Contributing Guide | Development setup and contribution guidelines |
| License | MIT License |
| File | Purpose |
|---|---|
| `product.json` | Product identifiers, naming, and application metadata |
| `omniproxy-runtime/.env.example` | Complete environment variable reference |
| `omniproxy-runtime/open-sse/config/providerRegistry.ts` | Single source of truth for all provider configurations |
| `src/vs/workbench/contrib/chat/browser/omniProxyManagement/` | OmniProxy dashboard implementation |
| `extensions/omniroute/` | Extension bridge connecting UI to runtime |
We welcome contributions! Whether it's bug reports, feature requests, or pull requests, every contribution matters.
- Fork the repository
- Create your feature branch (`git checkout -b feature/amazing-feature`)
- Commit your changes (`git commit -m 'Add amazing feature'`)
- Push to the branch (`git push origin feature/amazing-feature`)
- Open a Pull Request
# Clone and install
git clone https://github.com/cyborgateuk-arch/OmniCode.git
cd OmniCode
npm install
# Start development watch mode
npm run watch
# Run tests
npm run test-node
# Type-check
npm run compile-check-ts-native

See CONTRIBUTING.md for detailed guidelines.
- AI-native IDE: not just a plugin, but a full editor with AI built in
- Multi-provider routing: the only IDE that routes across 169+ AI providers
- Free coding agent: $0 to start, with free-tier providers and semantic caching
- Built-in dashboard: no external tools needed for provider management
- Open source: MIT licensed, fully auditable, community-driven
MIT License: free and open source. The only condition is that copies keep the copyright and license notice.
Built with ❤️ for developers who demand more from their AI tools.
Star us on GitHub • Report a Bug • Discussions






