feat(ai): add Ollama provider integration with parallel LLM calls by aGallea · Pull Request #35 · aGallea/embedding-clusters

aGallea · 2026-03-31T11:47:25Z

Summary

Replace free-text provider input with a closed dropdown (OpenAI, Google, Anthropic, Ollama)
When Ollama is selected, fetch available models from local instance via POST /api/ai/ollama/models and display as selectable dropdown
Skip API key requirement for Ollama across frontend (Settings, ClusterLegend, ClusterDetailDrawer) and backend (litellm calls)
Normalize base URL by stripping /v1 suffix for Ollama models since litellm's ollama/ provider uses the native API, not the OpenAI-compatible endpoint
Parallelize cluster naming LLM calls using asyncio.gather + thread pool executor for significantly faster naming (N clusters in ~1 call duration instead of N x call duration)

Changed Files

Backend:

ai_naming.py — Add _normalize_base_url(), conditionally pass api_key to litellm
server/models.py — Add OllamaModelsRequest, OllamaModel, OllamaModelsResponse
server/routes/ai.py — Add Ollama models endpoint, parallelize name_clusters and name_sub_clusters

Frontend:

api/ai.ts — Add AI_PROVIDERS constant, AiProvider type, fetchOllamaModels()
pages/SettingsPage.tsx — Provider dropdown, Ollama model dropdown, conditional API key
components/plot/ClusterLegend.tsx — Allow naming without API key for Ollama
components/plot/ClusterDetailDrawer.tsx — Allow sub-cluster naming without API key for Ollama
types/index.ts — Add OllamaModel, OllamaModelsResponse types

Test Plan

All 265 tests pass
mypy + ruff clean
Pre-commit hooks pass
Manual E2E: Select Ollama -> models load -> Name with AI works without API key

- Add provider dropdown (OpenAI, Google, Anthropic, Ollama) replacing free-text input - Fetch and display available Ollama models via POST /api/ai/ollama/models endpoint - Skip API key requirement for Ollama in frontend and backend (litellm) - Normalize base URL by stripping /v1 suffix for Ollama native API compatibility - Parallelize cluster naming LLM calls with asyncio.gather for faster responses

github-actions · 2026-03-31T11:49:28Z

Tests Report 📄

Tests Succees ✅

JUnit Details

Total Tests	Failures	Errors	Skipped	Time ⏳
281	0	0	0	71.83s

Coverage Details (100% >= 90%) ✅

Diff Cover Details

File	Covered Lines
embedding_cluster/ai_naming.py	19/19	100%
embedding_cluster/server/models.py	15/15	100%
embedding_cluster/server/routes/ai.py	86/86	100%
Total	120/120	100%

test(ai): add tests for Ollama integration and parallel LLM calls

63229da

aGallea merged commit dab125a into master Mar 31, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat(ai): add Ollama provider integration with parallel LLM calls#35

feat(ai): add Ollama provider integration with parallel LLM calls#35
aGallea merged 2 commits into
masterfrom
feat/ollama-provider-integration

aGallea commented Mar 31, 2026

Uh oh!

github-actions Bot commented Mar 31, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

aGallea commented Mar 31, 2026

Summary

Changed Files

Test Plan

Uh oh!

github-actions Bot commented Mar 31, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Tests Report 📄

Tests Succees ✅

JUnit Details

Coverage Details (100% >= 90%) ✅

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

github-actions Bot commented Mar 31, 2026 •

edited

Loading