OpenScout is an open-source Agentic Retrieval-Augmented Generation (RAG) system built using the LangGraph framework. It intelligently routes user queries between a local vectorstore and real-time web search, ensuring optimal relevance, freshness, and efficiency.
OpenScout leverages open-source LLMs for both routing and generation, enabling a transparent, controllable, and vendor-neutral AI stack suitable for production and research use cases.
-
Agentic Query Routing with LangGraph Deterministic, graph-based agent orchestration using LangGraph for explicit and debuggable control flow.
-
Dual-Model Architecture
- Router Model:
qwen3:8bfor lightweight, fast decision-making - Generator Model:
gpt-oss:20bfor high-quality response generation
- Router Model:
-
Local-First RAG Strategy Queries are routed to the local vectorstore whenever possible, preserving privacy and reducing latency.
-
Real-Time Web Search via Tavily Automatically invoked for queries requiring fresh or external information.
-
Open Embedding Stack Uses
embeddinggemma:300mfor efficient and high-quality semantic embeddings. -
Modular and Extensible Each component i.e., routing, retrieval, search, and generation is independently swappable.
OpenScout is implemented as a LangGraph-powered agentic workflow, where each node represents a discrete reasoning or retrieval step.
-
User submits a query
-
Router agent (
qwen3:8b) evaluates intent and freshness requirements -
Query is routed to:
- Local vectorstore retrieval, or
- Tavily-powered live web search
-
Retrieved context is normalized and aggregated
-
Generator model (
gpt-oss:20b) produces the final answer
LangGraph ensures:
- Explicit state transitions
- Deterministic routing logic
- Full traceability and debugging support
- LangGraph: Graph-based agent orchestration and control flow
- Router:
qwen3:8b - Generator:
gpt-oss:20b
- embeddinggemma:300m
- SKLearnVectorStore
- Tavily API (real-time, LLM-optimized search)
- LLM as a Judge implemetation to check for Hallusinations and Answer correctness
Internal Knowledge Query
"What is CoT prompting?"
→ Routed to vectorstore
Fresh / External Query
"What are the latest developments in the EU AI Act?"
→ Routed to Tavily web search
-
Powered by
qwen3:8b -
Classifies:
- Knowledge domain (internal vs external)
- Freshness requirements
- Confidence threshold for local retrieval
- Vectorstore retriever
- Tavily search retriever
gpt-oss:20b- Receives structured context and provenance metadata
- Enterprise internal knowledge assistants
- Research copilots with real-time awareness
- Privacy-sensitive RAG systems
- Local-first AI assistants with web fallback
- Developer documentation bots