Python: Add agent-framework-azure-cosmos-memory context provider #6719

Open
TheovanKraay wants to merge 4 commits into microsoft:main from TheovanKraay:feature/cosmos-memory-context-provider
Conversation

@TheovanKraay TheovanKraay commented Jun 24, 2026

Motivation & Context

Right now there's no first-party way to give agents long-term, per-user memory backed by Azure Cosmos DB. This PR adds an optional package, agent-framework-azure-cosmos-memory, that plugs Cosmos-backed memory into the standard ContextProvider extension point. On each run it pulls the user's relevant past memories and running summary into the agent's context, then saves new memories after the run. So you get the "agent that remembers the user across sessions" scenario without coupling core to Cosmos.

It builds on the azure-cosmos-agent-memory toolkit for storage, embeddings, and reconciliation, and installs as a standalone, opt-in package.

Heads up: this is a draft for you to review the approach and packaging before I polish it further. The emulator-based integration tests depend on an upstream toolkit change, so they're not turned on in CI yet.

Description & Review Guide

  • What are the major changes?

    • New package python/packages/azure-cosmos-memory/ with CosmosMemoryContextProvider, implementing the framework's ContextProvider contract.
    • before_run retrieves per-user memories (semantic search) plus a user summary, injecting memories as context messages and the summary as instructions. Search and summary failures are handled separately so one doesn't block the other, and it warns once if there's no user_id.
    • after_run persists new memories from the turn.
    • README, AGENTS.md, two runnable samples (basic_usage.py, interactive_chat.py), 32 unit tests, and an integration test scaffold.
    • The package is excluded from the shared uv workspace (see below for why).
  • What is the impact of these changes?

    • Additive only. No changes to existing packages or public APIs.
    • The package is excluded from the uv workspace in python/pyproject.toml via [tool.uv.workspace] exclude. That's needed because its dependency azure-cosmos-agent-memory requires Python >=3.11 and a prompty prerelease (>=2.0.0a9), both of which are unsatisfiable against the monorepo's >=3.10 floor and if-necessary-or-explicit prerelease policy. Excluding it keeps root uv sync resolvable and leaves the committed uv.lock untouched, while the package is built and tested standalone.
    • The trade-off: it won't be built or tested by the monorepo CI matrix. If you'd rather have it as a full workspace member, that would mean raising the workspace floor to 3.11 and enabling the prompty prerelease. Happy to go whichever way you prefer.
  • What do you want reviewers to focus on?

    • The exclude decision in python/pyproject.toml, and whether you want this standalone or as a workspace member.
    • The ContextProvider integration semantics (how memories vs. summary are injected, and the independent failure handling).
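
For reviewers weighing the integration semantics listed above, here is a minimal sketch of the described `before_run` behavior: memories go in as context messages, the summary goes in as instructions, search and summary failures are handled independently, and a missing `user_id` triggers a single warning. Everything in it (`SketchProvider`, `SketchContext`, `FlakyClient`, the `search`/`get_summary` method names, and the `"default"` fallback for `user_id`) is hypothetical and chosen for illustration; it is not the PR's actual code or the toolkit's API.

```python
import asyncio
import logging
from dataclasses import dataclass, field

logger = logging.getLogger("cosmos_memory_sketch")


@dataclass
class SketchContext:
    """Hypothetical stand-in for the framework's invocation context."""

    context_messages: list[str] = field(default_factory=list)
    instructions: list[str] = field(default_factory=list)


class SketchProvider:
    """Illustrative only; not the PR's CosmosMemoryContextProvider."""

    def __init__(self, client) -> None:
        self._client = client
        self._warned_missing_user = False

    def _resolve_user_id(self, state: dict) -> str:
        user_id = state.get("user_id")
        if user_id:
            return user_id
        # Warn only on the first run without a user_id (assumed fallback value).
        if not self._warned_missing_user:
            logger.warning("No user_id supplied; falling back to 'default'.")
            self._warned_missing_user = True
        return "default"

    async def before_run(self, context: SketchContext, state: dict, query: str) -> None:
        user_id = self._resolve_user_id(state)

        # Search and summary are guarded separately, so a failure in one
        # does not prevent the other from contributing context.
        try:
            memories = await self._client.search(user_id, query)
            context.context_messages.extend(memories)
        except Exception:
            logger.exception("Memory search failed; continuing without memories.")

        try:
            summary = await self._client.get_summary(user_id)
            if summary:
                context.instructions.append(f"User summary: {summary}")
        except Exception:
            logger.exception("Summary lookup failed; continuing without a summary.")


class FlakyClient:
    """Fake client whose search always fails but whose summary works."""

    async def search(self, user_id: str, query: str) -> list[str]:
        raise RuntimeError("search backend unavailable")

    async def get_summary(self, user_id: str) -> str:
        return "Prefers concise answers."


async def demo() -> tuple[SketchProvider, SketchContext]:
    provider = SketchProvider(FlakyClient())
    ctx = SketchContext()
    await provider.before_run(ctx, state={}, query="What did we discuss?")
    await provider.before_run(ctx, state={}, query="And again?")
    return provider, ctx


provider, result = asyncio.run(demo())
```

With the failing search, the summary still lands in `result.instructions` on both runs while `result.context_messages` stays empty, which is the independence property this review item asks about.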

Related Issue

Fixes #

Contribution Checklist

  • The code builds clean without any errors or warnings
  • All unit tests pass, and I have added new tests where possible
  • The PR follows the Contribution Guidelines
  • This PR is linked to an issue and there is no other open PR for this issue (see Related Issue above).
  • This is not a breaking change.

Copilot AI review requested due to automatic review settings June 24, 2026 20:06
@moonbox3 added the documentation and python labels Jun 24, 2026
@github-actions github-actions Bot changed the title Add agent-framework-azure-cosmos-memory context provider Python: Add agent-framework-azure-cosmos-memory context provider Jun 24, 2026

Copilot AI left a comment

Pull request overview

Adds a new Python integration package, agent-framework-azure-cosmos-memory, introducing a CosmosMemoryContextProvider that persists and recalls long-term memories via Azure Cosmos DB using the azure-cosmos-agent-memory toolkit (including user-summary injection and retrieval-time context augmentation).

Changes:

  • Introduces CosmosMemoryContextProvider (async context manager + before_run/after_run hooks) plus package exports.
  • Adds documentation and samples demonstrating basic usage and an interactive Foundry-backed chat experience.
  • Adds unit tests (mocked client) and live-Azure integration tests (pytest markers).

Reviewed changes

Copilot reviewed 11 out of 11 changed files in this pull request and generated 8 comments.

Show a summary per file
| File | Description |
| --- | --- |
| python/packages/azure-cosmos-memory/agent_framework_azure_cosmos_memory/_context_provider.py | Implements the Cosmos-backed memory context provider (retrieval + storage + user-summary injection + flush). |
| python/packages/azure-cosmos-memory/agent_framework_azure_cosmos_memory/__init__.py | Exports CosmosMemoryContextProvider and package version. |
| python/packages/azure-cosmos-memory/pyproject.toml | Defines the new package, dependencies, and pytest/tooling configuration (Python >=3.11). |
| python/packages/azure-cosmos-memory/README.md | End-user documentation, configuration guidance, and usage examples. |
| python/packages/azure-cosmos-memory/AGENTS.md | Package-level developer guidance and key behaviors (user_id/thread_id, flush). |
| python/packages/azure-cosmos-memory/LICENSE | Package license. |
| python/packages/azure-cosmos-memory/samples/basic_usage.py | Minimal “raw hooks” sample calling before_run()/after_run() directly. |
| python/packages/azure-cosmos-memory/samples/interactive_chat.py | Interactive CLI sample demonstrating an agent wired with Foundry + Cosmos memory provider. |
| python/packages/azure-cosmos-memory/tests/test_context_provider.py | Unit tests for provider behavior with a mocked memory client (incl. context manager + flush). |
| python/packages/azure-cosmos-memory/tests/test_integration.py | Live-Azure integration tests gated by env vars and markers. |
| python/packages/azure-cosmos-memory/tests/conftest.py | Pytest marker registration for the package’s tests. |

Comment on lines +156 to +169
# Create Azure credential using the standard chain: EnvironmentCredential →
# ManagedIdentityCredential → AzureCliCredential → InteractiveBrowserCredential.
# This works seamlessly in production (via ManagedIdentity) and local dev (via az login).
if credential is None:
    credential = DefaultAzureCredential()  # type: ignore

memory_client = AsyncCosmosMemoryClient(
    cosmos_endpoint=cosmos_endpoint,
    cosmos_database=cosmos_database,
    ai_foundry_endpoint=ai_foundry_endpoint,
    embedding_deployment_name=embedding_deployment_name,
    chat_deployment_name=chat_deployment_name,
    use_default_credential=True,
)
Comment on lines +318 to +362
"""Store conversation turns and optionally trigger memory extraction.

Args:
    agent: The agent that ran this invocation.
    session: The current session.
    context: The invocation context with response populated.
    state: Provider-scoped mutable state.
"""
# Get user_id and thread_id from state or session (warns once if no stable user_id)
user_id = self._resolve_user_id(state, session)
thread_id = state.get("thread_id") or session.state.get("thread_id") or session.session_id or "default"

try:
    # Store input messages
    for msg in context.input_messages:
        if hasattr(msg, "role") and hasattr(msg, "text") and msg.text:
            role_value = msg.role.value if hasattr(msg.role, "value") else str(msg.role)
            if role_value in {"user", "assistant", "system"}:
                await self.memory_client.add_cosmos(
                    user_id=user_id,
                    thread_id=thread_id,
                    role=self._ROLE_MAP.get(role_value, role_value),
                    content=msg.text,
                )

    # Store response messages
    if context.response and context.response.messages:
        for msg in context.response.messages:
            if hasattr(msg, "role") and hasattr(msg, "text") and msg.text:
                role_value = msg.role.value if hasattr(msg.role, "value") else str(msg.role)
                if role_value in {"user", "assistant", "system"}:
                    await self.memory_client.add_cosmos(
                        user_id=user_id,
                        thread_id=thread_id,
                        role=self._ROLE_MAP.get(role_value, role_value),
                        content=msg.text,
                    )

    # Auto-extraction and processing:
    # The AsyncCosmosMemoryClient uses an InProcessProcessor that runs in the background
    # and automatically extracts facts, generates summaries, and reconciles memories based on
    # configured thresholds (FACT_EXTRACTION_EVERY_N, DEDUP_EVERY_N, etc.).
    # This happens asynchronously after add_cosmos() completes, so no explicit process_now() call is needed.
    # To disable auto-extraction, set auto_extract=False and call memory_client.process_now() manually.
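
The input and response loops in this hunk are identical apart from the message source, so one possible consolidation is a single helper called twice. The sketch below assumes only the `add_cosmos` keyword arguments visible in the hunk; `store_messages`, `RecordingClient`, `Msg`, and `Role` are hypothetical stand-ins, and the `_ROLE_MAP` translation is left out for brevity.

```python
import asyncio
from dataclasses import dataclass
from enum import Enum
from typing import Any, Iterable

STORABLE_ROLES = {"user", "assistant", "system"}


class Role(Enum):
    """Hypothetical role enum; the framework's real type may differ."""

    USER = "user"
    ASSISTANT = "assistant"
    TOOL = "tool"


@dataclass
class Msg:
    """Hypothetical message with the two attributes the hunk relies on."""

    role: Any
    text: str


class RecordingClient:
    """Fake client that records calls instead of writing to Cosmos DB."""

    def __init__(self) -> None:
        self.calls: list[dict[str, str]] = []

    async def add_cosmos(self, *, user_id: str, thread_id: str, role: str, content: str) -> None:
        self.calls.append(
            {"user_id": user_id, "thread_id": thread_id, "role": role, "content": content}
        )


async def store_messages(client: Any, messages: Iterable[Any], *, user_id: str, thread_id: str) -> int:
    """Persist every storable message and return how many were written."""
    stored = 0
    for msg in messages:
        text = getattr(msg, "text", None)
        role = getattr(msg, "role", None)
        if not text or role is None:
            continue  # skip empty or malformed messages
        # Accept both enum-like roles and plain strings, as the hunk does.
        role_value = role.value if hasattr(role, "value") else str(role)
        if role_value not in STORABLE_ROLES:
            continue  # e.g. tool messages are not persisted
        await client.add_cosmos(user_id=user_id, thread_id=thread_id, role=role_value, content=text)
        stored += 1
    return stored


async def demo() -> tuple[RecordingClient, int]:
    client = RecordingClient()
    inputs = [Msg(Role.USER, "Hi"), Msg(Role.TOOL, "ignored"), Msg("user", "")]
    responses = [Msg(Role.ASSISTANT, "Hello!")]
    count = await store_messages(client, inputs, user_id="u1", thread_id="t1")
    count += await store_messages(client, responses, user_id="u1", thread_id="t1")
    return client, count


client, stored_count = asyncio.run(demo())
```

Whether the extra indirection is worth it here is a judgment call for the author; the sketch only shows that the filtering rules can live in one place.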

Comment on lines +331 to +341
# Store input messages
for msg in context.input_messages:
    if hasattr(msg, "role") and hasattr(msg, "text") and msg.text:
        role_value = msg.role.value if hasattr(msg.role, "value") else str(msg.role)
        if role_value in {"user", "assistant", "system"}:
            await self.memory_client.add_cosmos(
                user_id=user_id,
                thread_id=thread_id,
                role=self._ROLE_MAP.get(role_value, role_value),
                content=msg.text,
            )
Comment on lines +343 to +355
# Store response messages
if context.response and context.response.messages:
    for msg in context.response.messages:
        if hasattr(msg, "role") and hasattr(msg, "text") and msg.text:
            role_value = msg.role.value if hasattr(msg.role, "value") else str(msg.role)
            if role_value in {"user", "assistant", "system"}:
                await self.memory_client.add_cosmos(
                    user_id=user_id,
                    thread_id=thread_id,
                    role=self._ROLE_MAP.get(role_value, role_value),
                    content=msg.text,
                )

Comment on lines +381 to +389
content = memory.get("content", "")
memory_type = memory.get("memory_type", "")
confidence = memory.get("confidence", 0.0)

# Format: [Type] Content (confidence: X.XX)
if memory_type and confidence:
    formatted.append(f"[{memory_type}] {content} (confidence: {confidence:.2f})")
else:
    formatted.append(content)
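
One detail worth noting in this hunk: `if memory_type and confidence:` treats a confidence of `0.0` the same as a missing one, because `0.0` is falsy in Python. If a zero score should still be shown, a sketch along the following lines distinguishes "absent" from "zero". `format_memory` is a hypothetical helper name, and whether the toolkit ever returns `0.0` is an assumption, not something this PR confirms.

```python
from typing import Any, Mapping


def format_memory(memory: Mapping[str, Any]) -> str:
    """Render one memory, keeping a reported confidence of 0.0 visible."""
    content = memory.get("content", "")
    memory_type = memory.get("memory_type", "")
    confidence = memory.get("confidence")  # None means "not reported"

    # Compare against None so that 0.0 is still treated as a real score.
    if memory_type and confidence is not None:
        return f"[{memory_type}] {content} (confidence: {confidence:.2f})"
    return content


examples = [
    {"content": "Likes tea", "memory_type": "fact", "confidence": 0.9},
    {"content": "Might be vegetarian", "memory_type": "fact", "confidence": 0.0},
    {"content": "Untyped note"},
]
formatted = [format_memory(m) for m in examples]
```

If hiding zero-confidence scores is intentional, the original condition is fine and a short comment saying so would help the next reader.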
Comment on lines +8 to +10
def pytest_configure(config: pytest.Config) -> None:
    """Register custom markers."""
    config.addinivalue_line("markers", "integration: mark test as integration test requiring live Azure accounts")
Comment on lines +170 to +181
1. **Complete [Development Setup](#development-setup)** - Create venv and install package **with sample dependencies**:
```bash
pip install -e ".[dev,samples]"
```
Or install separately:
```bash
pip install -e ".[dev]"
pip install -e ".[samples]"
``` with sample dependencies:
```bash
pip install -e ".[dev,samples]"
```
description = "Azure Cosmos DB Agent Memory Toolkit integration for Microsoft Agent Framework - semantic memory with fact extraction and user profiles."
authors = [{ name = "Microsoft", email = "af-support@microsoft.com"}]
readme = "README.md"
requires-python = ">=3.11"

Contributor Author


Excluded package.

@github-actions

github-actions Bot commented Jun 25, 2026


Python Test Coverage

Python Test Coverage Report •
| File | Stmts | Miss | Cover | Missing |
| --- | --- | --- | --- | --- |
| TOTAL | 42556 | 5086 | 88% | |
report-only-changed-files is enabled. No files were changed during this commit :)

Python Unit Test Overview

| Tests | Skipped | Failures | Errors | Time |
| --- | --- | --- | --- | --- |
| 8325 | 37 💤 | 0 ❌ | 0 🔥 | 2m 14s ⏱️ |

Introduces CosmosMemoryContextProvider, a ContextProvider that wraps the azure-cosmos-agent-memory toolkit to give agents long-term, Cosmos DB-backed memory (fact/procedural recall + user summaries). Includes package scaffolding, unit tests (mocked client), live Azure integration tests (marked), samples, README, and AGENTS.md.

Draft: uv.lock is intentionally left unchanged. This package depends on azure-cosmos-agent-memory (requires Python >=3.11), which is unsatisfiable against the workspace's current >=3.10 floor, so adding it to the shared lock requires a workspace decision (raise floor to 3.11 or exclude from workspace). Test coverage to be expanded.
The package depends on azure-cosmos-agent-memory which requires Python
>=3.11 and a prompty pre-release (>=2.0.0a9). Both are unsatisfiable
against the workspace's >=3.10 floor and pre-release policy, causing
uv sync to fail in every Python CI job. Exclude the package from the
shared workspace so it is resolved and tested as a standalone package.
- Strip trailing whitespace from package files (pre-commit trailing-whitespace hook)
- Exclude the package README from markdown-code-lint: the package is excluded
  from the uv workspace, so its README snippets import a module that is not
  installed in the workspace env and Pyright cannot resolve it
@TheovanKraay TheovanKraay force-pushed the feature/cosmos-memory-context-provider branch from ab30779 to 4bc5c8e Compare June 25, 2026 14:36