Agentune Simulate

Launching an AI Agent? Stop guessing, start simulating.

Many developers and data scientists struggle to test and validate AI agents effectively. Some deploy directly to production, testing on real customers! Others perform A/B testing, which also means testing on real customers. Many rely on predefined tests that cover main use cases but fail to capture real user intents.

Agentune Simulate creates a customer simulator (twin) based on a set of real conversations. It captures the essence of your customers' inquiries and the way they converse, allowing you to simulate conversations with your AI agent, ensuring it behaves as expected before deployment.

Ready to deploy your improved AI agent? Use Agentune Simulate to validate it first against real customer interactions!

Need help? Please contact us. We are committed to helping early adopters make the most of Agentune Simulate!

How Does It Work?

Running a simulation with Agentune Simulate generates realistic conversations between your AI agent and simulated customers. This lets you evaluate your agent's performance, identify edge cases, and validate behavior before real deployment.

How do we validate the twin customer simulator? We create a twin AI agent and let the two converse. We then evaluate the conversations to check that the customer simulator behaves like your real customers:

Capture Conversations - Collect real conversations between customers and your existing AI agent
Create Simulator - Create a twin Customer Simulator and a twin AI agent from the captured conversations
Simulate & Evaluate - Simulate interactions and evaluate whether the twin Customer Simulator behaves like your real customers
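To make the evaluation step more concrete, here is a minimal sketch of one way to compare simulated conversations with real ones. It is not the library's evaluation code: the dictionary layout (`outcome`, `messages`) and the helper names are assumptions chosen for illustration, and the metric (total variation distance between outcome distributions, plus average message counts) is just one plausible choice.

```python
from collections import Counter
from statistics import mean


def outcome_distribution(conversations):
    """Return the share of conversations that end in each outcome."""
    counts = Counter(c["outcome"] for c in conversations)
    total = sum(counts.values())
    return {name: n / total for name, n in counts.items()}


def compare(real, simulated):
    """Summarize how closely simulated conversations track real ones."""
    real_dist = outcome_distribution(real)
    sim_dist = outcome_distribution(simulated)
    outcomes = set(real_dist) | set(sim_dist)
    # Total variation distance: 0 means identical outcome mixes, 1 means disjoint.
    tv_distance = 0.5 * sum(
        abs(real_dist.get(o, 0.0) - sim_dist.get(o, 0.0)) for o in outcomes
    )
    return {
        "outcome_tv_distance": tv_distance,
        "avg_messages_real": mean(len(c["messages"]) for c in real),
        "avg_messages_simulated": mean(len(c["messages"]) for c in simulated),
    }


# Hypothetical toy data; real inputs would come from your captured conversations.
real = [
    {"outcome": "resolved", "messages": ["hi", "help", "thanks"]},
    {"outcome": "unresolved", "messages": ["hi", "bye"]},
]
simulated = [
    {"outcome": "resolved", "messages": ["hello", "issue", "ok", "thanks"]},
    {"outcome": "resolved", "messages": ["hi", "done"]},
]

report = compare(real, simulated)
print(report)
```

A small distance and similar message counts suggest the simulator tracks real customer behavior on these two dimensions; they do not by themselves show that the content of the conversations is realistic.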

Testing & Costs

We've tested Agentune Simulate with gpt-4o. In our tests, the cost was approximately 5-10 cents per conversation.
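As a rough budgeting aid, the per-conversation range can be scaled to a planned run. The helper below is an illustrative sketch, not part of the library. It assumes the 5-10 cent range holds for your data and model, which may not be the case for longer conversations or different models.

```python
def estimate_cost_usd(num_conversations, low_cents=5, high_cents=10):
    """Return a (low, high) cost estimate in US dollars for a simulation run."""
    low = num_conversations * low_cents / 100
    high = num_conversations * high_cents / 100
    return low, high


# Example: budget for simulating 500 conversations.
low, high = estimate_cost_usd(500)
print(f"500 conversations: ${low:.2f} to ${high:.2f}")
```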

Quick Start

Install Agentune Simulate

pip install agentune-simulate

Basic usage example

from agentune.simulate import SimulationSessionBuilder
from langchain_openai import ChatOpenAI

# Load your conversations and create outcomes first: this snippet assumes
# `conversations`, `outcomes`, and `vector_store` are already defined
session = SimulationSessionBuilder(
    default_chat_model=ChatOpenAI(model="gpt-4o"),
    outcomes=outcomes,
    vector_store=vector_store
).build()

# Run simulation (top-level `await` works in a notebook; in a script, call it from an async function)
results = await session.run_simulation(real_conversations=conversations)
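Because `run_simulation` is awaited, the snippet above runs as written in a notebook but not at the top level of a plain Python script. The sketch below shows the usual `asyncio.run` pattern. `StubSession` is a hypothetical stand-in so the example runs without the library or an API key; in real use you would pass the session built by `SimulationSessionBuilder` instead.

```python
import asyncio


class StubSession:
    """Hypothetical stand-in for a built session; not part of the library."""

    async def run_simulation(self, real_conversations):
        await asyncio.sleep(0)  # pretend to do asynchronous work
        return {"simulated": len(real_conversations)}


async def main(session, conversations):
    # In a script, `await` is only valid inside an async function.
    return await session.run_simulation(real_conversations=conversations)


# asyncio.run starts an event loop, runs main() to completion, and closes the loop.
results = asyncio.run(main(StubSession(), ["conv-1", "conv-2"]))
print(results)
```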

Learn with examples

Quick Start - getting_started.ipynb for a short introductory example
Production Setup - persistent_storage_example.ipynb for a more realistic, scalable example with persistent storage
Validate Your Data - Adapt the second example to load your own conversation data and validate the simulation. For an example of loading conversations from tabular data, see loading_conversations.ipynb
Connect Real Agent - real_agent_integration.ipynb for integrating your existing agent systems
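Before opening the notebooks, it can help to see what "tabular conversation data" typically looks like. The sketch below groups message rows into per-conversation lists using only the standard library. The column names (`conversation_id`, `timestamp`, `sender`, `content`) are assumptions for illustration, and the output is plain dictionaries rather than the library's own conversation objects; the notebook covers that conversion.

```python
import csv
import io
from collections import defaultdict

# Hypothetical sample data: one row per message.
SAMPLE_CSV = """conversation_id,timestamp,sender,content
c1,2024-01-01T10:00:00,customer,My order has not arrived
c1,2024-01-01T10:01:00,agent,Let me check that for you
c2,2024-01-02T09:30:00,customer,How do I reset my password?
"""


def group_rows(csv_text):
    """Group message rows by conversation and order each group by timestamp."""
    conversations = defaultdict(list)
    for row in csv.DictReader(io.StringIO(csv_text)):
        conversations[row["conversation_id"]].append(
            {
                "timestamp": row["timestamp"],
                "sender": row["sender"],
                "content": row["content"],
            }
        )
    for messages in conversations.values():
        # ISO 8601 timestamps in the same format sort correctly as strings.
        messages.sort(key=lambda m: m["timestamp"])
    return dict(conversations)


grouped = group_rows(SAMPLE_CSV)
for conversation_id, messages in grouped.items():
    print(conversation_id, len(messages))
```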

📧 Need help? Have feedback? Contact us at agentune-dev@sparkbeyond.com

Contributing

Environment Setup: Environment Setup Guide
Coding Standards: Style Guide
