Getting Started with Agentic Data Scientist

This guide will help you understand and use the Agentic Data Scientist multi-agent workflow.

Installation

# Install from PyPI using uv
uv tool install agentic-data-scientist

# Or use with uvx (no installation needed)
uvx agentic-data-scientist "your query here"

Prerequisites

Python 3.12 or later
Node.js (for Claude Code)
API keys:
- ANTHROPIC_API_KEY for Claude (required)
- OPENROUTER_API_KEY for planning/review models (required)

Quick Start

1. Set up environment variables

Create a .env file in your project root:

# Required: API keys
ANTHROPIC_API_KEY=your_anthropic_key_here
OPENROUTER_API_KEY=your_openrouter_key_here

# Optional: Model configuration
DEFAULT_MODEL=google/gemini-2.5-pro
CODING_MODEL=claude-sonnet-4-5-20250929

Get your API keys:

OpenRouter: https://openrouter.ai/keys
Anthropic: https://console.anthropic.com/

2. Run your first query

Important: You must specify --mode to choose your execution strategy.

# Complex analysis with full workflow
agentic-data-scientist "Perform differential expression analysis" --mode orchestrated --files data.csv

# Quick scripting task
agentic-data-scientist "Write a Python script to parse CSV" --mode simple

# Question answering
agentic-data-scientist "Explain gradient boosting" --mode simple

3. Working Directory Options

By default, files are saved to ./agentic_output/ and preserved after completion:

# Default behavior (files preserved)
agentic-data-scientist "Analyze data" --mode orchestrated --files data.csv

# Temporary directory (auto-cleanup)
agentic-data-scientist "Quick exploration" --mode simple --files data.csv --temp-dir

# Custom location
agentic-data-scientist "Project analysis" --mode orchestrated --files data.csv --working-dir ./my_project

Understanding the Workflow

When you submit a query, Agentic Data Scientist goes through a multi-phase workflow designed to produce high-quality, validated results.

What Happens When You Run a Query

USER QUERY: "Analyze customer churn patterns in this dataset"
     |
     v
┌────────────────────────────────────────────────────────┐
│ PHASE 1: PLANNING (Iterative)                         │
├────────────────────────────────────────────────────────┤
│ 1. Plan Maker creates comprehensive analysis plan     │
│    - Breaks down task into logical stages             │
│    - Defines clear success criteria                   │
│    - Recommends appropriate methodologies             │
│                                                        │
│ 2. Plan Reviewer validates the plan                   │
│    - Checks completeness                              │
│    - Verifies all requirements are addressed          │
│    - Provides feedback if improvements needed         │
│                                                        │
│ 3. Loop repeats until plan is approved                │
│                                                        │
│ 4. Plan Parser structures it for execution            │
│    - Converts to executable stages                    │
│    - Sets up success criteria tracking                │
│                                                        │
│ RESULT: Validated, comprehensive execution plan       │
└────────────────────────────────────────────────────────┘
     |
     v
┌────────────────────────────────────────────────────────┐
│ PHASE 2: EXECUTION (Stage by Stage)                   │
├────────────────────────────────────────────────────────┤
│ For each stage in the plan:                           │
│                                                        │
│ A. IMPLEMENTATION LOOP (Iterative)                    │
│    1. Coding Agent implements the stage               │
│       - Has access to 380+ scientific Skills          │
│       - Can read/write files, run code                │
│       - Creates scripts, analyses, visualizations     │
│                                                        │
│    2. Review Agent validates implementation           │
│       - Checks code quality and correctness           │
│       - Verifies stage requirements are met           │
│       - Provides specific feedback if issues found    │
│                                                        │
│    3. Loop repeats until approved                     │
│                                                        │
│ B. PROGRESS VALIDATION                                │
│    4. Criteria Checker updates progress               │
│       - Inspects generated files and results          │
│       - Updates which success criteria are now met    │
│       - Provides objective evidence                   │
│                                                        │
│ C. ADAPTIVE REPLANNING                                │
│    5. Stage Reflector adapts remaining work           │
│       - Considers what's been accomplished            │
│       - Identifies what still needs to be done        │
│       - Modifies or extends remaining stages          │
│                                                        │
│ Then proceeds to next stage...                        │
│                                                        │
│ RESULT: All stages implemented and validated          │
└────────────────────────────────────────────────────────┘
     |
     v
┌────────────────────────────────────────────────────────┐
│ PHASE 3: SUMMARY                                       │
├────────────────────────────────────────────────────────┤
│ Summary Agent creates final report                     │
│ - Synthesizes all work performed                      │
│ - Documents key findings and results                  │
│ - Lists all generated files and outputs               │
│ - Provides comprehensive analysis narrative           │
│                                                        │
│ RESULT: Publication-ready comprehensive report        │
└────────────────────────────────────────────────────────┘

Key Workflow Characteristics

Iterative Refinement

Plans are reviewed and refined before execution begins
Implementations are validated before proceeding to the next stage
Multiple opportunities to catch and fix issues early

Adaptive Execution

Discoveries during implementation inform subsequent stages
Plan adapts based on actual progress and findings
Flexible enough to handle unexpected insights

Continuous Validation

Success criteria tracked objectively throughout execution
Clear visibility into what's been accomplished vs. what remains
Objective evidence for each criterion's status

Separation of Concerns

Planning agents focus on strategy without implementation details
Coding agent focuses on implementation without planning burden
Review agents provide independent validation

Python API Usage

Basic Usage

from agentic_data_scientist import DataScientist

# Create an instance and run a query
with DataScientist() as ds:
    result = ds.run("What is data science?")
    print(result.response)
    
# Access results
print(f"Status: {result.status}")
print(f"Duration: {result.duration}s")
print(f"Files created: {result.files_created}")

With File Upload

from agentic_data_scientist import DataScientist

with DataScientist() as ds:
    result = ds.run(
        "Analyze trends in this time series data",
        files=[
            ("sales.csv", open("sales.csv", "rb").read()),
            ("inventory.csv", open("inventory.csv", "rb").read()),
        ]
    )
    print(result.response)
    print(f"Working directory: {ds.working_dir}")

Async Usage with Streaming

import asyncio
from agentic_data_scientist import DataScientist

async def analyze_data():
    async with DataScientist() as ds:
        async for event in await ds.run_async(
            "Perform differential expression analysis",
            files=[("data.csv", open("data.csv", "rb").read())],
            stream=True
        ):
            # Watch the workflow in real-time
            if event['type'] == 'message':
                author = event['author']
                content = event['content']
                print(f"[{author}] {content}")
            elif event['type'] == 'completed':
                print(f"✓ Completed in {event['duration']}s")

asyncio.run(analyze_data())

Multi-turn Conversation

import asyncio
from agentic_data_scientist import DataScientist

async def chat():
    async with DataScientist() as ds:
        context = {}
        
        # First turn
        result1 = await ds.run_async(
            "What are the main techniques for dimensionality reduction?",
            context=context
        )
        print("AI:", result1.response)
        
        # Second turn (maintains context)
        result2 = await ds.run_async(
            "Which one would you recommend for high-dimensional gene expression data?",
            context=context
        )
        print("AI:", result2.response)

asyncio.run(chat())

Understanding Streaming Events

When using stream=True, you'll receive events as the workflow progresses:

async for event in await ds.run_async("Your query", stream=True):
    event_type = event['type']
    
    if event_type == 'message':
        # Regular text output from agents
        print(f"[{event['author']}] {event['content']}")
        
    elif event_type == 'function_call':
        # Agent is using a tool
        print(f"Calling {event['name']}...")
        
    elif event_type == 'function_response':
        # Tool returned a result
        print(f"Tool {event['name']} completed")
        
    elif event_type == 'usage':
        # Token usage information
        tokens = event['usage']
        print(f"Tokens: {tokens['total_input_tokens']} in, {tokens['output_tokens']} out")
        
    elif event_type == 'completed':
        # Workflow finished
        print(f"Done in {event['duration']}s")
        print(f"Created {len(event['files_created'])} files")

Execution Modes

Orchestrated Mode (Recommended)

Full multi-agent workflow with planning, validation, and adaptive execution.

When to use:

Complex data analyses
Multi-step workflows
Tasks requiring validation
Production analyses

Example:

agentic-data-scientist "Perform DEG analysis comparing treatment vs control" \
  --mode orchestrated \
  --files treatment.csv --files control.csv

Simple Mode

Direct coding without planning overhead.

When to use:

Quick scripts
Code generation
Question answering
Rapid prototyping

Example:

agentic-data-scientist "Write a function to merge CSV files" --mode simple

Next Steps

See the docs/ folder for additional guides on API usage, CLI options, customization, and technical architecture.

Troubleshooting

Common Issues

ImportError: No module named 'agentic_data_scientist'

Install the package: pip install agentic-data-scientist or uv sync

API Key Errors

Ensure your .env file is in the correct location
Verify API keys are valid and active
Check that keys have sufficient credits

Node.js Issues

Ensure Node.js is installed: node --version
Required for Claude Code agent
Restart terminal after installing Node.js

Workflow Seems Stuck

Enable streaming to see progress: --stream or stream=True
Check logs for error messages
Workflow may be running long computations - be patient

Getting Help

Check the full documentation in the docs/ folder
Open an issue on GitHub

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Getting Started with Agentic Data Scientist

Installation

Prerequisites

Quick Start

1. Set up environment variables

2. Run your first query

3. Working Directory Options

Understanding the Workflow

What Happens When You Run a Query

Key Workflow Characteristics

Python API Usage

Basic Usage

With File Upload

Async Usage with Streaming

Multi-turn Conversation

Understanding Streaming Events

Execution Modes

Orchestrated Mode (Recommended)

Simple Mode

Next Steps

Troubleshooting

Common Issues

Getting Help

FilesExpand file tree

getting_started.md

Latest commit

History

getting_started.md

File metadata and controls

Getting Started with Agentic Data Scientist

Installation

Prerequisites

Quick Start

1. Set up environment variables

2. Run your first query

3. Working Directory Options

Understanding the Workflow

What Happens When You Run a Query

Key Workflow Characteristics

Python API Usage

Basic Usage

With File Upload

Async Usage with Streaming

Multi-turn Conversation

Understanding Streaming Events

Execution Modes

Orchestrated Mode (Recommended)

Simple Mode

Next Steps

Troubleshooting

Common Issues

Getting Help