Testing Guide

Test strategy and commands for the RUSH Policy RAG system.

Last Updated: 2026-02-14

See also: RAG_TESTING_PLAN.md for the full RAG evaluation strategy, pre-production gates, and weekly monitoring metrics.

Quick Start

# Backend unit tests
cd apps/backend
pytest tests/ -v

# Frontend type check
cd apps/frontend
npm run check

# Full evaluation suite
cd apps/backend
python scripts/run_test_dataset.py

Test Types

1. Backend Unit Tests

Location: apps/backend/tests/

cd apps/backend

# Run all tests
pytest tests/ -v

# Run with coverage
pytest tests/ -v --cov=app --cov-report=html

# Run specific test file
pytest tests/test_chat_service.py -v

# Run specific test
pytest tests/test_chat_service.py::test_process_chat -v

Test Files:

File	Tests
`test_auth.py`	Azure AD authentication
`test_chat_service.py`	Chat orchestration
`test_cohere_rerank_service.py`	Cohere reranking
`test_on_your_data_service.py`	Azure OpenAI integration
`test_query_validation.py`	Input validation
`test_query_enhancer.py`	Query rewriting and enhancement
`test_security.py`	Security checks
`test_api_security.py`	API endpoint security (headers, CORS, rate limits)
`test_prompt_injection.py`	Prompt injection defense (unicode, homoglyphs)
`test_rag_security.py`	RAG pipeline security (adversarial queries)
`test_synonym_service.py`	Query expansion
`test_synonym_expansion.py`	Synonym expansion integration
`test_citation_formatter.py`	Citation formatting
`test_location_normalization.py`	Location parsing
`test_search_metadata.py`	Search metadata and entity filters
`test_rag_evaluation.py`	RAG evaluation metrics
`test_script_index_safety.py`	Index safety guard rails
`test_monthly_release_gate.py`	Monthly release gate checks
`conftest.py`	Shared fixtures and test configuration
`example_queries.py`	Example query data for tests

Frontend Test Files:

File	Tests
`src/__tests__/security.test.ts`	Frontend security (XSS, CSP)

2. Frontend Checks

Location: apps/frontend/

cd apps/frontend

# TypeScript type checking
npm run check

# ESLint
npm run lint

# Build (catches additional errors)
npm run build

3. RAG Evaluation Suite

Location: apps/backend/ (uses root scripts)

cd apps/backend

# Core test dataset (36 tests)
python scripts/run_test_dataset.py

# Enhanced evaluation (60+ tests)
python scripts/run_enhanced_evaluation.py

# Specific category tests
python scripts/run_enhanced_evaluation.py --category cohere_negation
python scripts/run_enhanced_evaluation.py --category hallucination_fabrication
python scripts/run_enhanced_evaluation.py --category risen_citation

Test Categories:

Category	Tests	Purpose
`cohere_negation`	8	Cross-encoder negation understanding
`cohere_contradiction`	4	Premise contradiction detection
`hallucination_fabrication`	5	Prevent inventing policies
`hallucination_extrapolation`	3	Prevent speculation
`risen_role`	4	RAG-only, no opinions
`risen_citation`	3	Mandatory citation compliance
`risen_refusal`	3	Safety bypass refusal
`risen_adversarial`	5	Jailbreak resistance
`risen_unclear`	4	Gibberish/typo handling
`safety_critical`	4	Life-safety accuracy
`verbatim_accuracy`	4	Exact numbers/timeframes

Running Tests Locally

Prerequisites

# Backend
cd apps/backend
python -m venv venv
source venv/bin/activate  # or venv\Scripts\activate on Windows
pip install -r requirements.txt
pip install pytest pytest-cov pytest-asyncio

# Frontend
cd apps/frontend
npm install

Environment Setup

Tests require environment variables. Create apps/backend/.env:

# Minimum for unit tests (many tests mock Azure services)
SEARCH_ENDPOINT=https://test.search.windows.net
SEARCH_API_KEY=test-key
AOAI_ENDPOINT=https://test.openai.azure.com/
AOAI_API_KEY=test-key

Run Full Test Suite

# Backend
cd apps/backend
pytest tests/ -v --tb=short

# Frontend
cd apps/frontend
npm run check && npm run lint && npm run build

CI/CD Tests

GitHub Actions runs on every PR (.github/workflows/ci.yml):

Backend Linting: Black, isort, flake8
Backend Tests: pytest with coverage
Frontend TypeScript: npm run check
Frontend Lint: npm run lint
Frontend Build: npm run build
CodeQL: Security analysis

Writing Tests

Backend Test Pattern

# tests/test_example.py
import pytest
from unittest.mock import Mock, patch

@pytest.fixture
def mock_search_index():
    """Fixture for mocked search index."""
    mock = Mock()
    mock.search.return_value = [...]
    return mock

def test_search_returns_results(mock_search_index):
    """Test that search returns expected results."""
    results = mock_search_index.search("test query")
    assert len(results) > 0

@pytest.mark.asyncio
async def test_async_chat():
    """Test async chat endpoint."""
    # ... async test code

Test Naming Convention

test_<function_name>_<scenario> for unit tests
test_<feature>_<expected_behavior> for integration tests
Example: test_chat_service_returns_citations

Coverage Requirements

Area	Minimum Coverage
Core services	70%
Security code	90%
New features	80%

Generate coverage report:

pytest tests/ --cov=app --cov-report=html
open htmlcov/index.html

Evaluation Results

Test results are saved to:

test_results.json - Basic test results
enhanced_evaluation_results.json - Full evaluation report

View summary:

cat enhanced_evaluation_results.json | jq '.report.summary'

Troubleshooting Tests

Tests Failing with Import Errors

# Ensure you're in the right directory
cd apps/backend

# Activate virtual environment
source venv/bin/activate

# Reinstall dependencies
pip install -r requirements.txt

Tests Timing Out

# Increase timeout
pytest tests/ -v --timeout=60

# Run single test to debug
pytest tests/test_chat_service.py::test_specific -v -s

Mock Issues

# Ensure correct patch target
@patch('app.services.chat_service.PolicySearchIndex')  # Full path
def test_with_mock(mock_index):
    ...

Performance Benchmarks

Expected baselines:

Cold start: < 5 seconds
Warm query: < 2 seconds
Cache hit: < 100ms

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Testing Guide

Quick Start

Test Types

1. Backend Unit Tests

2. Frontend Checks

3. RAG Evaluation Suite

Running Tests Locally

Prerequisites

Environment Setup

Run Full Test Suite

CI/CD Tests

Writing Tests

Backend Test Pattern

Test Naming Convention

Coverage Requirements

Evaluation Results

Troubleshooting Tests

Tests Failing with Import Errors

Tests Timing Out

Mock Issues

Performance Benchmarks

FilesExpand file tree

TESTING.md

Latest commit

History

TESTING.md

File metadata and controls

Testing Guide

Quick Start

Test Types

1. Backend Unit Tests

2. Frontend Checks

3. RAG Evaluation Suite

Running Tests Locally

Prerequisites

Environment Setup

Run Full Test Suite

CI/CD Tests

Writing Tests

Backend Test Pattern

Test Naming Convention

Coverage Requirements

Evaluation Results

Troubleshooting Tests

Tests Failing with Import Errors

Tests Timing Out

Mock Issues

Performance Benchmarks