The AI Quality Orchestrator for modern engineering teams.
Agnox is a unified platform that integrates with your CI/CD pipelines, executes tests across any framework, and delivers actionable AI-powered insights — directly where your team works.
Test automation teams waste hours every day on:
- Debugging failures — manually sifting through thousands of log lines with no clear root cause
- Flaky tests — intermittent failures that erode confidence but are hard to identify and quarantine
- Wasted CI time — running the full test suite on every PR, even when only a handful of files changed
- Fragmented tooling — different frameworks, environments, and teams with no unified view
Agnox sits between your CI/CD pipeline and your test framework, bringing everything together in one platform:
| Capability | What it does |
|---|---|
| AI Triage | Dual-agent LLM pipeline analyzes failures, generates a root cause, and self-critiques to eliminate hallucinations |
| Flakiness Detective | Detects unstable tests across historical runs, scores flakiness, and surfaces actionable recommendations |
| Smart PR Routing | An AI reads your changed files and dispatches only the relevant tests — not the full suite |
| Test Optimizer | A dual-agent analyzer rewrites test cases into clean BDD format and removes duplicate steps |
| Quality Chatbot | Ask natural-language questions about your test data — powered by a sanitized, AI-generated aggregation pipeline |
| Auto Bug Generation | One click generates a structured bug report pre-filled with the AI's root cause analysis, ready to submit to Jira, Linear, or Monday.com |
| Spec-to-Test Generation | Upload a PDF, DOCX, or Markdown spec — a 4-stage agentic pipeline (Extractor → Generator → Critic → Formatter) streams progress via SSE and generates draft test cases directly into your suite |
| Automation Planner | Turns your manual test library into a prioritised automation roadmap — AI scores each test's risk level, you select up to 30, and it generates a complete Markdown automation strategy document ready to paste into Cursor or GitHub Copilot |
| Framework-Agnostic | Bring your own Docker image — Playwright, Cypress, Pytest, JUnit, Selenium, or any custom framework |
| Multi-Tenant SaaS | Complete data isolation, RBAC, and per-organization quotas and billing |
| Notifications | Configurable Slack and MS Teams webhooks deliver colour-coded execution summaries with AI analysis snippets and deep links to the Investigation Hub — each channel is independent and per-status configurable |
| Linear Integration | Create Linear issues from failed executions in one click; bidirectional linkage keeps your issue tracker in sync with your test results |
| Monday.com Integration | Create Monday.com board items from the Investigation Hub or Auto-Bug Generator; description attached as an update on the item; bidirectional mondayItems[] linkage on each execution |
| Generic Outbound Webhook | Fire-and-forget execution.finished event POSTed to any HTTPS endpoint on execution completion with optional HMAC-SHA256 signing |
When tests fail, Agnox captures logs automatically and runs a dual-agent LLM pipeline:
- An Analyzer agent produces a structured root cause and suggested fix as JSON
- A Critic agent evaluates the analysis against the raw logs and overrides hallucinations
- The final, vetted analysis is rendered as developer-facing Markdown in the Investigation Hub
Your LLM key stays yours. Agnox supports BYOK (Bring Your Own Key) for Gemini, GPT-4o, and Claude, with a platform-default fallback.
Agnox scores each test group's flakiness over time and delivers a structured verdict with findings and recommendations — so you can quarantine unstable tests before they derail your pipeline.
Connect Agnox to your GitHub repository via a webhook. When a PR fires:
- Agnox extracts the changed files from the payload
- An LLM maps the changes to the most relevant test subfolder
- A targeted test run is dispatched — not the full suite
- The execution appears in the Dashboard with a Smart PR badge and full CI context
# GitHub Actions example
- name: Trigger Smart PR Tests
run: |
curl -X POST ${{ secrets.AGNOX_API_URL }}/api/webhooks/ci/pr?token=${{ secrets.AGNOX_ORG_ID }}&env=staging \
-H "Content-Type: application/json" \
-H "X-Hub-Signature-256: sha256=$(echo -n '${{ toJson(github.event) }}' | openssl dgst -sha256 -hmac '${{ secrets.WEBHOOK_SECRET }}' | cut -d' ' -f2)" \
-d '${{ toJson(github.event) }}'Ask questions about your test data in plain English — "What is the flakiness rate for the checkout group this week?" — and get a clear answer backed by real aggregation data from your executions.
Agnox is framework-agnostic. Any Docker-based test runner works out of the box:
| Framework | Docker Image Example | Command |
|---|---|---|
| Playwright | mcr.microsoft.com/playwright:v1.40.0 |
npx playwright test |
| Cypress | cypress/included:13.6.0 |
cypress run |
| Pytest | python:3.11-slim |
pytest --html=report.html |
| JUnit | openjdk:17-slim |
mvn test |
| Selenium | selenium/standalone-chrome:latest |
pytest tests/ |
| Custom | Any Docker image | Any command |
There are two ways to connect your automation project to Agnox.
The Agnox CLI scaffolds a production-ready Docker setup for your test repo in seconds.
npx @agnox/agnox-cli@latest initWhat it does:
- Generates a
Dockerfile,entrypoint.sh, and.dockerignoretailored to your framework - Auto-detects your Playwright version from
package.jsonand pins the correct base image - Builds a multi-platform Docker image (
linux/amd64+linux/arm64) and pushes it to Docker Hub
After running the CLI:
- Open the Agnox Dashboard
- Go to Settings → Run Settings
- Create a project and enter the Docker image name
- Configure your environment URLs (Dev / Staging / Prod)
The platform uses this image for all runs triggered from the Execution Modal or your CI pipeline.
Updating Your Image & CI/CD:
To update your Docker image seamlessly (e.g., in your CI/CD pipeline), use the push command:
npx @agnox/agnox-cli@latest push -i <image_tag>This allows you to update the connected image tag programmatically, ensuring Agnox always executes against your latest test suite version.
npm:
@agnox/agnox-cli
Stream live Playwright results directly to your Agnox dashboard — no Dockerfile, no CI changes required.
npm install --save-dev @agnox/playwright-reporter// playwright.config.ts
import AgnoxReporter from '@agnox/playwright-reporter';
export default defineConfig({
reporter: [
['list'],
[AgnoxReporter, {
apiKey: process.env.AGNOX_API_KEY,
projectId: process.env.AGNOX_PROJECT_ID,
}],
],
});Key features:
- Live ingest — results stream to the dashboard as each test completes
- Auto CI detection — automatically attaches branch, PR number, and commit SHA from GitHub Actions, GitLab CI, Azure DevOps, and Jenkins
- Do No Harm — all reporter errors are silently suppressed; your test suite is never affected
- Source filter — runs appear as
External CIin the Dashboard, separate from Docker-hosted runs - Structured failure context (v2.0) — optionally re-export
testfrom@agnox/playwright-reporterin your shared fixtures file; an auto-fixture captures the failure URL, page title, console errors, and failed network requests at the moment of failure and injects them directly into the AI root-cause analysis — no test file modifications required
Trigger Agnox from any CI pipeline using a simple curl call.
name: Run E2E Tests via Agnox
on: [push, pull_request]
jobs:
test:
runs-on: ubuntu-latest
steps:
- uses: actions/checkout@v3
- name: Trigger Tests
run: |
curl -X POST https://api.agnox.dev/api/ci/trigger \
-H "Content-Type: application/json" \
-H "x-api-key: ${{ secrets.AGNOX_API_KEY }}" \
-d '{
"projectId": "${{ secrets.AGNOX_PROJECT_ID }}",
"image": "myorg/my-tests:latest",
"command": "npx playwright test",
"folder": "tests/e2e",
"config": { "environment": "staging" },
"ciContext": {
"source": "github",
"repository": "${{ github.repository }}",
"prNumber": ${{ github.event.number || 0 }},
"commitSha": "${{ github.sha }}"
}
}'e2e-tests:
stage: test
script:
- |
curl -X POST https://api.agnox.dev/api/ci/trigger \
-H "Content-Type: application/json" \
-H "x-api-key: $AGNOX_API_KEY" \
-d '{
"projectId": "'"$AGNOX_PROJECT_ID"'",
"image": "myorg/my-tests:latest",
"command": "npx playwright test",
"folder": "tests/e2e",
"config": { "environment": "staging" },
"ciContext": {
"source": "gitlab",
"repository": "'"$CI_PROJECT_PATH"'",
"commitSha": "'"$CI_COMMIT_SHA"'"
}
}'For the full API reference, see docs.agnox.dev.
- Sign up at agnox.dev — your organization is created automatically
- Generate an API Key in Settings → Profile → API Access
- Connect your project using the CLI or the Native Playwright Reporter
- Trigger your first run and watch results stream live in the dashboard
Agnox is built with security-first principles:
- JWT + RBAC — HS256 with Redis-backed revocation; Admin, Developer, and Viewer roles
- AES-256-GCM — all stored secrets (API keys, integration tokens) are encrypted at rest
- Rate Limiting — per-organization and per-IP limits (Redis-backed)
- HMAC Webhook Verification — constant-time
X-Hub-Signature-256validation for PR Routing webhooks - 5-Layer AI Sanitizer — every LLM-generated MongoDB pipeline is validated against a strict allowlist before execution, preventing NoSQL injection via AI-generated queries
- Log Sanitization — secret values are automatically redacted from all streamed container logs
| Resource | Link |
|---|---|
| Full Documentation | docs.agnox.dev |
| Website | agnox.dev |
| Agnox CLI on npm | @agnox/agnox-cli |
| Playwright Reporter on npm | @agnox/playwright-reporter |
| Contact & Support | info@agnox.dev |
This repository contains the public-facing CLI and documentation for Agnox. We welcome bug reports, documentation improvements, and feature suggestions.
- Found a bug in the CLI? Open an issue in this repository.
- Have a feature idea? Start a discussion or reach out at info@agnox.dev.
- Want to read the full docs? Head to docs.agnox.dev.
Built for engineering teams who care about quality.
agnox.dev · docs.agnox.dev