Skip to content

A macOS app for running parallel AI agents in sandboxed local VMs

Notifications You must be signed in to change notification settings

johnbean393/Hivecrew

Repository files navigation

Hivecrew app icon

Hivecrew

A macOS app that runs parallel AI computer use agents in local VMs

Download DMG

Hivecrew is a native macOS app that runs AI computer use agents in dedicated virtual machines. It's like having N white collar employees, each with a computer.

Dispatch tasks from a central dashboard, watch agents work autonomously, and step in to guide them whenever needed—all while keeping your host system completely isolated and safe.

Our Mission: Make AI agents practical for real work by ensuring they are:

  • Parallel: Run multiple agents simultaneously
  • Auditable: Track every action agents take
  • Safe: Isolate agents from your host system
  • Transparent: Monitor agent behavior in real time

Watch the demo
Click to watch the demo video

Table of Contents

Features

Task Management

Agent Environment View

  • Natural Language Tasks: Describe what you want done in plain language; agents handle the rest
  • File Attachments: Attach input files using @ mentions or drag-and-drop, and specify output directories for deliverables
  • Batch Execution: Run multiple copies of the same task (1x, 2x, 4x, 8x) across parallel agents
  • Task Queue: Queue tasks for later and monitor status (queued, running, completed, failed)

Subagent System

  • Parallel Research and Execution: Spawn focused subagents for research, data gathering, and verification while the main agent continues the task
  • Specialized Workflows: Delegate subtasks like competitor scans, documentation lookups, or multi-source comparisons to targeted subagents
  • Inter-Agent Messaging: Agents can send messages to each other (point-to-point or broadcast) that are automatically delivered into the recipient's context
  • Faster Convergence: Combine results from multiple subagents to reduce back-and-forth and reach decisions sooner
  • Agent Swarms: Leverage's Kimi-K2.5's training for agent swarms

LLM Providers

Provider Settings

  • Multi-Provider: Works with Anthropic, OpenAI, OpenRouter, and any OpenAI-compatible API
  • Per-Task Selection: Choose which provider and model to use for each task
  • Local LLMs: Connect to local LLM servers with custom base URLs
  • Recommended Provider: We suggest using OpenRouter for easy switching between different models with a single API key

Recommended Models

Model Best For Notes
Kimi K2.5 Most general tasks Best balance between cost and performance
Claude Sonnet 4.5 Screen interaction tasks Only recommended for tasks requiring heavy clicking, pointing, and visual navigation (e.g., completing UI tests of a web app)

Safety Controls

  • Full Isolation: Each agent runs in its own macOS VM—your host system stays protected
  • Network Control: Configure per-VM network access (internet, offline, or host-only)
  • Timeouts & Limits: Set task timeouts and iteration limits
  • Emergency Stop: Instantly halt any agent at any time

Agent Supervision

Session Trace View

  • Live Monitoring: Watch agents work in real time with live screenshots and activity streams
  • Reasoning Traces: View streamed reasoning tokens from models with extended thinking capabilities
  • Session Traces: Review detailed step-by-step traces with synchronized screenshots after completion
  • Video Export: Export session traces as video for documentation or review

Human-in-the-Loop

Human-in-the-Loop

  • Take Control: Pause any agent, use your mouse and keyboard directly, then resume
  • Answer Questions: Respond to agent questions via text or multiple choice when they need guidance
  • Add Instructions: Inject clarifying instructions mid-task without restarting
  • Approve Actions: Review and approve/deny tool permission requests

Plan Mode

  • Plan Before Executing: Toggle Plan mode to have agents create a detailed execution plan before starting work
  • Visual Plan Review: Review plans with interactive Mermaid diagrams and structured checklists
  • Edit & Approve: Modify the plan, add steps, or reject and regenerate before the agent begins execution
  • Streamed Planning: Watch the plan generate in real time with live reasoning updates

Scheduling

Scheduling View

  • One-Time & Recurring: Schedule tasks for a specific time or set up daily, weekly, or monthly recurrence
  • Automatic Notifications: Get notified when scheduled tasks start running
  • Manual Trigger: Instantly run any scheduled task on demand

Skills System

Skills Browser

  • Pre-Built Skills: Browse and apply skills for common tasks (web research, document processing, webapp testing)
  • Skill Discovery: Skills are automatically matched to tasks based on your description
  • Import Skills: Add skills from GitHub repositories or local directories
  • Extract Skills: Create new skills from successful task completions

MCP Servers

  • Model Context Protocol: Connect agents to external tools and services via the MCP standard
  • Multiple Transports: Configure servers using Standard I/O (local processes) or HTTP (remote servers)
  • Custom Configuration: Set commands, arguments, working directories, and environment variables per server
  • Enable/Disable: Toggle individual MCP servers on or off without removing their configuration

Image Generation

  • Built-in Tool: Agents can generate images on demand during task execution
  • Multiple Providers: Supports OpenRouter and Gemini image generation APIs
  • Reference Images: Generate variations or edits using reference images as input

Credentials & Security

  • Secure Storage: Store login credentials safely in Keychain
  • On-Demand Access: Credentials are passed to agents only when needed via secure tokens
  • CSV Import: Bulk import credentials from CSV files

Web Interface

Web UI on Mobile

  • Remote Access: Built-in web UI lets you manage agents from any browser. Pair with a service like Tailscale to securely access Hivecrew running via your phone from anywhere in the world
  • Full Task Management: Create, plan, monitor, pause, resume, cancel, rerun, and delete tasks directly from the browser

API & Automation

  • REST API: Control Hivecrew programmatically—create tasks, manage schedules, upload files, and download results
  • Python SDK: Use the hivecrew package for easy integration with Python workflows

Requirements

  • macOS Sequoia (15.0) or later
  • Apple Silicon Mac (M1 or newer)
  • At least 16GB RAM recommended for running concurrent agents
  • ~64GB free disk space per VM

Installation

  1. Download the latest release from the releases page
  2. Double-click the downloaded Hivecrew.dmg file to mount the disk image
  3. Drag the Hivecrew app icon to your Applications folder
  4. Run the Hivecrew app

Build from Source

  1. Clone the repository:
git clone https://github.com/johnbean393/Hivecrew.git
cd Hivecrew
  1. Open the workspace:
open Hivecrew.xcworkspace
  1. Download and place the cloudflared binary:
# Download the latest macOS ARM64 release
curl -L -o cloudflared.tgz https://github.com/cloudflare/cloudflared/releases/latest/download/cloudflared-darwin-arm64.tgz
tar xzf cloudflared.tgz
mv cloudflared Hivecrew/Hivecrew/Resources/cloudflared/
chmod +x Hivecrew/Hivecrew/Resources/cloudflared/cloudflared
rm cloudflared.tgz
  1. Build and run from Xcode (requires signing with appropriate entitlements)

Note: The Virtualization framework requires specific entitlements that must be granted via provisioning profiles or notarization. The cloudflared binary is not included in the repository due to its size — see Hivecrew/Hivecrew/Resources/cloudflared/README.md for details.

API

Hivecrew includes a REST API for programmatic task control. Enable it in Settings → API and generate an API key.

Python SDK

pip install hivecrew

Example: Automated UI Testing

from hivecrew import HivecrewClient

client = HivecrewClient()  # Uses HIVECREW_API_KEY env var

result = client.tasks.run(
    description="""
    Test the login flow:
    1. Open Safari and go to https://staging.example.com
    2. Click "Sign In" and enter test@example.com / testpass123
    3. Verify the dashboard loads and shows "Welcome back"
    4. Take a screenshot and save it to the outbox
    """,
    provider_name="OpenRouter",
    model_id="anthropic/claude-sonnet-4.5",
    output_directory="./test-results",
    timeout=600.0
)

if result.was_successful:
    print(f"Test passed: {result.result_summary}")
else:
    print(f"Test failed: {result.result_summary}")

Example: Scheduled Task with File Attachments

from hivecrew import HivecrewClient
from datetime import datetime, timedelta

client = HivecrewClient()

# Schedule a weekly report task with input files attached
schedule = client.schedules.create(
    title="Weekly Sales Report",
    description="""
    Process the attached sales data files:
    1. Open the CSV files and analyze the data
    2. Create a summary report with key metrics
    3. Generate charts for revenue trends
    4. Save the report as PDF to the outbox
    """,
    provider_name="OpenRouter",
    model_id="anthropic/claude-sonnet-4.5",
    files=["./data/sales_q1.csv", "./data/sales_q2.csv"],
    recurrence={
        "type": "weekly",
        "days_of_week": [2],  # Monday (1=Sunday, 7=Saturday)
        "hour": 9,
        "minute": 0
    }
)

print(f"Scheduled task created: {schedule.id}")
print(f"Next run: {schedule.next_run_at}")

See the hivecrew-python repository for full documentation.

Contributing

Contributions are welcome! Areas where help would be particularly valuable:

  • Additional automation tools and capabilities
  • UI/UX improvements
  • Testing and reliability
  • Documentation

License

MIT License