Skip to content

weijt606/ai-agent-map

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

26 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

AI Agent Map

ZH EN License Agent

Pixel-art style AI Agent Map banner showing four regions — daily coding agents, general autonomous agents, frameworks and platforms, and runtimes and tools — with agent icons placed on an illustrated treasure-map landscape

AI Agent Map is a practical, visual-first guide for comparing mainstream AI agents, agent platforms, runtimes, and orchestration tools.

The goal is simple: help readers get to a sensible shortlist faster.

What This Repo Is For

  • The agent landscape is crowded.
  • Many resources explain ideas, but not fit, anti-fit, or operating cost.
  • People usually need a comparison layer, not another pile of links.

This repo stays focused on selection: what a system is good at, where it breaks down, and what kind of operator cost comes with it.

Where To Start

If your question is... Start here
I need a shortlist first Open agents
I need help choosing for coding automation Read coding guide
I already have candidates and want a side-by-side view View mainstream matrix
I care about dimensions like approval, memory, scheduling, and deployment Browse capabilities

Recent Heat Ranking

Popularity is not fit.

This table tracks projects that showed up as especially hot in the latest weekly GitHub snapshot. The rank follows the 7-day gain. The total star counts below were checked when this repo was updated.

Last updated: 2026-05-23 · Snapshot window: 2026-05-16 → 2026-05-22 (7-day gain, approximate) · Star counts: checked at update time

Project names link to the upstream GitHub repo. When this map has a written profile, it is linked separately in the "Map status" column.

Rank Project Current stars Snapshot gain Map status How to read it
#1 mattpocock/skills 100.9k +7,600 Watchlist (Skills Wave) Matt Pocock's curated .claude/skills directory crossed 100k this week — still the largest week-over-week gainer among already-tracked entries
#2 (new) anthropics/skills 139.3k ~+7,000 Watchlist (Skills Wave canonical) Anthropic's own reference .claude/skills repo — the upstream source of the wave; tracked as content asset, not agent surface
#3 openhuman 25.7k +6,300 In scope · profile (new) Second straight week of strong growth — open-source desktop life-integration agent (Rust + Tauri, GPL-3.0); positioning now reads clearly enough to write up
#4 (new) colbymchenry/codegraph 16.4k ~+6,000 In scope · profile (new) Pre-indexed code knowledge graph + MCP server targeting Claude Code, Cursor, Codex CLI, opencode, Hermes Agent — first agent-context-infrastructure entry
#5 Hermes Agent 163.1k +5,800 In scope · profile Fifth consecutive week of growth — still the in-scope absolute leader (now past 163k)
#6 (new) HKUDS/CLI-Anything 39.5k ~+5,500 In scope · profile (new) HKUDS lab's "make all software agent-native" framework — auto-generates Click-based CLIs from arbitrary source code so agents can drive non-API apps
#7 Superpowers 202.8k +5,100 In scope · profile Crossed 200k — the agentic skills framework keeps growing alongside the broader wave
#8 (new) K-Dense-AI/scientific-agent-skills 25.2k ~+4,500 Watchlist (Skills Wave) Ready-to-use Agent Skills for research / science / engineering / finance / writing — another curated collection in the wave
#9 (new) Imbad0202/academic-research-skills 19.0k ~+4,500 Watchlist (Skills Wave) Curated academic research skills for Claude Code — research → write → review → revise → finalize pipeline
#10 (new) humanlayer/12-factor-agents 21.7k ~+3,500 Out of scope "12-factor" principles doc for production LLM-powered software — methodology asset, not an agent surface, but a meaningful market signal
  • Heat is useful for discovery, not for selection by itself.
  • This week added three new in-scope profiles — each one opens a route the map had not yet covered: openhuman (life-integration desktop agent), codegraph (agent context infrastructure), CLI-Anything (agent-native software bridge).
  • The .claude/skills wave that started last week has intensified. Six of the top ten are skills collections, frameworks, or methodology docs (mattpocock/skills, anthropics/skills, Superpowers, scientific-agent-skills, academic-research-skills, 12-factor-agents). Policy unchanged: curated collections and principles docs are tracked as Skills Wave entries, not profiled — the framework end is already covered through Superpowers.
  • humanlayer/12-factor-agents is intentionally out of scope: principles and methodology docs are valuable signals but this map only profiles runnable agent surfaces and runtime infrastructure.
  • agentmemory 16.4k (+2.8k), DeepSeek-TUI 33.5k (+1.2k), Ruflo 54.2k (+1.2k), Pi 52.9k (+1.4k), addyosmani/agent-skills 44.9k (+1.3k), TradingAgents 78.6k (+1.4k), Codex CLI 84.7k (+1.0k), financial-services 26.7k (+1.0k) all continued to grow at modest pace, dropped out of (or never made) the top 10 as the new wave absorbed most of the trending oxygen.

Market Event: The .claude/skills Wave (May 2026, expanding)

The wave that broke into trending in mid-May has intensified. Across the May 12 → 22 snapshots, six of the top ten trending repositories are now skills frameworks, curated skills collections, or methodology docs built on Anthropic's skill pattern:

Repo Stars Shape
anthropics/skills 139.3k Anthropic's own canonical Agent Skills reference repository — the upstream source of the pattern
Superpowers 202.8k A complete skills framework + methodology, with plugin integrations into Claude Code, Codex, Cursor, GitHub Copilot, Gemini, OpenCode, and Factory Droid
mattpocock/skills 100.9k A curated personal .claude/skills directory from Matt Pocock — now past 100k stars
addyosmani/agent-skills 44.9k A production-grade engineering skills set for AI coding agents from Addy Osmani
K-Dense-AI/scientific-agent-skills 25.2k Ready-to-use Agent Skills for research / science / engineering / analysis / finance / writing
Imbad0202/academic-research-skills 19.0k Curated academic research pipeline (research → write → review → revise → finalize) for Claude Code

What it means for selection:

  • The .claude/skills directory pattern Anthropic introduced has crossed from curiosity into shared infrastructure — engineers are now publishing their personal skill libraries the way they used to publish dotfiles, and Anthropic's own reference repo is the canonical anchor.
  • For people choosing a coding agent, the underlying agent matters less than it did six months ago — the skill layer on top is doing more of the work, and the breadth of available skill collections is now domain-spanning (engineering, science, academic research, finance, productivity).
  • This map treats the "agentic skills framework" as a route in its own right via the Superpowers profile. Curated skill collections (anthropics/skills, mattpocock/skills, addyosmani/agent-skills, scientific-agent-skills, academic-research-skills) are tracked as Skills Wave watchlist entries rather than profiled, because they are content assets rather than agent surfaces.

Market Event: GPT-5.5 Enters The Agent Landscape

On April 23 2026, OpenAI released GPT-5.5 — positioned as "a new class of intelligence for real work and powering agents." This is not a GitHub-trending project but a model release that reshapes the capability ceiling across multiple agent surfaces already tracked in this repo.

What changed Impact on this map
82.7% on Terminal-Bench 2.0 Highest agentic coding benchmark at launch — raises the bar for Codex and all OpenAI-API-based agents
1M token context window Long-context tasks that were impractical before become viable for API-based agent builders
2x price vs GPT-5.4 Cost-sensitive teams must re-benchmark per-task economics
SWE-Bench Pro at 58.6% Still trails Claude Opus 4.7 (64.3%) — model choice depends on workload

GPT-5.5 does not replace Codex as a product entry. It is the model layer underneath. See the GPT-5.5 profile for the full breakdown.

Codex Product Update (April 16 2026)

A week before the GPT-5.5 release, OpenAI shipped "Codex for (almost) everything," a major capability expansion on the Codex product surface itself.

What changed Why it matters
Background Computer Use Codex can see, click, and type with its own cursor across any macOS app — even ones without an API
Parallel multi-agent execution Multiple Codex agents can run on the same Mac in parallel without interfering with foreground work
90+ new plugins Atlassian Rovo, CircleCI, CodeRabbit, GitLab Issues, Microsoft Suite, and more
In-app browser + proactive suggestions Direct iteration on frontend designs and proactive proposals from project context and memory
3M weekly active developers Reported in April 2026, nearly 2x early-March 2026

Combined with the GPT-5.5 backbone, this is the most consequential agent product update in the snapshot window.

Agent Harness Frameworks At A Glance

A "harness" is the minimal scaffolding around an LLM that turns it into an agent — the loop, the tool surface, the permission model, the skills hook. These are projects you can fork, audit, and own end-to-end, rather than vendor products you adopt as-is.

Project Stars License Sweet spot Footprint
Pi 52.9k MIT (TS) Terminal-first coding harness with broad LLM provider coverage Small core + opt-in skills/extensions
OpenHands 74.5k Open source Full open-source SWE agent (CLI + GUI + cloud option) Heaviest — closer to a product
SWE-agent 19.3k MIT (Py) Research reference behind SWE-bench, single-YAML config Medium; upstream moving focus to mini-swe-agent
mini-swe-agent 4.5k MIT (Py) ~100-line successor; SWE-bench Verified >74% Tiny — readable in one sitting
OpenHarness 13.0k MIT (Py) 10-subsystem open harness with anthropics/skills + MCP + 43 tools Medium; production-shaped, sibling to CLI-Anything

How to read this row:

  • Pick by footprint, not by stars. The right harness is the one whose surface area you are willing to maintain.
  • If you want the smallest credible base to fork: mini-swe-agent.
  • If you want a production-shaped open runtime to self-host: OpenHarness.
  • If you want to publish SWE-bench numbers: SWE-agent is the canonical reference; mini-swe-agent is the working successor.
  • If you want a terminal-first day-to-day coding harness: Pi.
  • If you want a more complete SWE agent product that is still open source: OpenHands.

The First Cut Of The Map

Route Representative projects Typical user
Direct execution Claude Code, Aider, Codex, DeepSeek-TUI, Devin, Jules Someone who wants to hand a concrete coding task to an agent
Agent harness framework Pi, OpenHands, SWE-agent, mini-swe-agent, OpenHarness Someone who wants to own the agent loop, tool surface, and permissions instead of inheriting a vendor's product
Frontier agentic model GPT-5.5 Someone choosing which model to wire into their own agent system or evaluating the capability ceiling of OpenAI-based surfaces
Agentic skills framework Superpowers Someone who wants a methodology + composable skills layer that plugs into Claude Code, Codex, Cursor, and similar agents
Workflow / orchestration layer oh-my-claudecode, oh-my-codex, Ruflo Someone who already likes Claude Code or Codex and wants stronger orchestration on top (Ruflo extends this to multi-machine federation and 100+ specialized agents)
Editor-centric AI workflow Cursor, Windsurf, Continue Someone who wants the editor itself to stay central
Review-first automation Cline, GitHub Copilot, Froge Code Someone who wants review and human control to stay central
Managed background path Claude Managed Agents Someone who needs scheduled, cloud, or detached Anthropic workflows
General-purpose autonomous agent AutoGPT, Agent Zero, BabyAGI, Julep, GenericAgent, ml-intern Someone who wants autonomous, general-purpose task execution (or, in ml-intern's case, autonomous ML engineering)
Build-your-own system LangChain, LangGraph, CrewAI, LlamaIndex, Haystack, Semantic Kernel, DSPy, Pydantic AI Teams building their own agent platform instead of buying one
Runtime and tools n8n, MemGPT, Open Interpreter, LiteLLM, Flowise, CodeGraph, CLI-Anything Teams that need workflow automation, code execution, LLM gateways, agent context infrastructure, agent-driven CLIs, or visual builders
Self-hosted / local runtime AI Edge Gallery, Goose, Hermes Agent, OpenClaw, Mercury Agent, OpenHuman Users who need on-device privacy, long-running agents, local control, channels, devices, or personal-data life integration

Repo Structure

Directory Purpose
Open agents One page per agent with positioning, boundary, and trade-off.
Open use cases Problem-first selection guides.
Open comparisons Side-by-side comparisons for real decisions.
Open capabilities Shared vocabulary for capability differences.

Current Mainstream Coverage

Project Route One-line positioning
Aider Direct execution Terminal-first AI pair programmer close to git
Claude Code Direct execution Local and IDE-first coding agent
Claude Managed Agents Managed background path Anthropic managed / cloud execution mapping
Codex Direct execution Async coding delegation in isolated cloud environments
oh-my-claudecode Workflow layer Teams-first orchestration layer on top of Claude Code
oh-my-codex Workflow layer Stronger workflow, teams, and persistent state around Codex CLI
Cursor Editor-centric platform AI editor spanning local coding, cloud agents, and integrations
GitHub Copilot Platform Multi-surface agent platform across VS Code and GitHub
Cline Review-first execution Approval-first editor-native coding agent
Windsurf AI-native IDE Cascade-centered AI IDE
OpenHands Open-source execution Open-source software engineering agent
Devin Managed execution End-to-end managed software engineering execution
Jules Managed cloud execution GitHub-connected coding delegation with PR handoff
AI Edge Gallery On-device local runtime Mobile-first local assistant sandbox with agent skills
Goose Open-source local platform Extensible local agent across desktop, CLI, and API
Hermes Agent Multi-agent / self-hosted Long-lived self-hosted environment with memory and skills
OpenClaw Runtime Local-first multi-channel runtime layer
LangChain Platform High-level framework for building custom agents quickly
LangGraph Platform Low-level framework for durable stateful workflows
Continue Editor-centric Open-source IDE extension with full model freedom
GPT-5.5 Frontier agentic model OpenAI's agentic model powering Codex, ChatGPT, and API agent builders
AutoGPT Autonomous agent platform Visual agent builder with workflows, marketplace, and multi-model support
CrewAI Multi-agent framework Role-based agent collaboration with fast prototyping
LlamaIndex Data-first framework RAG and agentic applications over documents and data
n8n Workflow automation Visual workflow platform with native AI agent nodes and 400+ integrations
MemGPT Stateful agent platform Persistent memory agents that learn across sessions (now Letta)
Agent Zero Autonomous agent Self-building autonomous agent with dynamic tool creation
BabyAGI Experimental Pioneering autonomous agent experiment — educational, not production
Julep Workflow engine Temporal-backed durable workflow engine for stateful AI agents
Haystack Framework Production-oriented RAG and agent framework by deepset
Semantic Kernel Framework Microsoft's AI orchestration SDK for .NET, Python, and Java
DSPy Framework Programmatic prompt optimization — programming, not prompting, LMs
Open Interpreter Runtime Natural language to local code execution, no sandbox
LiteLLM Infrastructure Unified API gateway for 100+ LLM providers
Pydantic AI Framework Type-safe Python agent framework with structured outputs
Flowise Visual builder Drag-and-drop LLM app and agent builder on top of LangChain
Froge Code Review-first automation Provisionally mapped to Automagik Genie
Mercury Agent Self-hosted multi-channel Permission-hardened agent for CLI and Telegram with token budgets
Pi Direct execution Minimal terminal coding-agent harness with multi-provider LLM support
ml-intern Domain-specific autonomous agent Hugging Face's autonomous ML engineer — research, code, and ship ML using HF tooling
GenericAgent Self-evolving autonomous agent Small-seed agent that grows a personal skill tree on every task
Superpowers Agentic skills framework Methodology and composable skills layer that plugs into Claude Code, Codex, Cursor, and other agents
DeepSeek-TUI Direct execution DeepSeek-native terminal coding agent
Ruflo Workflow / orchestration layer Multi-agent orchestration platform for Claude with federation across machines, neural memory, and 100+ specialized agents
OpenHuman Self-hosted / local runtime Desktop life-integration agent with 118+ connectors, local Memory Tree, and Ollama support
CodeGraph Runtime and tools Pre-indexed code knowledge graph + MCP server for Claude Code, Cursor, Codex CLI, opencode, and Hermes Agent
CLI-Anything Runtime and tools Auto-generates Click-based CLIs for arbitrary software so agents can drive non-API apps
SWE-agent Agent harness framework Princeton + Stanford's original SWE-bench harness with single-YAML configuration
mini-swe-agent Agent harness framework The ~100-line Python successor to SWE-agent that still scores >74% on SWE-bench Verified
OpenHarness Agent harness framework HKUDS's 10-subsystem open agent harness with 43+ tools, anthropics/skills, and MCP

Example Reading Paths

If you are still deciding where to begin, use one of these quick routes and then branch out.

If you sound like this... Follow this path What it helps you answer
I want a day-to-day coding agent and need to choose terminal vs editor AiderClaude CodeCursorClinecoding automation guide Terminal-first local loop vs editor-led flow vs approval-first control
I already like Claude Code or Codex but want stronger orchestration Claude Codeoh-my-claudecodeCodexoh-my-codexmainstream matrix When the base agent is enough and when a workflow layer actually adds value
I want to understand GPT-5.5's impact on the agent landscape GPT-5.5CodexClaude Codemainstream matrix How a frontier model release shifts the capability ceiling and what it means for product choice
I want a dedicated AI IDE instead of stitching tools together CursorWindsurfGitHub Copilotmainstream matrix Dedicated AI editor vs ecosystem platform
I want to hand off tickets and check back later CodexJulesDevinClaude Managed Agentsmainstream matrix Async cloud delegation vs managed background automation
I need something open-source or self-hosted AiderOpenHandsGooseHermes Agentcapabilities Terminal control, open-source execution, and local runtime ownership
I am building an internal agent stack, not buying a product LangChainLangGraphcapabilitiesmainstream matrix Framework vs runtime vs product boundaries

Disclaimer

Star counts and 7-day gains are point-in-time GitHub snapshots taken when the repo is updated; numbers shift quickly between weekly refreshes and small rounding differences are expected. Project descriptions, vendors, and capability summaries reflect public information at the time of writing and may change as projects evolve, get acquired, or pivot. This map is selection guidance — not endorsement, financial advice, or a production-readiness guarantee. Verify against each project's own docs before committing to a choice.

About

A practical, visual-first guide to comparing AI agents, platforms, runtimes, and orchestration tools so you can shortlist the right options faster.

Topics

Resources

License

Contributing

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors