Skip to content

aayushsoam/clawbot-plus

Folders and files

NameName
Last commit message
Last commit date

Latest commit

ย 

History

5 Commits
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 
ย 

Repository files navigation

๐Ÿค– ClawBot Plus โ€” AI-Powered Computer Control Agent

Your PC. Your AI. Full Control.

GitHub release MIT License Stars PyPI

ClawBot Plus is the ultimate personal AI assistant and open-source autonomous computer control agent for your Windows PC. Whether you need a powerful OpenClaw alternative, a browser automation tool using Playwright, or a Voice AI agent that talks back to you via Edge TTS, ClawBot Plus does it all. It can autonomously open apps, write code, run Python scripts, generate PowerPoint presentations, scrape the web, control system settings, and manage a multi-agent swarm โ€” all from a single CLI or a remote Telegram bot.

If you are looking for an AI that feels local, fast, persistent, and acts as your robotic pair programmer and RPA (Robotic Process Automation) engine โ€” this is it.

Install (recommended)

Runtime: Python 3.11+

pip install clawbot-plus
clawbot

That's it. ClawBot will auto-detect your system, scan your apps, and set up its workspace on first run.

Quick Start

# Start ClawBot
clawbot

# Choose your AI provider (Gemini, OpenAI, Groq, Ollama, GitHub Models, NVIDIA NIM, etc.)
# Choose your model
# Start giving commands!

โฏ open chrome and go to youtube
โฏ make a portfolio website
โฏ what's the weather today
โฏ /voice   # Start voice mode
โฏ switch   # Change AI model
โฏ quit     # Exit

From Source (development)

git clone https://github.com/aayushsoam/clawbot-plus.git
cd clawbot-plus
pip install -e .
clawbot

Supported AI Providers

Provider Models Auth
๐Ÿ”ท Google Gemini gemini-3.1-pro, gemini-2.5-flash, gemini-2.0-flash, etc. GEMINI_API_KEY
๐Ÿ”ต OpenAI gpt-4o, gpt-4.1-mini, o3-mini, o4-mini OPENAI_API_KEY
๐ŸŸ  Anthropic claude-3-7-sonnet, claude-3-5-sonnet, claude-3-opus ANTHROPIC_API_KEY
โšก Groq llama-3.3-70b-versatile, gemma2-9b-it GROQ_API_KEY
๐Ÿ‹ DeepSeek deepseek-chat, deepseek-reasoner DEEPSEEK_API_KEY
๐ŸŸฉ NVIDIA NIM nemotron-120b, llama-3.3-70b, deepseek-r1, phi-4 NVIDIA_API_KEY
๐ŸŒ™ Kimi (Moonshot) moonshot-v1-8k/32k/128k MOONSHOT_API_KEY
๐Ÿ™ GitHub Models gpt-4o, gpt-4.1, Codestral-2501, Llama-3.3-70B GITHUB_TOKEN
๐Ÿฆ™ Ollama (Local) qwen3-vl, llama3.3, deepseek-r1, mistral, phi3 Free (local)
โ˜๏ธ Ollama (Cloud) Same as local, custom host OLLAMA_BASE_URL

Highlights

  • ๐Ÿ–ฅ๏ธ Full Computer Control โ€” open apps, click, type, scroll, screenshot, drag & drop. ClawBot sees your screen and acts.
  • ๐Ÿ’ป Senior Developer Coding โ€” writes production-quality code with separate HTML/CSS/JS files, verifies via shell, and shows output.
  • ๐Ÿ”Š Voice Mode โ€” /voice to talk to ClawBot using your microphone. It speaks back using Edge TTS.
  • ๐Ÿ“ฑ Telegram Bot โ€” connect your Telegram bot and control your PC from your phone.
  • ๐Ÿง  Persistent Memory โ€” remembers your name, preferences, past sessions, and daily logs across restarts.
  • ๐Ÿ›ก๏ธ Anti-Loop Engine โ€” detects when the AI is stuck and forces a different approach or graceful exit.
  • ๐Ÿ” Web Search โ€” search the web and fetch pages without opening a browser.
  • ๐Ÿ“Š PPT Generation โ€” generate_ppt action creates professional PowerPoint presentations.
  • ๐ŸŽฌ Video Generation โ€” local Python or cloud API video creation via Koda-Stack.
  • ๐Ÿค– Multi-Agent Swarm โ€” spawn worker agents for complex projects (ClawTeam).
  • ๐Ÿ”’ Security Layer โ€” dangerous commands are blocked. All shell commands are audited.
  • ๐Ÿ“ฆ Skills System โ€” extensible SKILL.md-based plugin architecture for CCTV, CapCut, LaTeX, Office docs, and more.

How It Works

                            โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
                            โ”‚  You (CLI / Telegram / Voice)โ”‚
                            โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ
                                           โ”‚ (Commands, Voice, Images)
                                           โ–ผ
         โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
         โ”‚ ๐Ÿค– ClawBot Plus (Agentic Core)                                    โ”‚
         โ”‚   โ€ข Brain (LLMs)      โ€ข RAG Memory       โ€ข Anti-Loop Engine       โ”‚
         โ”‚   โ€ข Skill Plugins     โ€ข Error Healing    โ€ข Multi-Agent Team       โ”‚
         โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ฌโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ
                                         โ”‚
             โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ผโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
             โ”‚                           โ”‚                           โ”‚
             โ–ผ                           โ–ผ                           โ–ผ
 โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ   โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ   โ•ญโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฎ
 โ”‚ ๐Ÿ–ฅ๏ธ System Control     โ”‚   โ”‚ ๐Ÿ’ป Code Engine        โ”‚   โ”‚ ๐ŸŒ Browser Automation โ”‚
 โ”‚ --------------------- โ”‚   โ”‚ --------------------- โ”‚   โ”‚ --------------------- โ”‚
 โ”‚ โ€ข OS APIs / PyAutoGUI โ”‚   โ”‚ โ€ข read/write_file     โ”‚   โ”‚ โ€ข browser-use Agent   โ”‚
 โ”‚ โ€ข UI Automation       โ”‚   โ”‚ โ€ข Shell (Subprocess)  โ”‚   โ”‚ โ€ข Playwright / DOM    โ”‚
 โ”‚ โ€ข Window Management   โ”‚   โ”‚ โ€ข Syntax Validation   โ”‚   โ”‚ โ€ข Scrape & Extract    โ”‚
 โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ   โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ   โ•ฐโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ•ฏ

Key Subsystems

  • Computer Agent (clawbot/computer/agent.py) โ€” the brain. Receives tasks, plans actions, executes them, and learns.
  • CLI (clawbot/cli.py) โ€” the face. Interactive prompt with model selection, mode switching, and live task execution.
  • Telegram Bot (clawbot/computer/telegram_bot.py) โ€” remote control from your phone.
  • Voice Engine (clawbot/computer/voice.py) โ€” speech-to-text + text-to-speech loop.
  • Skills (skills/) โ€” modular capabilities loaded on-demand based on task keywords.
  • Memory (~/.clawbot/) โ€” sessions, daily logs, preferences, and persona files.

Workspace & Persona

  • Workspace root: ~/.clawbot/workspace/
  • Persona files: IDENTITY.md, SOUL.md, USER.md, AGENTS.md, BOOTSTRAP.md
  • Skills: skills/<skill_name>/SKILL.md
  • Sessions: ~/.clawbot/sessions/ (one .md per task)
  • Daily memory: ~/.clawbot/memory/ (one .md per day)

Configuration

Minimal ~/.clawbot/clawbot.json:

{
  "default_provider": "๐Ÿ”ท Google Gemini (Free)",
  "default_model": "gemini-2.5-flash"
}

CLI Commands

Command Action
clawbot Start the interactive CLI
clawbot --version Show version
clawbot devices Manage Claw3D device connections
clawbot tunnel Setup remote access tunnel
/voice Start voice control mode
/video Switch to video generation mode
/code Route tasks to OpenCode
/normal Back to standard agent mode
switch Change AI provider/model
quit Exit ClawBot

Chat Commands (Telegram)

Send these from Telegram when connected:

  • Any text โ†’ executes as a task on your PC
  • Images โ†’ analyzed by vision models
  • Voice messages โ†’ transcribed and executed

Tech Stack

  • Python 3.11+ โ€” core runtime
  • browser-use โ€” LLM classes and browser automation
  • PyAutoGUI โ€” screen control, clicks, typing
  • pywin32 โ€” Windows native API (UIAutomation)
  • Rich โ€” beautiful terminal UI
  • InquirerPy โ€” interactive prompts
  • edge-tts โ€” text-to-speech
  • SpeechRecognition โ€” voice input
  • python-telegram-bot โ€” Telegram integration
  • Pillow โ€” image processing for vision
  • httpx โ€” async HTTP client

Project Structure

clawbot/
  __init__.py
  cli.py              # Main CLI entry point
  computer/
    agent.py           # Core AI agent loop
    telegram_bot.py    # Telegram integration
    voice.py           # Voice mode
    actions.py         # System actions
    system_control.py  # OS-level controls
    window_manager.py  # Window focus/snap
    security.py        # Command auditing
    gateway.py         # Claw3D WebSocket gateway
    memory_search.py   # RAG-based memory search
    self_healing.py    # Auto-fix failed code
    workflows.py       # Macro recording/replay
    orchestrator.py    # Multi-agent coordination
  clawteam/            # Multi-agent swarm system
skills/                # Extensible skill plugins
workspace/             # Persona & identity files
agents/                # AI reviewer/planner agents

Star History

Star History Chart

Built By

ClawBot Plus was built by Aayush Soam with significant contributions from Claude (Anthropic) โ€” pair-programmed from architecture to deployment.

  • ๐Ÿ‘ค Aayush Soam โ€” Creator, Vision & Product
  • ๐Ÿค– Claude (Anthropic) โ€” Co-Developer, Architecture & Agent Engineering

License

MIT

About

Autonomous AI agent for browser & computer automation with multi-agent system, RAG memory, and voice control.

Topics

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors