Visual perception MCP server — screenshots, UI inspection, element detection.
Gives AI agents visual perception of running desktop applications. Takes screenshots, inspects UI trees, finds elements by text or role, and reports window geometry. Works across Electron, Tauri, Flutter, Qt, and SwiftUI apps.
npx orellius-eye
Or globally:
npm install -g orellius-eye
Add to your MCP config:
{
  "mcpServers": {
    "eye": {
      "command": "npx",
      "args": ["orellius-eye"]
    }
  }
}
- 6 tools: screenshot, detect_app, inspect_ui, find_element, get_window_info, list_windows
- CDP (Chrome DevTools Protocol) + accessibility API support
- Cross-platform: macOS, Linux, Windows
- Works with Electron, Tauri, Flutter, Qt, SwiftUI apps
- Image optimization via sharp
- Element detection by text, role, or CSS selector
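As a rough illustration of how a client might invoke one of the tools listed above, the sketch below builds an MCP `tools/call` request as a JSON-RPC 2.0 message. The tool names come from this README; the helper `buildToolCall` and the argument names `text` and `role` are assumptions made for the example, not the server's documented schema. Query the server with `tools/list` to see the real input schemas.

```typescript
// The six tool names listed in this README.
type EyeTool =
  | "screenshot"
  | "detect_app"
  | "inspect_ui"
  | "find_element"
  | "get_window_info"
  | "list_windows";

// Shape of an MCP `tools/call` request (JSON-RPC 2.0).
interface ToolCallRequest {
  jsonrpc: "2.0";
  id: number;
  method: "tools/call";
  params: { name: EyeTool; arguments: Record<string, unknown> };
}

let nextId = 1;

// Hypothetical helper: builds a request with a fresh id.
// It does not validate the arguments against the tool's schema.
function buildToolCall(
  name: EyeTool,
  args: Record<string, unknown> = {}
): ToolCallRequest {
  return {
    jsonrpc: "2.0",
    id: nextId++,
    method: "tools/call",
    params: { name, arguments: args },
  };
}

// "text" and "role" are assumed argument names, used only for illustration.
const request = buildToolCall("find_element", { text: "Save", role: "button" });
console.log(JSON.stringify(request));
```

In practice an MCP client library would normally handle the initialization handshake and message framing; this only shows the shape of the request that would reach the server.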
Built with TypeScript, the MCP SDK, chrome-remote-interface, and sharp.
MIT — Orellius Labs