Senior software engineer with 12 years of production experience. Currently focused on multi-agent systems and LLM inference optimization.
- mlx-od-moe — On-Demand Mixture of Experts for Apple Silicon. Run 375GB models in 192GB RAM via memory-mapped expert loading.
- AgentOS — Multi-agent orchestration layer for Windsurf IDE over Chrome DevTools Protocol.
- cascade-multiagent — Programmatic multi-agent control for Windsurf Cascade via CDP.
- live-translation-local — Real-time multi-modal conversation system. Whisper, NLLB-200, pyannote.audio, Even Realities G2 glasses.
- localllm-hub — Local LLM routing and serving layer.
Inference optimization · Agent orchestration · Local LLMs · Real-time pipelines · Apple Silicon · Multi-agent systems
Based in Dallas, TX.


