3-Tier hybrid AI router that orchestrates FunctionGemma-270M on-device and Gemini 2.5 Flash Lite in the cloud for 99% function-calling accuracy at 548ms avg latency. Built at the Cactus × Google DeepMind Hackathon.
hackathon gemini edge-ai on-device-ai google-deepmind hybrid-ai function-calling llm-routing ai-tinkerers functiongemma cactus-compute localhost-router
-
Updated
Feb 22, 2026 - Python