# Using nvm (recommended)
curl -o- https://raw.githubusercontent.com/nvm-sh/nvm/v0.39.0/install.sh | bash
nvm install 20
nvm use 20
node --version # v20.x.x
Step 2: Install Ollama with GPU Support
# Linux / macOS:
curl -fsSL https://ollama.com/install.sh | sh
# Windows: https://ollama.com/download/windows
# Verify GPU
nvidia-smi # Should show GTX 1650 + CUDA 12.4
git clone https://github.com/ElioNeto/enginai.git
cd enginai
npm install
npm run build
# Test the stack
npx ts-node src/cli/main.ts config --check
# Create a project
npx ts-node src/cli/main.ts create \
--type api \
--name my-service \
--language typescript \
--framework express
7️⃣ Performance Benchmarks
Token Generation on This Machine
| Task | Gemini 2.5 Flash | Ollama CPU | Ollama GPU | GPU Speedup |
| --- | --- | --- | --- | --- |
| Generate plan (500 tokens) | 5s | 30s | 10s | 3x |
| TypeScript code (200 lines) | 8s | 45s | 15s | 3x |
| Code review | 3s | 20s | 7s | 2.9x |
| Unit tests | 6s | 35s | 12s | 2.9x |
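The GPU Speedup column is consistent with dividing the Ollama CPU time by the Ollama GPU time and rounding to one decimal place. A minimal sketch of that arithmetic, using the timings from the table; the `Benchmark` type and `gpuSpeedup` helper are illustrative names, not part of EnginAI:

```typescript
// Speedup = CPU time / GPU time, rounded to one decimal place.
// Timings (in seconds) are copied from the benchmark table.
// `Benchmark` and `gpuSpeedup` are hypothetical names used for illustration.
interface Benchmark {
  task: string;
  cpuSeconds: number;
  gpuSeconds: number;
}

const benchmarks: Benchmark[] = [
  { task: "Generate plan (500 tokens)", cpuSeconds: 30, gpuSeconds: 10 },
  { task: "TypeScript code (200 lines)", cpuSeconds: 45, gpuSeconds: 15 },
  { task: "Code review", cpuSeconds: 20, gpuSeconds: 7 },
  { task: "Unit tests", cpuSeconds: 35, gpuSeconds: 12 },
];

function gpuSpeedup(b: Benchmark): number {
  return Math.round((b.cpuSeconds / b.gpuSeconds) * 10) / 10;
}

for (const b of benchmarks) {
  console.log(`${b.task}: ${gpuSpeedup(b)}x`);
}
```

To compare against your own hardware, replace the timings with your measurements and the same ratio applies.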
Cost Comparison
| Stack | Monthly Cost | Requests/Day | Quality |
| --- | --- | --- | --- |
| EnginAI (Gemini + Ollama) | $0 | Unlimited | ⭐⭐⭐⭐⭐ |
| OpenAI GPT-4 Turbo | ~$35–70 | 200–400 | ⭐⭐⭐⭐⭐ |
| Anthropic Claude 3.5 | ~$30–60 | 200–400 | ⭐⭐⭐⭐⭐ |
| Groq (free tier) | $0 | 14,400 | ⭐⭐⭐⭐ |
| Ollama only (local) | $0 | ∞ | ⭐⭐⭐⭐ |
8️⃣ Troubleshooting
GPU Not Being Used by Ollama
ollama list # List installed models
ollama ps # Show loaded models and whether they run on GPU or CPU
nvidia-smi # Check VRAM usage during generation
# If not using GPU, reinstall NVIDIA drivers:
# Windows: https://www.nvidia.com/download/index.aspx
# Linux: sudo apt install nvidia-driver-550
Gemini 429 Rate Limit
// ModelRouter already implements automatic fallback:
// Gemini 429 → automatically routes to Ollama
// Quota resets at midnight (tracked in ~/.enginai/quota.json)
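For readers curious what a 429 fallback of this kind might look like, here is a minimal sketch. It is not EnginAI's actual `ModelRouter`; the names `Provider`, `isRateLimit`, and `generateWithFallback` are hypothetical, and it assumes the cloud client surfaces rate limiting as an error carrying `status === 429`. The stub providers stand in for real Gemini and Ollama clients.

```typescript
// A provider is any async function that turns a prompt into text.
// All names in this sketch are hypothetical, not EnginAI's API.
type Provider = (prompt: string) => Promise<string>;

interface RoutedResult {
  text: string;
  servedBy: "primary" | "fallback";
}

// Assumption: a rate-limited call rejects with an error whose `status` is 429.
function isRateLimit(err: unknown): boolean {
  return (
    typeof err === "object" &&
    err !== null &&
    (err as { status?: unknown }).status === 429
  );
}

// Try the primary provider; on a 429 only, retry once on the fallback.
// Any other error is rethrown so real failures are not hidden.
async function generateWithFallback(
  prompt: string,
  primary: Provider,
  fallback: Provider
): Promise<RoutedResult> {
  try {
    return { text: await primary(prompt), servedBy: "primary" };
  } catch (err) {
    if (isRateLimit(err)) {
      return { text: await fallback(prompt), servedBy: "fallback" };
    }
    throw err;
  }
}

// Stub providers standing in for the cloud and local model clients.
const rateLimitedCloud: Provider = async () => {
  const err = new Error("rate limited") as Error & { status?: number };
  err.status = 429;
  throw err;
};
const localModel: Provider = async (prompt) => `local: ${prompt}`;

generateWithFallback("hello", rateLimitedCloud, localModel).then((r) =>
  console.log(r.servedBy, r.text)
);
```

Falling back only on 429 keeps authentication or network errors visible instead of silently routing every failure to the local model.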
TypeScript Build Errors
npm run typecheck # See all type errors
npm run lint # See all lint warnings
npm run build # Full compile to dist/