|
I build backend systems that handle real load β not toy demos. The work I'm proudest of usually looks boring from the outside: a sharded write path that doesn't fall over at 1B rows, a workflow engine that uses Kahn's algorithm because cycles are bugs not features, a 300M parameter SLM model I trained from scratch on my laptop because I wanted to actually understand transformers β not import them. I don't care about being right. I care about systems that stay up at 3am. |
role: platform engineer
company: Steps AI
shipping: agent infra
training: 300M param SLM
mood: write more, talk less |
βΈ βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βΈ βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
parameters: 300M
architecture: decoder-only Β· custom hybrid
positional: RoPE
attention: multi-head causal Β· KV-cache
ffn: SwiGLU
norm: Pre-RMSNorm
tokenizer: custom BPE (from scratch)
training: bf16 Β· MPS Β· Apple M5
|
status: in production
stack: go Β· postgres Β· clickhouse
role: led product team end-to-end
adopted by: The Chatterjee Group
(45-country conglomerate)
Cricut (US consumer tech)
|
βΈ βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
βΈ βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
|
FLAGSHIP Quper used by TCG + Cricut |
COMPETITIVE ICPC '23 regionalist Β· rank 73 |
ALGORITHMS LeetCode top 7% globally |
SCALE 10K+ users asksenior backend |
HACKATHON Hack-O-Octo winner Β· blockchain |
βΈ βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β
|
Refactored core modules to NestJS, built repo analytics tooling for the API testing platform. |
100+ tests added, full PostgreSQL upgrade, comprehensive API documentation rewrite. |
βΈ βββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββββ β




