Saksham Dubey saksham1211

Saksham Dubey

Software engineer with five years building AI/ML systems in production. I work across the full stack - LLMs, classical ML, deep learning, and the infrastructure underneath.

What I work on

Generative AI & LLMs RAG pipelines, agentic systems, multi-agent routing, prompt engineering. Production work with GPT, Claude, and open-source foundation models.

Classical ML & Deep Learning Classification, clustering (HDBSCAN, LDA), topic modeling, BERT and transformer fine-tuning, XGBoost. Applied to real problems with messy data.

ML Infrastructure Model serving, GPU inference, vector databases, Kubernetes autoscaling, Redis caching, latency optimization.

Backend & Distributed Systems FastAPI, Kafka, PostgreSQL, event-driven microservices. Backend design as part of the AI system, not separate from it.

System design patterns I've worked with

Agentic routing          Intent classification → dynamic agent dispatch
                         Fallback chains, confidence thresholds, retry logic

RAG systems              Chunking strategies, dense + sparse retrieval
                         Reranking, context compression, hallucination mitigation

Document intelligence    Schema extraction, field validation, structured output
                         Azure AI Search, vector + keyword hybrid retrieval

Log & event analytics    Temporal windowing, density-based clustering (HDBSCAN)
                         Noise filtering, pattern surfacing at scale

ML serving               GPU inference on Kubernetes, horizontal autoscaling
                         Redis-backed caching, p99 latency budgeting

NLP pipelines            BERT fine-tuning, topic modeling (LDA), similarity scoring
                         Batch inference, compliance classification

Stack

Languages    Python · Java · SQL

AI / ML      LangChain · HuggingFace · PyTorch · TensorFlow
             FAISS · XGBoost · Transformers · OpenAI · Anthropic

Cloud        Azure (AKS, AI Search, Cosmos DB)
             AWS (SageMaker, EKS, Lambda, S3)

Infra        Kubernetes · Docker · Redis · Kafka

Backend      FastAPI · Flask · PostgreSQL · Node.js · Microservices

Projects

Project	What it is
PromptIQ	Prompt evaluation engine - TF-IDF task classification, model-fit scoring, no LLM in the eval loop
self-healing-rag-pipeline	Minimal RAG pipeline built to understand the retrieval layer properly, not abstract it
agent-routing-benchmark	Intent-based multi-agent router with a benchmark comparing routing strategies
llm-eval-kit	Lightweight harness scoring LLM outputs on faithfulness, relevance, and coherence

Currently

Building open-source LLM eval and prompt quality tooling. Working through system design and DSA alongside a full-time job.

Open to conversations about AI/ML engineering roles.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Saksham Dubey saksham1211

Achievements

Achievements

Highlights

Block or report saksham1211