Skip to content
View saksham1211's full-sized avatar
🎯
Focusing
🎯
Focusing

Block or report saksham1211

Block user

Prevent this user from interacting with your repositories and sending you notifications. Learn more about blocking users.

You must be logged in to block users.

Maximum 250 characters. Please don't include any personal information such as legal names or email addresses. Markdown supported. This note will be visible to only you.
Report abuse

Contact GitHub support about this user’s behavior. Learn more about reporting abuse.

Report abuse
saksham1211/README.md

Saksham Dubey

Software engineer with five years building AI/ML systems in production. I work across the full stack - LLMs, classical ML, deep learning, and the infrastructure underneath.

LinkedIn LeetCode Email


What I work on

Generative AI & LLMs RAG pipelines, agentic systems, multi-agent routing, prompt engineering. Production work with GPT, Claude, and open-source foundation models.

Classical ML & Deep Learning Classification, clustering (HDBSCAN, LDA), topic modeling, BERT and transformer fine-tuning, XGBoost. Applied to real problems with messy data.

ML Infrastructure Model serving, GPU inference, vector databases, Kubernetes autoscaling, Redis caching, latency optimization.

Backend & Distributed Systems FastAPI, Kafka, PostgreSQL, event-driven microservices. Backend design as part of the AI system, not separate from it.


System design patterns I've worked with

Agentic routing          Intent classification → dynamic agent dispatch
                         Fallback chains, confidence thresholds, retry logic

RAG systems              Chunking strategies, dense + sparse retrieval
                         Reranking, context compression, hallucination mitigation

Document intelligence    Schema extraction, field validation, structured output
                         Azure AI Search, vector + keyword hybrid retrieval

Log & event analytics    Temporal windowing, density-based clustering (HDBSCAN)
                         Noise filtering, pattern surfacing at scale

ML serving               GPU inference on Kubernetes, horizontal autoscaling
                         Redis-backed caching, p99 latency budgeting

NLP pipelines            BERT fine-tuning, topic modeling (LDA), similarity scoring
                         Batch inference, compliance classification

Stack

Languages    Python · Java · SQL

AI / ML      LangChain · HuggingFace · PyTorch · TensorFlow
             FAISS · XGBoost · Transformers · OpenAI · Anthropic

Cloud        Azure (AKS, AI Search, Cosmos DB)
             AWS (SageMaker, EKS, Lambda, S3)

Infra        Kubernetes · Docker · Redis · Kafka

Backend      FastAPI · Flask · PostgreSQL · Node.js · Microservices

Projects

Project What it is
PromptIQ Prompt evaluation engine - TF-IDF task classification, model-fit scoring, no LLM in the eval loop
self-healing-rag-pipeline Minimal RAG pipeline built to understand the retrieval layer properly, not abstract it
agent-routing-benchmark Intent-based multi-agent router with a benchmark comparing routing strategies
llm-eval-kit Lightweight harness scoring LLM outputs on faithfulness, relevance, and coherence

Currently

Building open-source LLM eval and prompt quality tooling. Working through system design and DSA alongside a full-time job.


Open to conversations about AI/ML engineering roles.

Pinned Loading

  1. PromptIQ PromptIQ Public

  2. self-healing-rag-pipeline self-healing-rag-pipeline Public

    Python