DhruvGarg111/README.md

Dhruv Garg

AI / ML Engineer • Computer Vision • Generative AI

Building practical AI systems, one focused iteration at a time.

Roles

Computer Vision • Deep Learning • Backend Systems


Building intelligent systems that see, understand, and create.


🔬 Engineering Profile

I am a Machine Learning Engineer focused on Computer Vision and Agentic AI, with a strong foundation in scalable backend systems. My engineering philosophy revolves around translating complex research papers into optimized, production-ready code.

  • 🎯 Focus: Bypassing computational bottlenecks in high-resolution (4K) object detection using Explainable AI (XAI).
  • 🤖 AI Engineering: Building local LLM agents that seamlessly interact with third-party ecosystems (Google APIs, etc.).
  • ⚙️ Infrastructure: Architecting robust database migrations and building backend profilers.
  • 💡 Goal: Systems that are not just intelligent, but also fast, scalable, and resilient.

🚀 Featured Projects

🌟 Flagship Projects

Vision Intelligence Infrastructure: A high-performance, async control panel for human-in-the-loop AI annotation.

A dark-themed control panel built for decoupled ML microservices and task queues, so long-running inference never blocks the annotation UI.

Key Innovations:

  • 🚀 Asynchronous ML: Non-blocking AI auto-labeling via PyTorch, LayerCAM & YOLO.
  • Low-Latency UI: Hardware-accelerated React-Konva staging canvas.
  • 🔄 Decoupled Workers: Horizontal scaling via Celery workers behind a message broker.
  • 🔒 Isolated Workspaces: Role-Based Access Control (RBAC) per workspace.

"Finding the needle in the haystack, from 400ft above."

A novel coarse-to-fine computer vision pipeline designed for efficient small object detection in high-resolution (2K/4K) aerial imagery. Tackles the critical trade-off between resolution and latency in drone forensics.

Key Innovations:

  • Uses LayerCAM to identify semantic "hotspots" before processing.
  • Intelligently slices and zooms into regions of interest—skipping 80%+ of empty backgrounds.
  • Outperforms blind sliding-window approaches (SAHI) in both speed and accuracy.
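
The coarse-to-fine step described above can be sketched in a few lines. This is a minimal illustration, not the project's implementation: it assumes a saliency map has already been computed (for example by LayerCAM on a downscaled frame), and the function name `select_hot_tiles`, the 4x4 grid, and the 0.3 threshold are all hypothetical choices made for the example.

```python
from typing import List, Tuple

Box = Tuple[int, int, int, int]  # (x0, y0, x1, y1) in full-resolution pixels


def select_hot_tiles(
    heatmap: List[List[float]],
    image_size: Tuple[int, int],
    grid: Tuple[int, int] = (4, 4),
    threshold: float = 0.3,
) -> List[Box]:
    """Return full-resolution crop boxes for grid cells whose mean saliency
    is at least `threshold`. Cells below it are skipped entirely."""
    img_w, img_h = image_size
    map_h, map_w = len(heatmap), len(heatmap[0])
    cols, rows = grid
    boxes: List[Box] = []
    for r in range(rows):
        for c in range(cols):
            # Cell bounds in heatmap coordinates.
            y0, y1 = r * map_h // rows, (r + 1) * map_h // rows
            x0, x1 = c * map_w // cols, (c + 1) * map_w // cols
            cells = [heatmap[y][x] for y in range(y0, y1) for x in range(x0, x1)]
            if not cells or sum(cells) / len(cells) < threshold:
                continue
            # Map the cell back to full-resolution pixel coordinates.
            boxes.append((
                c * img_w // cols, r * img_h // rows,
                (c + 1) * img_w // cols, (r + 1) * img_h // rows,
            ))
    return boxes


# Toy 8x8 saliency map with a single hotspot in the top-left corner.
toy_map = [[0.0] * 8 for _ in range(8)]
toy_map[0][0] = toy_map[0][1] = toy_map[1][0] = toy_map[1][1] = 0.9
hot = select_hot_tiles(toy_map, image_size=(3840, 2160))
print(hot)
```

Only the returned crops would be passed to the detector at full resolution. On the toy map that is 1 of 16 tiles, which is where the savings over a blind sliding window would come from.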

Transform any image into a masterpiece — in real-time.

A fast neural style transfer implementation that generates stylized images using a feed-forward CNN trained with perceptual loss. Performs instant stylization in a single forward pass.

Key Features:

  • 🚀 Real-time inference with a custom residual architecture.
  • 🧠 Perceptual content & style loss using a pretrained VGG-16 network.
  • 🔁 Instance Normalization integrated for high-quality, artifact-free outputs.
  • 📦 ONNX export supported, ready for edge deployment.
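
To make the perceptual loss concrete, here is a minimal pure-Python sketch of its two terms. It is an illustration under stated assumptions, not the repository's code: in practice the feature maps would come from intermediate layers of the pretrained VGG-16 network, whereas here they are small hand-written lists, and the normalization by C * H * W is one common convention that may differ from the one the project uses. The names `gram_matrix`, `style_loss`, and `content_loss` are hypothetical.

```python
from typing import List

FeatureMap = List[List[float]]  # C channels, each flattened to H*W values


def gram_matrix(features: FeatureMap) -> List[List[float]]:
    """Channel-by-channel inner products, normalized by C * H * W."""
    c = len(features)
    n = len(features[0])
    scale = 1.0 / (c * n)
    return [
        [scale * sum(a * b for a, b in zip(features[i], features[j])) for j in range(c)]
        for i in range(c)
    ]


def style_loss(generated: FeatureMap, style: FeatureMap) -> float:
    """Mean squared difference between the two Gram matrices."""
    g, s = gram_matrix(generated), gram_matrix(style)
    c = len(g)
    return sum((g[i][j] - s[i][j]) ** 2 for i in range(c) for j in range(c)) / (c * c)


def content_loss(generated: FeatureMap, content: FeatureMap) -> float:
    """Mean squared difference between the feature maps themselves."""
    diffs = [(a - b) ** 2 for gr, cr in zip(generated, content) for a, b in zip(gr, cr)]
    return sum(diffs) / len(diffs)


# Two channels, two spatial positions; `shuffled` swaps the spatial positions.
original = [[1.0, 0.0], [0.0, 1.0]]
shuffled = [[0.0, 1.0], [1.0, 0.0]]
print(style_loss(shuffled, original), content_loss(shuffled, original))
```

The toy example shows why the two terms are complementary: swapping spatial positions leaves the Gram matrix unchanged, so the style term is zero, while the content term still penalizes the changed layout.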

📦 More Projects

🧭 pygog (Google CLI Agent)
A powerful CLI for Google services (Gmail, Drive, Calendar). Features a built-in natural language AI agent supporting Gemini, DeepSeek, & OpenAI.
Python • Google APIs • LLM Agents

📐 Depth Estimation + Semantic Segmentation
Multi-modal depth completion using RGB + sparse depth + semantic maps. Features a DepthNet-style encoder-decoder trained on NYU Depth v2 with multi-scale supervision.
PyTorch • NYU-Depth-v2 • Encoder-Decoder


🛠️ Stack Matrix

Vision • Modeling • Serving • Interface

🌐 Open Source Contributions

I actively contribute to the broader developer ecosystem, focusing heavily on AI tooling, backend infrastructure, and application security:

  • 🤖 SynapseKit/SynapseKit: Led additions of multiple LLM providers and vector retrieval backends to broaden model support and storage options. Also built Discord automations for onboarding, moderation, and docs workflows to improve community operations.
  • 🧠 pydantic/pydantic-ai: Upgraded the Anthropic code‑execution tool integration to align with newer API versions. This keeps model tooling stable as providers evolve.
  • 🧪 matorral-project/matorral: Wrote admin‑focused backend tests, including coverage for make_superuser. That hardens privileged flows and reduces regressions in management commands.
  • 🔒 trusera/ai-bom: Contributed to AI SBOM generation that audits project workflows. Outputs CycloneDX‑compliant dependency reports for stronger supply‑chain security.
  • Nikolaev3Artem/fastapi-silk: Implemented SQLite + Alembic migrations to establish a durable database setup. Expanded profiler test coverage to improve reliability and maintainability.

📊 Telemetry

GitHub stats • Contribution streak • Most-committed languages • Activity graph

🔗 Connect & Explore

Website  •  Searchlight Live App  •  Email Me


Built by DhruvGarg111

Pinned

  1. Neural-Style-Transfer (Public) • Python
  2. The-Searchlight-Protocol (Public) • Python