llm-assistant

Here is 1 public repository matching this topic...

An AI-powered MLOps assistant for effortless model compression. Upload PyTorch models to chat with a local LLM expert, receive hardware-aware optimization advice, and perform one-click FP16/INT8 quantization to reduce model size and latency.

  • Updated Sep 11, 2025
  • HTML
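The description above mentions one-click FP16/INT8 quantization to shrink PyTorch models. As a minimal sketch of what that entails (the toy model and size-measurement helper here are illustrative, not part of the repository), FP16 conversion and dynamic INT8 quantization with stock PyTorch look like this:

```python
import io

import torch
import torch.nn as nn


def model_size_mb(model):
    """Serialize the model's weights to an in-memory buffer and report the size."""
    buf = io.BytesIO()
    torch.save(model.state_dict(), buf)
    return buf.getbuffer().nbytes / 1e6


# A toy model standing in for an uploaded PyTorch model (hypothetical).
model = nn.Sequential(nn.Linear(512, 512), nn.ReLU(), nn.Linear(512, 10))
fp32_mb = model_size_mb(model)

# FP16: store every parameter at half precision.
fp16_model = model.half()
fp16_mb = model_size_mb(fp16_model)

# INT8: dynamic quantization of the Linear layers' weights.
int8_model = torch.quantization.quantize_dynamic(
    model.float(), {nn.Linear}, dtype=torch.qint8
)
int8_mb = model_size_mb(int8_model)

print(f"FP32: {fp32_mb:.2f} MB  FP16: {fp16_mb:.2f} MB  INT8: {int8_mb:.2f} MB")
```

FP16 roughly halves the stored weight size, while dynamic INT8 quantization stores Linear weights as 8-bit integers and dequantizes activations on the fly, which also tends to cut CPU inference latency.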
