- Project Overview
- Features
- Installation
- Usage
- Models
- Performance Comparison
- Results
- Repository Structure
- License
- Contact
EfficientDet Lite Object Detection with ONNX & TensorRT is a high-performance project designed to implement EfficientDet Lite models (versions 0 to 4) for object detection. Utilizing ONNX for model inference and TensorRT for optimized engine building, this project enables efficient and rapid deployment of object detection models with support for FP32 and FP16 precision on NVIDIA GPUs.
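Before any backend runs, an input image has to be resized to the network's fixed square resolution and batched. The sketch below shows that step with plain NumPy (nearest-neighbour resize). The size table follows the standard EfficientDet Lite exports; the exact resolutions and input layout (NHWC, uint8 here) of this repo's ONNX files are assumptions, so check them against the actual model inputs.

```python
import numpy as np

# Input resolutions of the standard EfficientDet Lite exports
# (assumed; verify against this repo's ONNX files).
INPUT_SIZES = {
    "efficientdet_lite0": 320,
    "efficientdet_lite1": 384,
    "efficientdet_lite2": 448,
    "efficientdet_lite3": 512,
    "efficientdet_lite4": 640,
}

def preprocess(image: np.ndarray, model_type: str) -> np.ndarray:
    """Nearest-neighbour resize to the model's square input and add a
    batch dimension, producing an NHWC uint8 tensor (the layout the
    TFLite-derived exports usually expect)."""
    size = INPUT_SIZES[model_type]
    h, w = image.shape[:2]
    rows = np.arange(size) * h // size          # source row per output row
    cols = np.arange(size) * w // size          # source col per output col
    resized = image[rows[:, None], cols[None, :]]
    return resized[None].astype(np.uint8)       # (1, size, size, 3)
```

In practice the scripts' `--model_type` flag would select the matching entry in this table before inference.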
- Support for EfficientDet Lite Models: Implemented versions 0, 1, 2, 3, and 4.
- ONNX Inference: Run inference directly using ONNX models.
- TensorRT Engine Building: Optimize models with TensorRT for FP32 and FP16 precision.
- Inference Scripts: Execute inference using both ONNX and TensorRT engines seamlessly.
- Performance Benchmarking: Compare latency and speed across different models and backends.
- (TO BE IMPLEMENTED) INT8 Quantization: INT8 Post-Training Quantization for faster inference.
- ONNX Runtime (tested with version 1.19.2)
- TensorRT (tested with version 10.5.0)
- PyCUDA (tested with version 2024.1.2)
- cuda-python (tested with version 12.2.1 - should be the same as installed CUDA version)
- Clone the Repository

  ```bash
  git clone https://github.com/namas191297/efficientdetlite.git
  cd efficientdetlite
  ```

- Set Up a Virtual Environment

  ```bash
  conda create -n efficientdetlite python=3.9
  conda activate efficientdetlite
  ```

- Install Dependencies

  ```bash
  pip install -r requirements.txt
  ```

- Download EfficientDet Lite Models
  - Follow the instructions in the Models section to obtain the required model files.
- FP32 Precision

  ```bash
  python scripts/build_trt_engine.py --model path/to/model.onnx --precision FP32 --output path/to/engine_fp32.trt
  ```

- FP16 Precision

  ```bash
  python scripts/build_trt_engine.py --model path/to/model.onnx --precision FP16 --output path/to/engine_fp16.trt
  ```

- Using FP32 Engine

  ```bash
  python scripts/infer_trt.py --engine path/to/engine_fp32.trt --image path/to/image.jpg
  ```

- Using FP16 Engine

  ```bash
  python scripts/infer_trt.py --engine path/to/engine_fp16.trt --image path/to/image.jpg
  ```
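The `--score_threshold` and `--top_k` flags used by the inference scripts correspond to a simple post-processing step on the raw detections. Here is a hedged NumPy sketch of that step, assuming an automl-style output tensor of shape `(1, N, 7)` with rows `[image_id, ymin, xmin, ymax, xmax, score, class]`; this repo's exact output format may differ.

```python
import numpy as np

def filter_detections(raw: np.ndarray, score_threshold: float = 0.5, top_k: int = 5):
    """Keep at most `top_k` detections whose score meets `score_threshold`.

    `raw` is assumed (not verified against this repo) to be (1, N, 7):
    [image_id, ymin, xmin, ymax, xmax, score, class] per row.
    Returns (boxes, scores, classes) sorted by descending score.
    """
    dets = raw[0]                                  # drop batch dim -> (N, 7)
    dets = dets[dets[:, 5] >= score_threshold]     # score filter
    dets = dets[np.argsort(-dets[:, 5])][:top_k]   # best `top_k` by score
    boxes = dets[:, 1:5]                           # [ymin, xmin, ymax, xmax]
    return boxes, dets[:, 5], dets[:, 6].astype(int)
```

The same filtering applies whether the raw tensor comes from an ONNX Runtime session or a TensorRT executor, which is why both script families expose identical flags.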
- Building TRT Engine from ONNX models

  ```bash
  # Build .engine TRT Engine for EfficientDetLite4 with FP32 precision.
  python build_engine.py --model_type efficientdet_lite4

  # Build .engine TRT Engine for EfficientDetLite4 with FP16 precision.
  python build_engine.py --model_type efficientdet_lite4 --fp16
  ```

- Single Image

  ```bash
  # Inference with ONNX on a single image
  python onnx_inference_image.py --model_type efficientdet_lite1 --image test.jpg --score_threshold 0.5 --top_k 5

  # Inference with TRT Engine on a single image using FP32 precision.
  python trt_inference_image.py --model_type efficientdet_lite1 --image test.jpg --score_threshold 0.5 --top_k 5

  # Inference with TRT Engine on a single image using FP16 precision.
  python trt_inference_image.py --model_type efficientdet_lite1 --image test.jpg --score_threshold 0.5 --top_k 5 --fp16
  ```

- Webcam

  ```bash
  # Inference with ONNX on your webcam
  python onnx_inference_webcam.py --model_type efficientdet_lite1 --score_threshold 0.5 --top_k 5

  # Inference with TRT Engine on your webcam using FP32 precision.
  python trt_inference_webcam.py --model_type efficientdet_lite1 --score_threshold 0.5 --top_k 5

  # Inference with TRT Engine on your webcam using FP16 precision.
  python trt_inference_webcam.py --model_type efficientdet_lite1 --score_threshold 0.5 --top_k 5 --fp16
  ```

- EfficientDet Lite 0
- EfficientDet Lite 1
- EfficientDet Lite 2
- EfficientDet Lite 3
- EfficientDet Lite 4
- Model Files: All models are included in this repo, but you can still download the pre-trained EfficientDet Lite models from the EfficientDetLite Google Drive Repo.
- Place all `.engine` files under `trt_models/`.
- Place all `.onnx` files under `onnx_models/`.
The following table compares the latency (ms) of each EfficientDet Lite model across different backends when running on an NVIDIA RTX 3060.
| Model | ONNX | TensorRT FP32 | TensorRT FP16 |
|---|---|---|---|
| Lite0 | 27 | 27 | 19 |
| Lite1 | 39 | 33 | 23 |
| Lite2 | 54 | 42 | 27 |
| Lite3 | 78 | 54 | 33 |
| Lite4 | 145 | 82 | 46 |
- GPU: NVIDIA RTX 3060
- CUDA Version: 12.2
- TensorRT Version: 10.5.0
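Latency numbers like these are only meaningful if measured consistently across backends. A backend-agnostic way to collect them (a sketch, not the repo's actual benchmarking code) is to warm up first and then take the median of many timed calls:

```python
import time
import statistics

def measure_latency_ms(run_inference, warmup: int = 10, iters: int = 100) -> float:
    """Median wall-clock latency of a single inference call in milliseconds.

    `run_inference` is any zero-argument callable, e.g. a lambda wrapping
    an ONNX Runtime session.run(...) or a TRT executor call.
    """
    for _ in range(warmup):                      # let caches and GPU clocks settle
        run_inference()
    samples = []
    for _ in range(iters):
        t0 = time.perf_counter()
        run_inference()
        samples.append((time.perf_counter() - t0) * 1000.0)
    return statistics.median(samples)
```

For example, `measure_latency_ms(lambda: session.run(None, inputs))` would time an ONNX Runtime session, assuming `session` and `inputs` are already set up; the median is used because it is robust to occasional scheduling spikes.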
Inference using EfficientDetLite 4
The project demonstrates significant improvements in inference speed when utilizing TensorRT, especially with FP16 precision. For the larger models, TensorRT FP16 runs roughly 3x faster than ONNX (e.g. 145 ms down to 46 ms for Lite4), enabling real-time object detection applications.
```
root/
├── onnx_models/              # ONNX model files
├── trt_models/               # TensorRT engine files
├── build_engine.py           # Script to build a TRT engine from ONNX models.
├── trt_engine_builder.py     # TRTEngineBuilder class implementation.
├── trt_executor.py           # TRTExecutor class implementation for inference.
├── trt_config.py             # Contains LABELS for classes and helper dictionary to build and run models.
├── onnx_inference_image.py   # Script to run ONNX inference on a single image.
├── onnx_inference_webcam.py  # Script to run ONNX inference on webcam.
├── trt_inference_image.py    # Script to run TRT inference on a single image.
├── trt_inference_webcam.py   # Script to run TRT inference on webcam.
├── requirements.txt          # Python dependencies
├── README.md                 # This file
└── LICENSE                   # License information
```
- build_engine.py: Builds TensorRT engines from ONNX models with specified precision.
- onnx_inference_image.py: Runs inference using ONNX model on a single image.
- onnx_inference_webcam.py: Runs inference using ONNX model on webcam.
- trt_inference_image.py: Runs inference using TRT engines on a single image.
- trt_inference_webcam.py: Runs inference using TRT engines on webcam.
This project is licensed under the Creative Commons Attribution 3.0.
Email: namas.brd@gmail.com
LinkedIn: Namas Bhandari
Feel free to reach out for any questions, suggestions, or collaborations!

