Multimodal AI application with TTS, STT, depth detection, and object detection to assist people with physical disabilities
Updated Aug 26, 2025 - Python
End-to-end computer vision projects covering detection, tracking, segmentation, generation, depth estimation, and 3D face morphing, built with YOLO, MediaPipe, SegFormer, and diffusion models.