diff --git a/assets/group_Q.png b/assets/group_Q.png
new file mode 100644
index 0000000..cf45be5
Binary files /dev/null and b/assets/group_Q.png differ
diff --git a/index.html b/index.html
index 0574491..69d291b 100644
--- a/index.html
+++ b/index.html
@@ -273,6 +273,27 @@

Smart Event Detection for Highlight Clips

+ +
+ +
+

CLIP, FAISS, patch-level matching, image retrieval

+

Image retrieval with CLIP

+

+ Got a clue? ImageDetective can find the picture. Describe what you’re looking for, upload an image, or search by visual details. ImageDetective connects text and images through foundation models, combining global semantic search with patch-level matching for smarter and more explainable retrieval. +

+ We propose a vision-language retrieval system that supports bidirectional search between images and natural language. Using CLIP, we map both modalities into a shared embedding space, so that images can be retrieved through a FAISS index. If retrieval accuracy needs to improve, a further step is to use SAM3 for text-guided semantic segmentation, which enables patch-level matching. +

+ +
+
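
The shared-embedding retrieval described in the card can be sketched in a few lines. This is only an illustration under stated assumptions: the three-dimensional vectors stand in for CLIP embeddings, the file names are hypothetical, and the brute-force cosine search stands in for a FAISS index (a flat inner-product index over normalized vectors would return the same ranking). None of these names come from the project itself.

```python
import math


def normalize(vec):
    """Scale a vector to unit length so a dot product equals cosine similarity."""
    norm = math.sqrt(sum(x * x for x in vec))
    if norm == 0.0:
        raise ValueError("cannot normalize a zero vector")
    return [x / norm for x in vec]


def build_index(embeddings):
    """Store unit-length embeddings keyed by item name (stand-in for a FAISS index)."""
    return {name: normalize(vec) for name, vec in embeddings.items()}


def search(index, query, k=3):
    """Return the k items whose embeddings are most similar to the query."""
    q = normalize(query)
    scored = [
        (sum(a * b for a, b in zip(q, vec)), name)
        for name, vec in index.items()
    ]
    scored.sort(key=lambda pair: pair[0], reverse=True)
    return [(name, score) for score, name in scored[:k]]


# Toy image embeddings (assumed values, not real CLIP outputs).
image_embeddings = {
    "dog.jpg": [0.9, 0.1, 0.0],
    "cat.jpg": [0.1, 0.9, 0.0],
    "car.jpg": [0.0, 0.1, 0.9],
}
index = build_index(image_embeddings)

# A text query embedded into the same space (assumed values).
text_query = [0.8, 0.2, 0.1]
results = search(index, text_query, k=2)
for name, score in results:
    print(f"{name}: {score:.3f}")
```

Because both modalities live in one space, the reverse direction (image to text) would use the same `search` function with an index of caption embeddings and an image embedding as the query.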