Computer-Vision-2026 · nihermann · May 21, 2026 · May 20, 2026
diff --git a/index.html b/index.html
@@ -273,6 +273,78 @@ <h3>Smart Event Detection for Highlight Clips</h3>
               </label>
             </div>
           </article>
+
+
+
+
+
+          <article class="project-card">
+            <div class="teaser" role="img" aria-label="Open-vocabulary tracking project.">
+              <img src="assets/group_X.png" alt="Two segmented puppies in a park" style="position:absolute; inset:0; width:100%; height:100%; object-fit:cover; z-index:2;">
+              <span class="teaser-label" style="z-index:3;">Group X</span>
+            </div>
+            <div class="project-content">
+              <p class="project-meta">Object detection, segmentation, tracking, vision-language models</p>
+              <h3>Open-Vocabulary Object Tracking with Grounding DINO, SAM 2 and CLIP</h3>
+              <p class="project-abstract">
+                We present an open-vocabulary object tracking system that enables users to search, segment, and track arbitrary objects in images and videos using natural language queries.
+                <br><br>
+                Our pipeline combines Grounding DINO for text-conditioned object detection, CLIP for semantic verification, and SAM 2 for segmentation and temporal tracking.
+                <br><br>
+                The system supports interactive querying through a Gradio web interface and demonstrates how modern vision foundation models can be integrated into a unified visual understanding pipeline.
+              </p>
+              <label class="project-toggle-label">
+                <input class="project-toggle" type="checkbox" aria-label="Toggle full project pitch">
+                <span class="project-toggle-more">Read more</span>
+                <span class="project-toggle-less">Show less</span>
+              </label>
+            </div>
+          </article>
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
+
 
           <article class="project-card">
             <div class="teaser" role="img" aria-label="Image retrieval with CLIP.">