diff --git a/assets/group_F.png b/assets/group_F.png
new file mode 100644
index 0000000..df71681
Binary files /dev/null and b/assets/group_F.png differ
diff --git a/index.html b/index.html
index 7ceb546..8e3895c 100644
--- a/index.html
+++ b/index.html
@@ -304,7 +304,32 @@

Open-Vocabulary Object Tracking with Grounding DINO, SAM 2 and CLIP

- +
+ +
+

Monocular depth estimation, hand tracking, augmented reality, human-computer interaction

+

Air Instrument: Depth-Aware Virtual Music Placement

+

+ Air Instrument explores how a normal webcam can turn a room into an interactive musical stage. The system first
+ estimates scene depth using Depth Anything V2, detects candidate floor or surface regions, and lets users place
+ virtual instruments into available 3D space through hand gestures. Once instruments are placed, a playing mode uses
+ MediaPipe hand tracking to control expressive parameters such as pitch and volume without touching any physical
+ device.
+
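
The playing mode described above maps hand position to pitch and volume. The following is a minimal sketch of one such mapping, not the project's actual code. It assumes the hand tracker reports a fingertip position as normalized image coordinates in [0, 1] with the origin at the top-left, and it assumes a theremin-style layout (horizontal position sets pitch, height sets volume). The function names and the note range are hypothetical.

```python
from typing import Tuple


def clamp(value: float, low: float = 0.0, high: float = 1.0) -> float:
    """Restrict value to the closed interval [low, high]."""
    return max(low, min(high, value))


def midi_to_hz(note: float) -> float:
    """Convert a (possibly fractional) MIDI note number to a frequency in Hz.

    Uses 12-tone equal temperament with A4 = MIDI 69 = 440 Hz.
    """
    return 440.0 * 2.0 ** ((note - 69.0) / 12.0)


def hand_to_sound(
    x: float,
    y: float,
    low_note: float = 48.0,   # hypothetical lower bound (C3)
    high_note: float = 72.0,  # hypothetical upper bound (C5)
) -> Tuple[float, float]:
    """Map a normalized hand position to (frequency_hz, volume).

    Assumption: x and y are normalized image coordinates, origin top-left.
    Moving right raises the pitch; raising the hand (smaller y) raises the volume.
    """
    x = clamp(x)
    y = clamp(y)
    note = low_note + x * (high_note - low_note)
    volume = 1.0 - y
    return midi_to_hz(note), volume
```

Interpolating in MIDI note space rather than in Hz keeps equal hand movements equal in musical interval, which tends to feel more natural than a linear frequency sweep.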

+ The project combines monocular depth estimation, spatial reasoning, gesture recognition, and augmented reality
+ rendering into a live demo. Our goal is to study how depth-aware scene understanding can support natural interaction:
+ where can an object be placed, how large should it appear, and how can the user control it through movement?
+
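
The "how large should it appear" question can be illustrated with a pinhole camera model, where on-screen size shrinks in proportion to distance. This is a sketch under stated assumptions, not the project's method. It assumes a depth value in meters is available at the placement point (a relative depth map would first need a scale calibration) and that the camera focal length in pixels is known. The function name and parameters are hypothetical.

```python
def apparent_size_px(real_size_m: float, depth_m: float, focal_px: float) -> float:
    """Return the on-screen size in pixels of an object under a pinhole model.

    real_size_m: physical size of the virtual instrument (assumed, in meters)
    depth_m:     distance from camera to the placement point (assumed metric)
    focal_px:    camera focal length expressed in pixels (assumed known)
    """
    if depth_m <= 0:
        raise ValueError("depth_m must be positive")
    # Similar triangles: size_px / focal_px = real_size_m / depth_m
    return focal_px * real_size_m / depth_m
```

Under these assumptions, a 0.5 m instrument placed 2 m away with an 800 px focal length would be drawn 200 px wide, and doubling the distance halves that size.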

+ +
+