Air Instrument: Depth-Aware Virtual Music Placement
+
+ Air Instrument explores how a normal webcam can turn a room into an interactive musical stage. The system first
+ estimates scene depth using Depth Anything V2, detects candidate floor or surface regions, and lets users place
+ virtual instruments into available 3D space through hand gestures. Once instruments are placed, a playing mode uses
+ MediaPipe hand tracking to control expressive parameters such as pitch and volume without touching any physical
+ device.
+
+ The project combines monocular depth estimation, spatial reasoning, gesture recognition, and augmented reality
+ rendering into a live demo. Our goal is to study how depth-aware scene understanding can support natural interaction:
+ where can an object be placed, how large should it appear, and how can the user control it through movement?
+
+