Skip to content
Merged
Show file tree
Hide file tree
Changes from all commits
Commits
File filter

Filter by extension

Filter by extension

Conversations
Failed to load comments.
Loading
Jump to
Jump to file
Failed to load files.
Loading
Diff view
Diff view
Binary file added assets/group_F.png
Loading
Sorry, something went wrong. Reload?
Sorry, we cannot display this file.
Sorry, this file is invalid so it cannot be displayed.
27 changes: 26 additions & 1 deletion index.html
Original file line number Diff line number Diff line change
Expand Up @@ -304,7 +304,32 @@ <h3>Open-Vocabulary Object Tracking with Grounding DINO, SAM 2 and CLIP</h3>




<article class="project-card">
<div class="teaser" role="img" aria-label="Depth-aware augmented reality music interface showing virtual instruments placed on detected floor regions.">
<img src="assets/group_F.png" alt="Air Instrument depth-aware AR music interface preview" style="position:absolute; inset:0; width:100%; height:100%; object-fit:cover; z-index:2;">
<span class="teaser-label" style="z-index:3;">Group F</span>
</div>
<div class="project-content">
<p class="project-meta">Monocular depth estimation, hand tracking, augmented reality, human-computer interaction</p>
<h3>Air Instrument: Depth-Aware Virtual Music Placement</h3>
<p class="project-abstract">
Air Instrument explores how a normal webcam can turn a room into an interactive musical stage. The system first
estimates scene depth using Depth Anything V2, detects candidate floor or surface regions, and lets users place
virtual instruments into available 3D space through hand gestures. Once instruments are placed, a playing mode uses
MediaPipe hand tracking to control expressive parameters such as pitch and volume without touching any physical
device.
<br><br>
The project combines monocular depth estimation, spatial reasoning, gesture recognition, and augmented reality
rendering into a live demo. Our goal is to study how depth-aware scene understanding can support natural interaction:
where can an object be placed, how large should it appear, and how can the user control it through movement?
</p>
<label class="project-toggle-label">
<input class="project-toggle" type="checkbox" aria-label="Toggle full project pitch">
<span class="project-toggle-more">Read more</span>
<span class="project-toggle-less">Show less</span>
</label>
</div>
</article>



Expand Down