Skip to content
This repository was archived by the owner on Sep 5, 2025. It is now read-only.

Shadraze/V-RAI

Repository files navigation

V-RAI (VR AI Assistant)

V-RAI is my 7th semester internship project. The project takes input from a VR headset's microphone, synthesizes it with OpenAI Whisper, generates text based on user input using GPT3.5 through the OpenAI API, and finally vocodes it using Tactotron2. This constitutes the backend of the project.

The frontend of the project is in Unity, more specifically you want to use Unity 2022 to compatibility. The Meta Quest 2 is also a requirement as the project was made with that headset in mind. A 3D model of an android is displayed as a 'virtual assistant' to the user.

Through specific prompt manipulation of prompts during TextToText generation (V-RAI-Backend/AppFiles/AI/Generation/inital.py), the AI is able to respond in JSON to trigger some basic tasks in Unity through function calls specified in the JSON.

The backend of the project can still run without a frontend through Websockets.

Note: The OpenAI API key in V-RAI-Backend/AppFiles/AI/Generation/TextToText.py is invalid. In order for the project to work, an OpenAI API key is needed! OpenAI provides free trial of their API to new accounts, you may use that for testing purposes. Or may replace the model with a local LLM.

About

VR AI Assistant, uses OpenAI's pretrained Whisper, GPT3.5 Public API, Speechbrain's pretrained Tacotron2 and Hifigan. Consists of a frontend and backend with the frontend on VR and made with Unity Engine.

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

 
 
 

Contributors

Languages