Vision Guide

The Vision Guide is a revolutionary project aimed at providing assistance through voice commands, with a primary focus on aiding users in obtaining information and recognizing objects through images.

Features

Voice-Based Question Answering

Users can initiate conversations by speaking commands such as "Go to GPT".
Voice input is converted to text using a speech-to-text module.
The text is passed to Gemini, our question-answering system.
Gemini processes the query and generates a response.
The response is converted back to speech using a text-to-speech module and communicated to the user.

Image Recognition

Users can activate image recognition by speaking commands like "Scan photo".
The camera module opens to capture an image when prompted by the user.
The captured image is analyzed by Gemini for recognition.
Gemini analyzes the images and provides descriptive narratives of their content.
The description is converted to speech and communicated to the user.

WorkFlow

Technologies Used

Speech-to-text module for converting voice input to text.
Text-to-speech module for converting responses to speech.
Gemini for question answering and image recognition.
Camera module for capturing images.

Usage

Clone the repository.
Install the necessary dependencies.
Run the main application.
Use voice commands to interact with the system.

Name		Name	Last commit message	Last commit date
Latest commit History 18 Commits
static		static
templates		templates
Procfile		Procfile
README.md		README.md
app.py		app.py
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Vision Guide

Features

Voice-Based Question Answering

Image Recognition

WorkFlow

Technologies Used

Usage

About

Uh oh!

Releases

Packages

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

Vision Guide

Features

Voice-Based Question Answering

Image Recognition

WorkFlow

Technologies Used

Usage

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Uh oh!

Contributors

Uh oh!

Languages

Packages