YouTube RAG Pipeline

A multimodal Retrieval-Augmented Generation (RAG) system that lets you ask questions about a YouTube video in natural language and answers from the video's frames and captions.

Features

Multimodal Analysis: Processes both visual frames and textual captions from YouTube videos
Precise Timestamps: Generates accurate time references for all answers with ±20-second context windows
Visual Evidence: Displays relevant video frames that support generated responses
Conversational Interface: Natural language querying powered by Google's Gemini AI
Efficient Indexing: Leverages Qdrant vector database for high-performance content retrieval
Responsive UI: Clean, modern Streamlit interface with real-time progress indicators
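
The ±20-second context window mentioned above can be illustrated with a small sketch. This is not the project's actual code: the names Caption, context_window, captions_in_window and format_timestamp are hypothetical, and the only value taken from this README is the 20-second padding. It shows one plausible way to clamp a window to the video's length and collect the captions that fall inside it.

```python
from dataclasses import dataclass

WINDOW_SECONDS = 20  # padding on each side, per the feature list above


@dataclass
class Caption:
    """Hypothetical caption record; the real project may store captions differently."""
    start: float  # seconds from the beginning of the video
    text: str


def context_window(timestamp, duration, pad=WINDOW_SECONDS):
    """Return (start, end) around timestamp, clamped to [0, duration]."""
    start = max(0.0, timestamp - pad)
    end = min(duration, timestamp + pad)
    return start, end


def captions_in_window(captions, timestamp, duration):
    """Keep only the captions whose start time lies inside the window."""
    start, end = context_window(timestamp, duration)
    return [c for c in captions if start <= c.start <= end]


def format_timestamp(seconds):
    """Render a second count as MM:SS for display next to an answer."""
    minutes, secs = divmod(int(seconds), 60)
    return f"{minutes:02d}:{secs:02d}"
```

With a 300-second video, a hit at 10 seconds yields a window starting at 0 rather than at a negative time, and a hit at 290 seconds ends at 300 rather than 310.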

Usage

Create a virtual environment and activate it
Clone the repo with git clone
Install the requirements: pip install -r requirements.txt
Configure your Gemini API key in config/config.yaml
Start the app: streamlit run app.py
Before submitting a YouTube URL, clear the DB index with the sidebar button in the UI
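
Before launching the app it can help to confirm that the API key is actually set. The sketch below is an assumption-heavy illustration, not the project's loader: the field name gemini_api_key is hypothetical (check config.yaml.example for the real key name), and load_config assumes PyYAML is available in the environment. Only the path config/config.yaml comes from the steps above.

```python
from pathlib import Path

CONFIG_PATH = Path("config/config.yaml")  # path named in the usage steps
API_KEY_FIELD = "gemini_api_key"  # hypothetical field name, not confirmed by the repo


def load_config(path=CONFIG_PATH):
    """Read the YAML config into a dict (assumes PyYAML is installed)."""
    import yaml  # imported lazily so the validation helper works without it

    with open(path, encoding="utf-8") as fh:
        return yaml.safe_load(fh) or {}


def get_api_key(config):
    """Return the trimmed API key, or raise if it is missing or blank."""
    value = config.get(API_KEY_FIELD)
    if not isinstance(value, str) or not value.strip():
        raise ValueError(f"Set '{API_KEY_FIELD}' in {CONFIG_PATH} before starting the app")
    return value.strip()
```

Calling get_api_key(load_config()) once at startup would surface a missing key as a clear error instead of a failed request later on.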
