Welcome to PDF-RAG-Application! This project provides a simple and efficient framework for Retrieval-Augmented Generation (RAG) using PDF documents. Harness the power of Large Language Models with your own PDF knowledge baseβsearch, query, and analyze PDF content interactively.
PDF-RAG-Application enables users to upload and query PDF files using state-of-the-art embedding models (via SentenceTransformers) and fast similarity search with FAISS. The application features an intuitive Gradio interface for seamless user interaction.
- PDF Upload & Parsing: Easily upload and process PDF documents.
- Semantic Search: Powerful vector search using FAISS and sentence-transformer embeddings.
- Interactive Interface: User-friendly Gradio UI for uploading PDFs and querying contents.
- Asynchronous Processing: Efficient handling of large documents.
- Secure & Lightweight: Minimal dependencies, runs locally.
-
Clone the Repository
git clone https://github.com/your-username/PDF-RAG-Application.git cd PDF-RAG-Application -
Create a Virtual Environment (Optional but Recommended)
python3 -m venv venv source venv/bin/activate # On Windows: venv\Scripts\activate
-
Install Dependencies
pip install -r requirements.txt
Note: Main dependencies are
gradio,sentence-transformers,faiss-cpu,PyMuPDF, andnumpy.
-
Start the Application
python app.py
-
Interact via Web Browser
- Open the Gradio link displayed in your terminal.
- Upload your PDF files.
- Ask questions or search for content within your PDFs.
Contributions are welcome! To get started:
- Fork the repository.
- Create your feature branch:
git checkout -b feature/YourFeature
- Commit your changes:
git commit -m "Add YourFeature" - Push to the branch:
git push origin feature/YourFeature
- Open a Pull Request.
This project is licensed under the MIT License.
Enjoy using PDF-RAG-Application! If you find this project useful, please β star the repo and share your feedback.
For questions or suggestions, feel free to open an issue or contact the maintainer.
This project is licensed under the MIT License.
π GitHub Repo: https://github.com/selvaganesh19/PDF-RAG-Application