📚 LLM Flashcard Generator

Generate high-quality question-answer flashcards from your study materials using the power of Large Language Models (LLMs) and Structured JSON Outputs.

🌟 Overview

LLM Flashcard Generator is a production-grade Streamlit web app that transforms any educational content—be it textbook excerpts, lecture notes, or even a single word—into effective, editable flashcards. Powered by Google Gemini 2.0 Flash and Pydantic for rigorous data validation, it guarantees perfectly formatted flashcards every time.

Designed for students, educators, and lifelong learners who want to supercharge their revision and teaching.

💡 Motivation

Traditional flashcard creation is time-consuming and repetitive. With the rise of LLMs, we can automate this process. This application utilizes modern LLM techniques like Structured Outputs to ensure high-quality, diverse, and context-aware Q&A pairs that are always parsable and formatted perfectly.

🚀 Features

Flexible Input: Upload .pdf, .docx, .txt files or paste any text directly.
Subject Selection: Guide flashcard style by subject (Biology, History, etc.).
Structured LLM Outputs: Uses the official google-genai SDK and Pydantic schema validation to guarantee 100% accurate JSON responses. No brittle string parsing.
Robust: Works even with a single word as input, generating creative context.
Review & Edit: Interactive, dynamic data table for editing and refining flashcards.
Export: Download as CSV or JSON for seamless import into Anki, Quizlet, etc.
Containerized: Fully Dockerized for instant, consistent deployment anywhere.

🛠️ Tech Stack & Architecture

Frontend/UI: Streamlit
Backend/LLM: Google GenAI SDK
Data Validation: Pydantic
Document Parsing: PyPDF2, python-docx
Deployment: Docker

📁 File Structure

Phoenix/
├── Dockerfile                  # Containerization config
├── requirements.txt            # Python dependencies
├── .env                        # Environment variables (your API key)
├── README.md                   # This file
├── src/
│   ├── app.py                  # Main Streamlit UI entrypoint
│   ├── models/
│   │   └── flashcard.py        # Pydantic schemas for data validation
│   ├── services/
│   │   └── llm_service.py      # Google GenAI API integration
│   └── utils/
│       └── document_parser.py  # PDF, DOCX, TXT extraction logic
└── tests/                      # Unit testing directory

⚡ Quick Start (Local Setup)

Clone the repository:

git clone https://github.com/techwallahexplorer/Phoenix.git
cd Phoenix

Install dependencies:
```
pip install -r requirements.txt
```
Set up your API key: Create a file named .env in the project root:
```
GEMINI_API_KEY=YOUR_GEMINI_API_KEY
```
(Get your Gemini API key from Google AI Studio)
Run the app locally:
```
streamlit run src/app.py
```
The app will open in your browser at http://localhost:8501

🐳 Quick Start (Docker)

For the most reliable experience without managing Python environments, run the app using Docker.

Build the image:
```
docker build -t llm-flashcards .
```

Run the container:

docker run -p 8501:8501 -e GEMINI_API_KEY=your_api_key_here llm-flashcards

Access the app at http://localhost:8501.

🌐 Deployment to Streamlit Cloud

Fork/Clone this repository to your GitHub account.
Go to streamlit.io/cloud and connect your GitHub repo.
Set the Main file path to src/app.py.
In the Streamlit Cloud advanced settings, add your GEMINI_API_KEY to the Secrets manager.
Click Deploy.

📝 Sample Output

(Note: The UI includes dynamic data tables, expandable preview cards, and one-click JSON/CSV exports.)

❓ FAQ

Q: What file types are supported?
A: You can upload .pdf, .docx, .txt files or paste any raw text.

Q: Why use Pydantic and Structured Outputs?
A: Earlier versions relied on regex and string matching to parse LLM text (e.g., looking for "Q:" and "A:"). By using Pydantic with the Google GenAI SDK's response_schema, the LLM is forced to return valid JSON. This eliminates parsing errors and hallucinations.

Q: Is my data private?
A: If running locally via Docker or Python, all processing stays on your machine (sent directly to the Google API).

🤝 Contributing

For major changes, please open an issue first to discuss what you would like to change. Pull requests implementing direct Anki (.apkg) export or RAG for massive textbooks are welcome!

📫 Contact

Thank you for using LLM Flashcard Generator! If you have any questions or need support, please open an issue or email urjagjeetsingh@gmail.com.

Name		Name	Last commit message	Last commit date
Latest commit History 11 Commits
.github/workflows		.github/workflows
__pycache__		__pycache__
src		src
tests		tests
.env		.env
.gitignore		.gitignore
Dockerfile		Dockerfile
README.md		README.md
image.png		image.png
requirements.txt		requirements.txt

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

📚 LLM Flashcard Generator

🌟 Overview

💡 Motivation

🚀 Features

🛠️ Tech Stack & Architecture

📁 File Structure

⚡ Quick Start (Local Setup)

🐳 Quick Start (Docker)

🌐 Deployment to Streamlit Cloud

📝 Sample Output

❓ FAQ

🤝 Contributing

📫 Contact

About

Uh oh!

Releases

Packages

Uh oh!

Contributors

Uh oh!

Languages

Folders and files

Latest commit

History

Repository files navigation

📚 LLM Flashcard Generator

🌟 Overview

💡 Motivation

🚀 Features

🛠️ Tech Stack & Architecture

📁 File Structure

⚡ Quick Start (Local Setup)

🐳 Quick Start (Docker)

🌐 Deployment to Streamlit Cloud

📝 Sample Output

❓ FAQ

🤝 Contributing

📫 Contact

About

Resources

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages