This project trains a machine learning model to provide real-time feedback on text to help users write more concise, "TLDR-style" summaries. As a user types into a textbox, words or phrases that the model considers "fluff" or non-essential are highlighted in red.
- Data Preprocessing: The project uses the `trl-lib/tldr` dataset. A one-time preprocessing script (`src/preprocess_data.py`) runs first. It uses `spaCy` to analyze each post and its summary, identifying key phrases (such as noun chunks) in the post that are semantically similar to the summary; these are considered "essential". This process generates a new labeled dataset saved as a CSV.
- Modeling: The problem is framed as a token classification task. We fine-tune a `DistilBERT` model on the preprocessed data to classify each word (token) of the input text as either `ESSENTIAL` or `FLUFF`.
- Application: A FastAPI backend serves the fine-tuned model. A simple HTML/JS frontend sends the user's text to the backend as they type. The backend returns chunks of the original text labeled `FLUFF` or `ESSENTIAL`, which the frontend then uses to apply highlighting.
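To make the preprocessing idea concrete, here is a minimal, dependency-free sketch of the labeling step. The actual `src/preprocess_data.py` uses spaCy noun chunks and vector similarity; this sketch approximates "semantically similar to the summary" with simple word overlap so it runs anywhere. All names and the threshold are illustrative, not the script's actual API.

```python
# Sketch of the phrase-labeling idea behind src/preprocess_data.py.
# The real script uses spaCy noun chunks + vector similarity; we approximate
# similarity with word overlap here so the example has no dependencies.

def overlap_score(phrase: str, summary: str) -> float:
    """Fraction of the phrase's words that also appear in the summary."""
    phrase_words = set(phrase.lower().split())
    summary_words = set(summary.lower().split())
    if not phrase_words:
        return 0.0
    return len(phrase_words & summary_words) / len(phrase_words)

def label_phrases(phrases, summary, threshold=0.5):
    """Label each candidate phrase ESSENTIAL or FLUFF by overlap with the summary."""
    return [
        (p, "ESSENTIAL" if overlap_score(p, summary) >= threshold else "FLUFF")
        for p in phrases
    ]

summary = "my cat destroyed the couch"
phrases = ["my cat", "a long story", "the couch"]
print(label_phrases(phrases, summary))
```

In the real pipeline, spaCy's vector-based similarity would also catch paraphrases ("sofa" vs. "couch") that plain word overlap misses.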
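One detail worth knowing about the token classification setup: DistilBERT's tokenizer splits words into subword pieces, so the word-level `ESSENTIAL`/`FLUFF` labels must be propagated to every subword token, with special tokens masked out of the loss. The sketch below shows that alignment; the `word_ids` mapping is what Hugging Face fast tokenizers provide, mocked here so the example runs without downloading a model.

```python
# Sketch of word-to-subword label alignment for token classification.
# Fast tokenizers expose a word_ids() mapping per token; we mock it here.

LABEL2ID = {"ESSENTIAL": 0, "FLUFF": 1}

def align_labels(word_labels, word_ids):
    """Propagate word-level labels to subword tokens.

    word_ids maps each token to its source word index (None = special token).
    Special tokens get -100 so the cross-entropy loss ignores them.
    """
    return [
        -100 if wi is None else LABEL2ID[word_labels[wi]]
        for wi in word_ids
    ]

word_labels = ["ESSENTIAL", "FLUFF", "FLUFF"]
# e.g. tokens: [CLS] essen ##tial some word [SEP]
word_ids = [None, 0, 0, 1, 2, None]
print(align_labels(word_labels, word_ids))  # [-100, 0, 0, 1, 1, -100]
```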
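On the application side, the backend has to turn per-token predictions into the labeled chunks the frontend highlights. A plausible sketch of that post-processing step, assuming a simple merge of consecutive same-label tokens (function and variable names are hypothetical, not the backend's actual API):

```python
# Hypothetical post-processing step: collapse consecutive tokens that share a
# label into (text, label) chunks for the frontend to highlight.

def merge_chunks(tokens, labels):
    """Merge consecutive tokens with the same label into (text, label) chunks."""
    chunks = []
    for tok, lab in zip(tokens, labels):
        if chunks and chunks[-1][1] == lab:
            chunks[-1] = (chunks[-1][0] + " " + tok, lab)
        else:
            chunks.append((tok, lab))
    return chunks

tokens = ["So", "basically", "my", "cat", "ate", "the", "couch"]
labels = ["FLUFF", "FLUFF", "ESSENTIAL", "ESSENTIAL", "ESSENTIAL", "ESSENTIAL", "ESSENTIAL"]
print(merge_chunks(tokens, labels))
```

Returning merged chunks rather than per-token labels keeps the JSON payload small and lets the frontend wrap each chunk in a single highlighted span.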
This is CRITICAL. The following must be installed on your host machine (your computer, not in Docker):
- An NVIDIA GPU.
- The latest NVIDIA Drivers for your GPU and OS.
- The NVIDIA Container Toolkit, which allows Docker to access your GPU. Follow the official installation guide for your Linux distribution.
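With the NVIDIA Container Toolkit installed, a Compose service requests the GPU through the `deploy.resources.reservations.devices` key. The snippet below is an illustrative fragment (service name and build context are hypothetical), not this project's actual `docker-compose.yml`:

```yaml
# Illustrative Compose fragment showing GPU access via the NVIDIA runtime.
services:
  backend:
    build: .
    deploy:
      resources:
        reservations:
          devices:
            - driver: nvidia
              count: 1
              capabilities: [gpu]
```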
- Build and Start the Services:
Open your terminal in the project root and run:
docker-compose up --build
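Before starting the services, you can sanity-check that Docker can actually see the GPU. A common check (assuming the NVIDIA Container Toolkit is set up correctly) is:

```shell
# Should print your GPU's nvidia-smi table; if it errors, fix the
# driver / Container Toolkit installation before running docker-compose.
docker run --rm --gpus all ubuntu nvidia-smi
```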