PDF & Screenshot File Renamer

A powerful CLI tool that automatically renames PDF files and screenshots based on their content using Claude AI. For PDFs, it extracts bibliographic information (author, year, title). For screenshots, it uses either local OCR (Tesseract) or Claude Vision to analyze content and identify applications, dates, and content types. Perfect for organizing academic papers, books, screenshots, and other documents with consistent, meaningful filenames.

Latest Version: 0.2.1 - Now with dual OCR methods for screenshots!

What's New in v0.2.1

Claude Vision Support: Use Claude's vision capabilities directly on screenshots for perfect accuracy
Dual OCR Methods: Choose between Tesseract (free, local) or Claude Vision (accurate, API-based)
Flexible Configuration: Set OCR method via CLI flag or environment variable
No Tesseract Required: When using Claude Vision, no local OCR installation needed

Features

PDF Processing

🤖 AI-Powered Extraction: Uses Claude AI to intelligently extract bibliographic information from PDFs
📚 Smart Name Detection: Handles various naming conventions (e.g., "van Gogh", "O'Brien", "Smith Jr.")
📄 Page Limit Control: Analyzes only the first pages of PDFs for efficiency

Screenshot Processing (NEW!)

🖼️ Dual OCR Methods: Choose between Tesseract (local) or Claude Vision (API)
🔍 Intelligent Analysis: AI identifies applications, dates, content types, and main subjects
📸 Format Support: Handles PNG, JPG, JPEG, BMP, GIF, TIFF, WEBP
🏷️ Smart Categorization: Recognizes emails, chats, errors, websites, documents, etc.

General Features

🎯 Flexible Templates: Fully customizable filename templates for both PDFs and screenshots
📁 Batch Processing: Process entire directories with mixed file types
🔍 Preview Mode: Dry-run option to preview changes before applying them
🛡️ Safe Operation: Automatic handling of duplicate filenames
⚡ Fast Processing: Efficient handling of large directories

Installation

Prerequisites

Python 3.12 or higher
An Anthropic API key for Claude AI
Tesseract OCR (optional, for local screenshot OCR)
- Ubuntu/Debian: sudo apt install tesseract-ocr
- macOS: brew install tesseract
- Windows: Download from GitHub
- Note: Not needed if using Claude Vision for OCR

Install from source

Clone the repository:

git clone https://github.com/ArneJanning/pdf-file-renamer.git
cd pdf-file-renamer

Create a virtual environment (recommended):

python -m venv venv
source venv/bin/activate  # On Windows: venv\Scripts\activate

Install the package:

pip install -e .

Install using uv (recommended)

If you have uv installed:

uv pip install -e .

Quick Start

Set up your API key:

cp .env.example .env
# Edit .env and add your Anthropic API key

Test with dry-run:

pdf-renamer /path/to/directory --dry-run

Rename files:

pdf-renamer /path/to/directory

Configuration

Environment Variables

Create a .env file in your working directory (or copy .env.example):

# Claude API Configuration
ANTHROPIC_API_KEY=your-anthropic-api-key-here

# PDF file naming template
# Variables: {author}, {author_last}, {editor}, {editor_last}, 
# {author_or_editor}, {author_or_editor_last}, {year}, {title}, {subtitle}, {full_title}
# Use {full_title} for title with subtitle properly formatted (recommended)
PDF_FILENAME_TEMPLATE={author_or_editor_last} {year} - {full_title}.pdf

# Screenshot file naming template  
# Variables: {application}, {date}, {time}, {datetime}, {content_type}, {main_subject}
SCREENSHOT_FILENAME_TEMPLATE={datetime} {application} - {main_subject}.png

# Number of pages to extract for analysis (default: 10)
MAX_PAGES_TO_EXTRACT=10

# Claude model to use (default: claude-3-5-sonnet-20241022)
CLAUDE_MODEL=claude-3-5-sonnet-20241022

# OCR method for screenshots: 'tesseract' or 'claude' (default: tesseract)
OCR_METHOD=tesseract

Configuration Options

Option	Description	Default
`ANTHROPIC_API_KEY`	Your Anthropic API key (required)	None
`PDF_FILENAME_TEMPLATE`	Template for renamed PDF files	`{author_or_editor_last} {year} - {title}.pdf`
`SCREENSHOT_FILENAME_TEMPLATE`	Template for renamed screenshots	`{datetime} {application} - {main_subject}.png`
`MAX_PAGES_TO_EXTRACT`	Number of PDF pages to analyze	10
`CLAUDE_MODEL`	Claude model to use	`claude-3-5-sonnet-20241022`
`OCR_METHOD`	OCR method for screenshots: `tesseract` or `claude`	`tesseract`

Usage

Basic Command

pdf-renamer [OPTIONS] DIRECTORY

Options

Option	Short	Description
`--output`	`-o`	Output directory for renamed files (default: input directory)
`--dry-run`	`-n`	Preview changes without renaming files
`--pdf-template`		Override the PDF filename template from .env
`--screenshot-template`		Override the screenshot filename template from .env
`--ocr-method`		OCR method for screenshots: `tesseract` or `claude`
`--help`		Show help message

Command Examples

Preview changes (dry-run):

pdf-renamer ~/Documents/Papers --dry-run

Rename to a different directory:

pdf-renamer ~/Downloads/PDFs --output ~/Documents/Organized

Use custom templates:

# Custom PDF template
pdf-renamer ~/Papers --pdf-template "{author_last}, {author} ({year}) - {title}.pdf"

# Custom screenshot template
pdf-renamer ~/Screenshots --screenshot-template "{application} - {content_type} - {main_subject}.png"

Process current directory:

pdf-renamer .

PDF Templates

Available Variables

Variable	Description	Example
`{author}`	Full author name	"F. Scott Fitzgerald"
`{author_last}`	Author's last name only	"Fitzgerald"
`{editor}`	Full editor name	"John Smith"
`{editor_last}`	Editor's last name only	"Smith"
`{author_or_editor}`	Author or editor (with suffix)	"F. Scott Fitzgerald" or "John Smith (Ed.)"
`{author_or_editor_last}`	Last name of author or editor	"Fitzgerald" or "Smith"
`{year}`	Publication year	"1925"
`{title}`	Main title only	"Red Mafiya"
`{subtitle}`	Subtitle only (if present)	"How the Russian Mob Has Invaded America"
`{full_title}`	Title with subtitle properly formatted	"Red Mafiya. How the Russian Mob Has Invaded America"

Template Examples

Default format (clean and simple, with subtitle support):

{author_or_editor_last} {year} - {full_title}.pdf
→ Fitzgerald 1925 - The Great Gatsby.pdf
→ Friedman 2000 - Red Mafiya. How the Russian Mob Has Invaded America.pdf

Academic citation style:

{author_last}, {author} ({year}). {full_title}.pdf
→ Fitzgerald, F. Scott Fitzgerald (1925). The Great Gatsby.pdf
→ Friedman, Robert I. Friedman (2000). Red Mafiya. How the Russian Mob Has Invaded America.pdf

Library style:

[{year}] {author_or_editor} - {full_title}.pdf
→ [1925] F. Scott Fitzgerald - The Great Gatsby.pdf
→ [2000] Robert I. Friedman - Red Mafiya. How the Russian Mob Has Invaded America.pdf

Title and subtitle separate:

{author_last}-{year}-{title}_{subtitle}.pdf
→ Friedman-2000-Red Mafiya_How the Russian Mob Has Invaded America.pdf

Full information:

{author} ({year}) - {full_title} [{author_last}].pdf
→ F. Scott Fitzgerald (1925) - The Great Gatsby [Fitzgerald].pdf
→ Robert I. Friedman (2000) - Red Mafiya. How the Russian Mob Has Invaded America [Friedman].pdf

Screenshot Templates

Available Variables

Variable	Description	Example
`{application}`	Application/software name	"Chrome", "WhatsApp", "Terminal"
`{date}`	Date from screenshot	"2025-01-15"
`{time}`	Time from screenshot	"14:30"
`{datetime}`	Combined date and time	"2025-01-15 14:30"
`{content_type}`	Type of content	"email", "chat", "error", "website"
`{main_subject}`	AI-determined subject	"Project Meeting Schedule"

Template Examples

Default format (chronological organization):

{datetime} {application} - {main_subject}.png
→ 2025-01-15 14:30 Gmail - Project Meeting Schedule Email.png

Application-based organization:

{application}/{date} - {main_subject}.png
→ Gmail/2025-01-15 - Project Meeting Schedule Email.png

Content type grouping:

{content_type}/{application} - {main_subject}.png
→ email/Gmail - Project Meeting Schedule Email.png

Minimal format:

{date} - {main_subject}.png
→ 2025-01-15 - Project Meeting Schedule Email.png

Detailed format:

{date} {time} - {application} ({content_type}) - {main_subject}.png
→ 2025-01-15 14:30 - Gmail (email) - Project Meeting Schedule Email.png

Examples

Example 1: Organizing Research Papers

# Preview the renaming
pdf-renamer ~/Downloads/papers --dry-run

# Output:
# Processing: quantum_computing_2023.pdf
#   Author/Editor: Alice Johnson
#   Year: 2023
#   Title: Advances in Quantum Computing Algorithms
#   New filename: Johnson 2023 - Advances in Quantum Computing Algorithms.pdf
#   [DRY RUN] Would copy to: ~/Downloads/papers/Johnson 2023 - Advances in Quantum Computing Algorithms.pdf

# Apply the renaming
pdf-renamer ~/Downloads/papers

Example 2: Custom Organization System

# Organize by year with custom template
pdf-renamer ~/Library/PDFs \
  --template "[{year}] {author_last} - {title}.pdf" \
  --output ~/Library/Organized

Example 3: Processing Books

# Books often have editors instead of authors
pdf-renamer ~/Books --pdf-template "{author_or_editor} - {title} ({year}).pdf"

# Output example:
# Original: handbook_of_ai.pdf
# Renamed: Smith (Ed.) - Handbook of Artificial Intelligence (2022).pdf

Example 4: Processing Screenshots

# Process a directory of screenshots
pdf-renamer ~/Screenshots --dry-run

# Output:
# Processing: email_screenshot.png
#   Application: Gmail
#   Date: 2025-01-15
#   Content Type: email
#   Main Subject: Project Meeting Schedule Email
#   New filename: 2025-01-15 1430 Gmail - Project Meeting Schedule Email.png
#   [DRY RUN] Would copy to: ~/Screenshots/2025-01-15 1430 Gmail - Project Meeting Schedule Email.png

Example 5: Mixed Directory (PDFs and Screenshots)

# Process a Downloads folder with both PDFs and screenshots
pdf-renamer ~/Downloads

# Output:
# Found 150 PDF files and 45 screenshot files to process
# Processing PDF files...
# Processing: research_paper.pdf
#   New filename: Johnson 2024 - Machine Learning in Healthcare.pdf
# Processing screenshot files...
# Processing: screenshot_2024.png
#   New filename: 2024-01-20 1030 Terminal - Docker Container Status.png

Example 6: Using Claude Vision for Screenshots

# Use Claude Vision for more accurate screenshot analysis
pdf-renamer ~/Screenshots --ocr-method claude

# Output:
# Processing: error_dialog.png
#   Application: Microsoft Windows
#   Date: 2025-01-16
#   Content Type: error
#   Main Subject: Application Error 0x80070005
#   New filename: 2025-01-16 1145 Microsoft Windows - Application Error 0x80070005.png
# Note: Using Claude Vision - no OCR errors!

API Key Setup

Getting an Anthropic API Key

Sign up at console.anthropic.com
Navigate to API Keys section
Create a new API key
Copy the key (starts with sk-ant-api...)

Setting the API Key

Method 1: Environment file (recommended)

echo "ANTHROPIC_API_KEY=sk-ant-api..." > .env

Method 2: Export in shell

export ANTHROPIC_API_KEY="sk-ant-api..."

Method 3: Pass via environment

ANTHROPIC_API_KEY="sk-ant-api..." pdf-renamer /path/to/pdfs

Advanced Usage

OCR Methods for Screenshots

The tool supports two OCR methods for processing screenshots:

1. Tesseract (Default)

Pros: Fast, free, runs locally, no API costs
Cons: May have OCR errors, requires Tesseract installation
Usage: Default method, or use --ocr-method tesseract

# Using Tesseract (default)
pdf-renamer ~/Screenshots

# Explicitly specify Tesseract
pdf-renamer ~/Screenshots --ocr-method tesseract

2. Claude Vision

Pros: More accurate, no OCR errors, understands visual context, no Tesseract needed
Cons: Uses more API credits (~10x), slightly slower
Usage: Use --ocr-method claude or set OCR_METHOD=claude in .env

# Using Claude Vision for better accuracy
pdf-renamer ~/Screenshots --ocr-method claude

# Set in .env file for permanent configuration
echo "OCR_METHOD=claude" >> .env

Comparison Example:

Tesseract might read: "Mierosoft Windows" (OCR error)
Claude Vision reads: "Microsoft Windows" (accurate)

Recommendation: Use Tesseract for bulk processing to save costs, Claude Vision for important files where accuracy matters.

Handling Special Cases

Multiple authors: Claude will intelligently handle papers with multiple authors, often using "et al." for many authors:

Original: collaborative_research.pdf
Renamed: Smith et al 2023 - Collaborative Research Methods.pdf

Non-English names: The AI correctly handles various naming conventions:

van Gogh → van Gogh (not "Gogh")
O'Brien → O'Brien (not "Brien")
José García → García (not "José")

Missing information: Files with missing data use defaults:

No author: Unknown 2023 - Title.pdf
No year: Author Unknown Year - Title.pdf

Performance Tips

Batch size: Process directories with 100-200 PDFs at a time for best performance
Page extraction: Reduce MAX_PAGES_TO_EXTRACT for faster processing of large PDFs
Model selection: Use claude-3-5-sonnet for best accuracy, or claude-3-haiku for speed

Integration with File Managers

macOS Automator: Create a Quick Action to rename PDFs from Finder Windows: Add to Send To menu for right-click renaming Linux: Create a Nautilus script or KDE Service Menu

Troubleshooting

Common Issues

"ANTHROPIC_API_KEY not found"

Ensure your .env file is in the current directory
Check the API key is correctly formatted
Try exporting the key: export ANTHROPIC_API_KEY="your-key"

"Failed to extract text from PDF"

The PDF might be scanned/image-based
The PDF might be corrupted
Try opening the PDF in a reader to verify it's valid

"Failed to extract text from screenshot"

Ensure Tesseract is installed: which tesseract (only needed for Tesseract OCR)
Try switching to Claude Vision: --ocr-method claude
The image might be corrupted or in an unsupported format
Try a different image format (PNG usually works best)
Check if the image contains readable text

"Failed to extract bibliographic information"

The PDF might not contain clear bibliographic information
Try increasing MAX_PAGES_TO_EXTRACT
The PDF might be in an unsupported language

Rate limiting errors

Add delays between large batches
Reduce concurrent processing
Check your API tier limits

Debug Mode

Run with logging to see detailed information:

# Set logging level
export LOG_LEVEL=DEBUG
pdf-renamer /path/to/pdfs

Development

Project Structure

pdf-file-renamer/
├── file_renamer/
│   ├── __init__.py          # Package initialization
│   ├── __main__.py          # Entry point
│   ├── cli.py               # CLI interface
│   ├── models.py            # Pydantic models
│   ├── ai_extractor.py      # Claude AI integration
│   └── pdf_extractor.py     # PDF and screenshot text extraction
├── tests/                   # Comprehensive test suite
│   ├── conftest.py         # Test fixtures
│   ├── test_models.py      # Model tests
│   ├── test_pdf_extractor.py # PDF extraction tests
│   ├── test_ai_extractor.py # AI integration tests
│   ├── test_cli.py         # CLI tests
│   ├── test_integration.py # Integration tests
│   └── test_performance.py # Performance tests
├── .github/workflows/      # CI/CD workflows
├── .env.example            # Example configuration
├── pyproject.toml          # Project configuration
├── README.md               # This file
├── CHANGELOG.md            # Version history
├── SCREENSHOT_TEST_RESULTS.md # Screenshot functionality test results
├── TEST_SUMMARY.md         # Test suite documentation
└── LICENSE                 # MIT License

Running from Source

# Clone the repository
git clone https://github.com/ArneJanning/pdf-file-renamer.git
cd pdf-file-renamer

# Install in development mode
pip install -e .

# Run directly
python -m file_renamer /path/to/pdfs

Testing

The project includes a comprehensive test suite with 80+ tests covering:

Install test dependencies:

pip install -e ".[test]"

Run all tests:

pytest tests/ -v

Run with coverage:

pytest tests/ --cov=file_renamer --cov-report=term-missing

Run performance tests:

pytest tests/test_performance.py -v

Use the test runner:

python run_tests.py --install

Test categories:

Unit Tests: Individual component functionality
Integration Tests: End-to-end workflows
Performance Tests: Scalability and efficiency
CLI Tests: Command-line interface behavior

See TEST_SUMMARY.md for detailed test documentation.

Contributing

Fork the repository
Create a feature branch: git checkout -b feature-name
Make your changes
Run tests: pytest
Commit: git commit -am 'Add feature'
Push: git push origin feature-name
Create a Pull Request

Adding New Features

To add a new template variable:

Update models.py to add the field
Update the Claude prompt in ai_extractor.py
Add the variable to format_filename()
Update documentation

License

MIT License - see LICENSE file for details.

Acknowledgments

Built with PydanticAI for structured AI interactions
Powered by Claude from Anthropic for content analysis
PDF processing via pypdf
OCR processing via pytesseract and Tesseract OCR (optional)
Claude Vision API for direct image analysis (alternative to OCR)
Image handling with Pillow
CLI interface using Click

Support

Issues: GitHub Issues
Discussions: GitHub Discussions
Email: your-email@example.com

Made with ❤️ by Arne Janning

Name		Name	Last commit message	Last commit date
Latest commit History 14 Commits
.github/workflows		.github/workflows
file_renamer		file_renamer
tests		tests
.env.example		.env.example
.gitignore		.gitignore
.python-version		.python-version
CHANGELOG.md		CHANGELOG.md
CLAUDE.md		CLAUDE.md
LICENSE		LICENSE
README.md		README.md
SCREENSHOT_TEST_RESULTS.md		SCREENSHOT_TEST_RESULTS.md
TEST_SUMMARY.md		TEST_SUMMARY.md
pyproject.toml		pyproject.toml
run_tests.py		run_tests.py
uv.lock		uv.lock

Folders and files

Latest commit

History

Repository files navigation

PDF & Screenshot File Renamer

What's New in v0.2.1

Features

PDF Processing

Screenshot Processing (NEW!)

General Features

Table of Contents

Installation

Prerequisites

Install from source

Install using uv (recommended)

Quick Start

Configuration

Environment Variables

Configuration Options

Usage

Basic Command

Options

Command Examples

PDF Templates

Available Variables

Template Examples

Screenshot Templates

Available Variables

Template Examples

Examples

Example 1: Organizing Research Papers

Example 2: Custom Organization System

Example 3: Processing Books

Example 4: Processing Screenshots

Example 5: Mixed Directory (PDFs and Screenshots)

Example 6: Using Claude Vision for Screenshots

API Key Setup

Getting an Anthropic API Key

Setting the API Key

Advanced Usage

OCR Methods for Screenshots

1. Tesseract (Default)

2. Claude Vision

Handling Special Cases

Performance Tips

Integration with File Managers

Troubleshooting

Common Issues

Debug Mode

Development

Project Structure

Running from Source

Testing

Contributing

Adding New Features

License

Acknowledgments

Support

About

Resources

License

Uh oh!

Stars

Watchers

Forks

Releases

Packages 0

Uh oh!

Contributors

Uh oh!

Languages

Packages