Contributing to File Organizer

Development Environment Setup

Prerequisites

Python 3.11+ (we test against 3.11, 3.12)
Ollama (for AI model inference)

Python Version Management (pyenv)

Our CI tests against Python 3.11 and 3.12. To catch version-specific issues locally before pushing, install all target versions via pyenv:

# Install pyenv
brew install pyenv          # macOS
curl https://pyenv.run | bash  # Linux

# Install CI-targeted Python versions
pyenv install 3.11.11
pyenv install 3.12.8

Install the Package

# Create a virtual environment
python3 -m venv .venv
source .venv/bin/activate

# Install with development dependencies
pip install -e ".[dev]"

# Verify
file-organizer --version

Adding Dev Dependencies

This project uses uv for local dependency management but CI installs via pip install -e ".[dev,search]", which reads from [project.optional-dependencies] in pyproject.toml. Always use --optional dev (not --dev) when adding test/dev packages so they land in the right section:

uv add --optional dev <package>

Using uv add --dev <package> writes to [dependency-groups] instead, which pip does not read — the package installs locally but CI fails with ModuleNotFoundError.

Pre-Commit Hooks (Automatic Validation)

This project uses automated pre-commit validation to catch common issues before commits:

# Install git pre-commit hooks (one-time setup after first time)
pre-commit install

# Now on every commit, these hooks automatically run:

Pre-Commit Hooks (run automatically on git commit):

Hook	Purpose	Catches
ruff-check	Python linting (strict, includes RUF100)	Style, imports, stale `# noqa` comments
codespell	Spelling consistency	Typos, spelling inconsistencies
absolute-path-check	Hardcoded absolute paths	`/Users/`, `/home/`, `C:\Users\` paths
pytest (multiple)	CI guardrails, web UI, websocket tests	Test failures block commit

Pre-PR Orchestration (via bash .claude/scripts/pre-commit-validation.sh):

Layer	Canonical owner	Purpose
Staged-file guardrails	`.pre-commit-config.yaml`	Mechanical checks, focused pytest gates, and changed-file validation
Semantic guardrails	`tests/ci/`	Behavior, contract, and review-regression checks
CI runtime support	`.github/workflows/ci.yml`	Permissions and environment required by CI-only guardrails
Pre-PR orchestration	`.claude/scripts/pre-commit-validation.sh`	Runs the enforced layers above before push/PR
Reference guidance	anti-pattern docs under `memory/` and `.claude/rules/`	Explains why a guard exists; not a blocking policy source

Manual Validation:

# Run all pre-commit hooks on entire codebase
pre-commit run --all-files

# Run specific hook
pre-commit run ruff-check --all-files
pre-commit run codespell --all-files

# Run the canonical pre-PR guardrail orchestrator
bash .claude/scripts/pre-commit-validation.sh

Skipping Hooks (only if necessary):

# Commit bypassing pre-commit (NOT recommended)
git commit --no-verify

# But hooks will still run in CI, so issues will be caught there

Pre-Push Checklist

MANDATORY: Before EVERY push, run the canonical pre-PR guardrail orchestrator:

bash .claude/scripts/pre-commit-validation.sh
# Must pass (exit code 0) before proceeding to push

This command orchestrates the real enforcement layers:

pre-commit validate-config
pre-commit run --files ... for changed files, or --all-files when nothing is staged
pytest tests/ci -q --no-cov --override-ini="addopts="

For first-wave review-regression CI parity, you can run the same audit command used by the standing enforcement checks:

python3 -m file_organizer.review_regressions.audit \
  --root . \
  --detector file_organizer.review_regressions.security:SECURITY_DETECTORS \
  --detector file_organizer.review_regressions.correctness:CORRECTNESS_DETECTORS \
  --detector file_organizer.review_regressions.test_quality:TEST_QUALITY_DETECTORS \
  --fail-on-findings

It is intentionally not a second policy engine. If you need a new blocking rule, add it to .pre-commit-config.yaml or tests/ci, then let this script invoke it.

Do not push if validation fails. Fix violations and re-run until it passes.

Quick Check (recommended before every push)

# Fastest — tests on Python 3.12 only (~2 min)
./scripts/test-local-matrix.sh --quick

Full Matrix (recommended before opening a PR)

# Full CI mirror — Python 3.11/3.12 (~10 min)
./scripts/test-local-matrix.sh

Individual Checks

# Python matrix only
./scripts/test-local-matrix.sh --python

# Lint only
ruff check src/

# Type check
mypy src/file_organizer/ --strict

Full CI in Docker with `act` (ubuntu-latest parity)

act runs the actual .github/workflows/*.yml files inside Docker containers that mirror ubuntu-latest. This catches platform-specific bugs that test-local-matrix.sh misses (e.g., st_birthtime on macOS vs st_mtime on Linux).

Prerequisites: Docker Desktop running locally (~2 GB disk for the ubuntu-latest image on first run).

# Install act
brew install act            # macOS
curl -s https://raw.githubusercontent.com/nektos/act/master/install.sh | sudo bash  # Linux
choco install act-cli       # Windows

Usage:

# Run the full ci-full.yml matrix (simulates the daily cron schedule)
act schedule

# Same, but simulate a manual workflow_dispatch trigger
act workflow_dispatch

# Run just the Python test matrix
act schedule -j test-matrix

# Run the fast CI (ci.yml — simulates a push to main)
act push

# Run CI as if a PR was opened
act pull_request

The .actrc file in the project root pins the Docker image and architecture automatically.

act vs test-local-matrix.sh:

	`act`	`test-local-matrix.sh`
OS parity	ubuntu-latest in Docker	Host OS (macOS/Windows)
Workflow sync	Uses actual YAML — stays in sync	Must be updated manually
Speed	Slower (Docker overhead)	Faster (native execution)
Dependencies	Docker Desktop required	pyenv only
Offline	Needs Docker images cached	Works fully offline

What the CI Runs

Workflow ownership

Workflow	File	Triggers	What it runs
CI	`.github/workflows/ci.yml`	push to `main`, PRs	Lint; PR suite (Python 3.11, ci marker); push suite (4 shards × Python 3.11+3.12, `not benchmark and not e2e`); coverage-gate job; benchmark-only lane; Rust desktop checks
CI Full Matrix	`.github/workflows/ci-full.yml`	daily (06:00 UTC), manual	Linux full-suite (4 shards × Python 3.11+3.12) + coverage gate; macOS + Windows cross-platform (Python 3.12, ci/smoke markers)
Security	`.github/workflows/security.yml`	weekly (Monday), PRs	pip-audit + bandit + CodeQL

Rule: each check lives in exactly one workflow.

ci.yml is the fast-path gate: runs on PRs and pushes to main.
- PR test lane: single Python 3.11 job, ci marker only (~2 400 tests), diff-coverage gate
- Push to main: 4 parallel shards × Python 3.11+3.12, each ~4 000 tests with timeout-minutes: 20; a separate coverage-gate job merges .coverage artifacts and enforces the 93% floor. Sharding prevents the GC-finaliser hang that occurred when all 17 000+ tests shared 2 xdist workers in a single job.
- benchmark-only lane runs without xdist (--benchmark-only)
ci-full.yml is the breadth gate: runs daily and includes the same 4-shard Linux full-suite run plus macOS and Windows cross-platform validation.
security.yml owns all security tooling.

scripts/test-local-matrix.sh mirrors the Python matrix locally on the host OS. act mirrors the full workflow inside Docker for ubuntu-latest parity.

Guardrail Ownership

The canonical ownership rules and examples live in Developer Guardrails. Use .pre-commit-config.yaml for staged-file checks, tests/ci/ for semantic guardrails, .github/workflows/ci.yml for CI-only runtime support, and .claude/scripts/pre-commit-validation.sh only as the pre-PR orchestrator.

Common Pitfalls

These are the most frequent causes of CI failures that pass locally:

Pitfall	Why it happens	How to avoid
`datetime.utcnow()`	Deprecated in 3.12, removed in 3.14	Use `datetime.now(timezone.utc)`
`os.stat().st_birthtime`	macOS-only; Linux raises `AttributeError`	Use platform-aware fallback (see `heuristics.py`)
`X \| Y` union types	Python 3.10+ only	Use `Union[X, Y]` or `Optional[X]`
`tomllib`	Python 3.11+ only	Use `tomli` backport or conditional import

Code Style

Formatter: Black (line length 100)
Import sorting: isort
Linter: Ruff (strict)
Type checking: mypy strict mode
Docstrings: Google style

Commit Messages

<type>(<scope>): <subject>

Types: feat, fix, docs, style, refactor, test, chore

Example: fix(history): replace deprecated datetime.utcnow() with timezone-aware alternative

Testing

# Run all tests (full suite)
pytest

# Run with coverage
pytest --cov=file_organizer --cov-report=html

# Run specific test subsets
pytest -m unit           # Unit tests only
pytest -m smoke          # Fast smoke suite (~3.5s) — local pre-commit validation
pytest -m ci             # CI validation tests — PR check suite
pytest -m "not slow"     # Skip slow tests for faster local development
pytest -m "not regression"  # Full suite without regression (PR validation)
pytest tests/            # Full suite including regression tests (complete local/CI run)

Test Markers

Marker	Purpose	When Used
`@pytest.mark.unit`	Fast unit tests	Both local and CI
`@pytest.mark.smoke`	Critical-path tests for pre-commit (~3.5s, deterministic, fast)	Local pre-commit validation
`@pytest.mark.ci`	PR validation tests	GitHub Actions PR checks
`@pytest.mark.integration`	Integration tests (real services, mocked HTTP)	Main branch CI only
`@pytest.mark.regression`	Full regression suite	Complete CI runs, manual testing
`@pytest.mark.slow`	Long-running tests	Skipped in pre-commit and PR CI

Integration Tests

Integration tests live in tests/integration/ and exercise real service wiring with only external HTTP mocked. Use the shared fixtures from tests/integration/conftest.py.

Full-stack fixtures (for API router, web route, and CLI testing):

async_client — httpx.AsyncClient wired to the full FastAPI app via ASGI transport; no server process required
cli_runner — typer.testing.CliRunner for invoking CLI commands in-process
fake_text_model — concrete BaseModel subclass returning deterministic responses; use when a test needs a real model instance rather than a patch

Model stub fixtures (for service-layer testing):

@pytest.mark.integration
def test_organizer_creates_output(
    stub_all_models,       # Stubs both text + vision model init and generate
    stub_nltk,             # No-ops NLTK data download
    integration_source_dir,  # Temp dir with .txt, .csv, .md files
    integration_output_dir,  # Clean temp output dir
):
    from tests.integration.conftest import make_text_config, make_vision_config

    org = FileOrganizer(
        text_model_config=make_text_config(),
        vision_model_config=make_vision_config(),
        dry_run=False,
    )
    result = org.organize(
        input_path=str(integration_source_dir),
        output_path=str(integration_output_dir),
    )
    assert result.processed_files == 3

Integration tests run on main branch pushes only (pytest -m integration), not on every PR. See Testing Guide for the full fixture reference.

Playwright E2E Tests

Browser-based end-to-end tests live in tests/playwright/ and use @pytest.mark.playwright. They exercise the desktop UI in a real Chromium browser with a mocked window.pywebview.api injected by tests/playwright/conftest.py. These tests are not part of the standard CI suite — run them manually when working on the desktop UI layer.

Setup (one-time):

pip install 'file-organizer[desktop,dev]'
playwright install chromium

Running:

# Run all playwright tests (skips addopts coverage gate)
pytest tests/playwright/ --override-ini='addopts='

# Run a single file
pytest tests/playwright/test_desktop_api_contract.py --override-ini='addopts='

The --override-ini='addopts=' flag is required to suppress the default --cov-fail-under=95 coverage gate, which is not meaningful for a browser-driven test run targeting the UI layer.

Quality Gates

Before committing code, this project enforces three quality gates (in order). Note: The /simplify and /code-reviewer commands are Claude Code-specific tools. For contributors not using Claude Code, follow the automated validation script only.

Pre-Commit Validation (bash .claude/scripts/pre-commit-validation.sh)
- Lint, format, type-check, test, validate patterns
- Must PASS before committing
- Prevents CI failures due to local issues
- Fast: ~30 seconds (fail fast on cheap checks)
Code Review (/code-reviewer skill — Claude Code users only)
- Validate implementation against CLAUDE.md standards
- Check for architectural and design issues
- Verify test logic and assertions
- Medium: 30-60 seconds
Code Simplification (/simplify skill — Claude Code users only, optional)
- Review code for efficiency and reuse
- Suggest optimizations and improvements
- Run after significant code changes (>50 lines)
- Expensive: 1-5 minutes (improvement suggestions, not required)

Order matters: Pre-Commit (required) → Code Review (Claude Code only) → Simplify (Claude Code only, optional) → Commit

For non-Claude-Code contributors: Run only step 1 (pre-commit validation script). GitHub CI enforces additional checks.

For details, see .claude/rules/code-quality-validation.md and .claude/rules/development-guidelines.md.

Pull Requests

Create a feature branch from main: git checkout -b feature/description
Make changes with tests
Run quality gates (in order):
- bash .claude/scripts/pre-commit-validation.sh (required, all contributors)
- /code-reviewer (Claude Code users only: validate design and logic)
- /simplify (Claude Code users only: optional if >50 lines of code changes)
Commit with descriptive message following conventional commits
Push and open a PR against main

PRs trigger both CI workflows automatically. Copilot Review runs on every PR (premium feature), so catching issues locally via quality gates saves time and money.

PR Review Response Protocol

If reviewers request changes:

Extract all findings upfront (don't iterate one at a time)
Verify each finding against current code
Apply all fixes in one local pass (no pushing between fixes)
Run quality gates (pre-commit → code-reviewer → simplify)
Commit and push once with comprehensive message
Don't monitor iteratively — trust your quality gates did their job

This single-pass approach prevents review churn and keeps PR history clean.

See .claude/rules/pr-review-response-protocol.md for full details.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Contributing to File Organizer

Development Environment Setup

Prerequisites

Python Version Management (pyenv)

Install the Package

Adding Dev Dependencies

Pre-Commit Hooks (Automatic Validation)

Pre-Push Checklist

Quick Check (recommended before every push)

Full Matrix (recommended before opening a PR)

Individual Checks

Full CI in Docker with `act` (ubuntu-latest parity)

What the CI Runs

Workflow ownership

Guardrail Ownership

Common Pitfalls

Code Style

Commit Messages

Testing

Test Markers

Integration Tests

Playwright E2E Tests

Quality Gates

Pull Requests

PR Review Response Protocol

FilesExpand file tree

CONTRIBUTING.md

Latest commit

History

CONTRIBUTING.md

File metadata and controls

Contributing to File Organizer

Development Environment Setup

Prerequisites

Python Version Management (pyenv)

Install the Package

Adding Dev Dependencies

Pre-Commit Hooks (Automatic Validation)

Pre-Push Checklist

Quick Check (recommended before every push)

Full Matrix (recommended before opening a PR)

Individual Checks

Full CI in Docker with act (ubuntu-latest parity)

What the CI Runs

Workflow ownership

Guardrail Ownership

Common Pitfalls

Code Style

Commit Messages

Testing

Test Markers

Integration Tests

Playwright E2E Tests

Quality Gates

Pull Requests

PR Review Response Protocol

Full CI in Docker with `act` (ubuntu-latest parity)