v3.0.3 by ajslater · Pull Request #134 · ajslater/comicbox

ajslater · 2026-05-15T06:33:17Z

Fix small crashes with metron credits and comicbox with no path

type annotate magic metron field functions and make all params kwargs use eslint outside of editor update deps, new ruff rules. lint & format

commit e27050fbd42f0cf8e549871cc06c70f041672306 Author: AJ Slater <aj@slater.net> Date: Thu Nov 7 21:36:49 2024 -0800 rename deserializeMeta class to TrapExcepionsMeta fix type issues with field metaclass wrapper

fix notes parsing for metron and many variations move notes parsing into another file. add comicinfo metron origin test rename modules to not shadow python builtins fix binary pdf files for new mupdf

Callers that only want a thumbnail (e.g. codex's CoverThread) don't need the full ComicInfo/CoverImage hint resolution. Parsing the metadata for every cover dominates the cost of cover extraction and emits a flood of debug-bucket Union ValidationErrors that look like real failures in DEBUG logs. When skip_metadata=True, bypass generate_cover_paths entirely and read archive index 0 directly. This drops per-call schema instantiation, Union resolution, and path normalization. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* compact news * update deps

…122) ClearingErrorStoreSchema previously split each schema's errors into two buckets: ignored ones logged at DEBUG, real ones at WARNING. The DEBUG bucket only ever held errors from ``_ignore_errors`` — ``Field may not be null.`` (sparse-field tolerance) and ``Invalid input type.`` (Union variant misses) — both of which are internal mechanics, not operator-actionable signal. Each Union miss emitted one ``ValidationError - {'_schema': ['Invalid input type.']}`` line per field per archive, drowning the genuinely useful per-source DEBUG messages emitted by ``_except_on_load``. Filter ignored errors at split time, log only WARNINGs. Real schema failures still surface with full context (path, schema class, normalized message). Collapses the dual-bucket _split_*_errors methods into _filter_* + _log_warnings. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* compact news * update deps * metron: drop broken URL slugs for genre, location, reprint, role, story, tag Metron has no public web pages for these types — only API endpoints — so URLs like https://metron.cloud/genre/3 always 404. Stop emitting them. The numeric Metron ID is still preserved on the identifier. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Shortens the import path for the helper from comicbox.enums.maps.age_rating to comicbox.enums.maps so downstream callers can reach it without drilling into the submodule. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

- Remove unused module/class constants: _COMMENT_ARCHIVE_TYPES, SUFFIXES, _LOG_FORMAT, comet.py IDENTIFIER_TAG/IS_VERSION_OF_TAG, comictagger.py IDENTIFIER_TAG/PAGES_TAG, XmlCountryField (and now-orphaned imports RarFile, ZipFile, CountryField). - Fix latent bug in TrapExceptionsMeta: `attr_name in "deserialize"` was a substring check that wrapped any callable whose name was a substring of "deserialize" (e.g. "er", "size", "ali"). Use the existing _WRAP_METHODS tuple instead so only the exact `deserialize` method is wrapped. - Simplify _get_pdf_enabled() to a plain `import pdffile` probe; the except-arm stub import had no effect. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Consolidate the optional comicbox-pdffile integration into one module (comicbox/_pdf.py) and delete the hand-maintained pdffile_stub.py. Previously six call sites each duplicated a `try: from pdffile import X / except: from pdffile_stub import X` block, and the stub class mirrored the real PDFFile API method-for-method — silent drift risk every time upstream pdffile shipped. Now: - comicbox/_pdf.py is the single source of truth for PDF_ENABLED, PDFFile, and PAGE_FORMAT_VALUES. When pdffile is absent, PDFFile is None at runtime; type checkers see the real class via TYPE_CHECKING. - Every call site that touches PDFFile is gated by `if PDF_ENABLED`. - The `case PDFFile():` arm in box/archive/archive.py is lifted to an `if PDF_ENABLED and isinstance(archive, PDFFile):` guard above the match (the match form would fail when PDFFile is None). - config/__init__.py reads PAGE_FORMAT_VALUES instead of iterating an empty stub Enum. Verified with `pdffile` installed (307/307 tests pass) and in a fresh venv without it (PDF_ENABLED=False, CBZ archives still work, PDF files raise UnsupportedArchiveTypeError, CLI shows the "not installed" hint). Net: -70 lines across 9 files. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

* compact news * update deps * update news and version to alpha 4 * update deps * rename function path in NEWS * bump alpha version to 3.0.0a5

* require comicbox-pdffile 0.6.x for image-dominant page detection Widens the optional ``[pdf]`` extra to require comicbox-pdffile 0.6.x. The new minor release adds image-dominant page detection ( ``PDFFile.classify_page``, ``PDFFile.read_image_if_dominant``, ``PDFFile.read_full_pixmap_jpeg``) used by browser readers to serve scanned-comic PDF pages as plain ``<img>`` instead of routing through pdf.js on the client. comicbox itself doesn't use the new API — the bump is purely a pin update so downstream callers (Codex, OPDS readers) can adopt it. The ``[tool.uv.sources]`` block is transient: it points at the pdffile PR branch so this CI can resolve dependencies before 0.6.x lands on PyPI. Drop it once 0.6.x publishes. Co-Authored-By: Claude Opus 4.7 (1M context) <noreply@anthropic.com> * just use the released pdffile --------- Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

Add bin/regenerate-pdf-test-pages.py — drives Comicbox.get_page_by_index against tests/files/test_pdf.pdf to refresh tests/files/pdf/{N}.pdf when pymupdf or pdffile change page-extraction output. Run on the next drift. Co-authored-by: Claude Opus 4.7 (1M context) <noreply@anthropic.com>

ajslater added 30 commits October 8, 2024 14:27

update pdf pages. binary difference with new mupdf

365c52a

update docker images

e235b83

fix make install dependencies

02f94b3

add jxl to image extensions

5fdf697

fix ignoring macos resource forks

1ea7a2e

resource fork test file

354c438

update deps

fc34948

adjust news

96d977c

Merge branch 'main' into develop

6619c06

Squashed commit of the following:

4417c7c

type annotate magic metron field functions and make all params kwargs use eslint outside of editor update deps, new ruff rules. lint & format

add venv upgrade script

825aa67

ignore PERF203

dc00351

update deps and install pdffile

ef54373

update deps. appease typechecker. new eslint.config

d7c5264

Squashed commit of the following:

f4982b9

commit e27050fbd42f0cf8e549871cc06c70f041672306 Author: AJ Slater <aj@slater.net> Date: Thu Nov 7 21:36:49 2024 -0800 rename deserializeMeta class to TrapExcepionsMeta fix type issues with field metaclass wrapper

add eslint-plugin-json-schema-validator

dd68454

update deps and lint

39e3d48

use mdx instead of markdown

1d0b6ec

remove unused import

341d539

remove superfluous plugins. remove first level globs

1721a0f

update deps

7dcd22b

Squashed commit of the following:

320d4f3

fix notes parsing for metron and many variations move notes parsing into another file. add comicinfo metron origin test rename modules to not shadow python builtins fix binary pdf files for new mupdf

bump version and news

e7d20f8

fix type errors

722135a

format

7470e1f

refactor dynamic class creation to appease typchecker

665c8e8

add libmupdf docs

212d951

Simplify Identifier URL construction for Metron pk ids.

7547160

update deps

633d565

fix story arc parsing. bump version

e61e7ce

ajslater and others added 29 commits May 1, 2026 14:02

ReadResults data structure for process functions

cc656bd

compact news (#119)

cfbdc55

v3.0.0a2 (#121)

55af0d0

* compact news * update deps

update devenv

e41cbd6

update devenv

571e70f

remove unused ty ignore

81d2ce4

comicbox 3 alpha 5 (#123)

34652a5

* compact news * update deps * update news and version to alpha 4 * update deps * rename function path in NEWS * bump alpha version to 3.0.0a5

version 3.0.0

7cd8db0

massage news

82fdab5

Merge branch 'main' into develop

b76faa2

bump version and news and update deps

b1cea20

update deps

b66ea2e

bump pdffile to 0.6.1

5da94e3

Merge branch 'main' into develop

3dd9d27

bump version and news to 3.0.2

615d31f

Merge branch 'main' into develop

99f096e

update deps

a56722b

fix initializing pdf vars with no path

bc412e2

make transforming metron credits more durable

97505c8

bump news

c3285e2

bump version to v3.0.3

190ce6d

ajslater merged commit 9bbc50b into main May 15, 2026
3 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

v3.0.3#134

v3.0.3#134
ajslater merged 335 commits into
mainfrom
develop

ajslater commented May 15, 2026 •

edited

Loading

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

Conversation

ajslater commented May 15, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

1 participant

ajslater commented May 15, 2026 •

edited

Loading