fix: flatten merged widget annotations in form.flatten() by horacio-penya · Pull Request #134 · cantoo-scribe/pdf-lib

horacio-penya · 2026-01-31T00:35:19Z

Extends form.flatten() to handle "merged" widget annotations - widgets that have field properties (/FT, /V, /T) directly on the annotation dict in page Annots rather than being registered in AcroForm.Fields.

Previously, form.flatten() would do nothing for PDFs with merged widgets because getFields() only traverses AcroForm.Fields. Now it also scans page annotations for orphaned widgets and flattens them.

Handles both simple appearances (text fields) and stateful appearances (checkboxes, radio buttons with /AS appearance state).


Why?

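embedPages does not carry these annotations over, and because form.flatten() previously skipped merged widgets, their content was lost from the embedded pages.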

How?

Added a private method flattenMergedWidgets() called at the end of flatten() that:

- Iterates all pages and their annotations
- Identifies widget annotations (/Subtype: /Widget) with field type (/FT) directly on them
- Resolves the appearance stream from /AP/N, handling:
  - Direct streams (text fields)
  - Appearance state dictionaries (checkboxes/radio buttons), looking up /AS to get the current state
- Draws the appearance as an XObject at the widget's /Rect position
- Removes the widget annotation
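The steps above can be sketched roughly as follows. This is a simplified model, not the code in this PR: the `Annotation`, `AppearanceStream`, and `Page` types are hypothetical plain-object stand-ins for PDF dictionaries (they are not pdf-lib types), and "drawing" is reduced to recording which stream would be painted at which rectangle. It also assumes that widgets without a usable appearance or /Rect are left in place rather than removed.

```typescript
// Simplified stand-ins for PDF objects. These are NOT pdf-lib types.
type AppearanceStream = { kind: 'stream'; id: string };
type AppearanceStates = {
  kind: 'states';
  states: Record<string, AppearanceStream>;
};

interface Annotation {
  Subtype?: string; // e.g. 'Widget'
  FT?: string; // field type on the annotation itself: 'Tx', 'Btn', ...
  AS?: string; // current appearance state, e.g. 'Yes' or 'Off'
  AP?: { N?: AppearanceStream | AppearanceStates };
  Rect?: number[];
}

interface Page {
  annots: Annotation[];
  // Records what would be drawn as an XObject, and where.
  drawn: { id: string; rect: number[] }[];
}

// A "merged" widget carries the field type directly on the annotation dict.
function isMergedWidget(annot: Annotation): boolean {
  return annot.Subtype === 'Widget' && annot.FT !== undefined;
}

// Resolve the normal appearance (/AP/N) to a single stream, if there is one.
function resolveNormalAppearance(
  annot: Annotation,
): AppearanceStream | undefined {
  const normal = annot.AP?.N;
  if (!normal) return undefined;
  if (normal.kind === 'stream') return normal; // direct stream (text field)
  if (annot.AS === undefined) return undefined; // no state selected
  return normal.states[annot.AS]; // stateful (checkbox / radio button)
}

function flattenMergedWidgets(pages: Page[]): void {
  for (const page of pages) {
    const remaining: Annotation[] = [];
    for (const annot of page.annots) {
      const stream = isMergedWidget(annot)
        ? resolveNormalAppearance(annot)
        : undefined;
      if (!stream || !annot.Rect || annot.Rect.length < 4) {
        remaining.push(annot); // leave anything we cannot flatten untouched
        continue;
      }
      page.drawn.push({ id: stream.id, rect: annot.Rect.slice(0, 4) });
      // The widget is dropped by not pushing it to `remaining`.
    }
    page.annots = remaining;
  }
}
```

In the real implementation the same decisions would be made on pdf-lib dictionary objects, and the recorded entry would correspond to an XObject draw operator appended to the page content.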

Alternative implementation: embedPages could copy the annotations as-is, the way copyPages does, but my current code uses flatten, so this seemed like the right fix.

Testing?

I tested with documents that had "merged" widget annotations. Before this PR the annotations were lost; now they appear in the output.

New Dependencies?

No.

Screenshots

Before:

After:

Anything Else?

I used Claude Code in an AI pair-programming way: I directed the work, even though I did not write the code myself.

I'm not sure how a test for this should be done.

I ran the linter, and it wanted to make changes in files unrelated to this PR, so I didn't commit those.

Checklist

Commit trailer: Co-Authored-By: Claude Opus 4.5 <noreply@anthropic.com>

Sharcoux · 2026-02-17T13:40:06Z

src/api/form/PDFForm.ts

+      const annotsToRemove: PDFRef[] = [];
+
+      for (let i = 0; i < annots.size(); i++) {
+        const annotRef = annots.get(i);


Probably a good idea to wrap this in a try/catch and handle errors gracefully
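One possible shape for this, as a minimal sketch: isolate each annotation in its own try/catch so that one malformed widget does not abort the whole flatten. The `forEachSafely` helper and its `onError` callback are hypothetical names used only for illustration. They are not part of pdf-lib or of this PR.

```typescript
// Hypothetical helper: run `handle` on every item, isolating failures per item.
// Returns the number of items handled without an error.
function forEachSafely<T>(
  items: T[],
  handle: (item: T, index: number) => void,
  onError: (error: unknown, index: number) => void,
): number {
  let handled = 0;
  items.forEach((item, index) => {
    try {
      handle(item, index);
      handled++;
    } catch (error) {
      // Report and continue: the failing item is skipped, the rest still run.
      onError(error, index);
    }
  });
  return handled;
}
```

In the loop above, the body that processes `annots.get(i)` would play the role of `handle`, and `onError` could log a warning and leave the annotation in place.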

Sharcoux · 2026-02-17T13:42:14Z

src/api/form/PDFForm.ts

+        const rect = annot.get(PDFName.of('Rect'));
+        if (!(rect instanceof PDFArray) || rect.size() < 4) continue;
+
+        const x1 = (rect.get(0) as any)?.asNumber?.() ?? 0;


You could use:

const rect = annot.get(PDFName.of('Rect'));
const rectangle = rect.asRectangle();

This will properly normalize the rectangle
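To illustrate what normalizing means here: a /Rect array gives two diagonally opposite corners, and the PDF specification allows them in any order, so `x1` is not guaranteed to be the left edge. The sketch below shows the idea with a hypothetical `normalizeRect` helper. It is not the library's `asRectangle()` implementation, and the `Rectangle` shape is an assumption made for the example.

```typescript
interface Rectangle {
  x: number;
  y: number;
  width: number;
  height: number;
}

// Hypothetical helper: turn two opposite corners, given in any order,
// into a lower-left origin plus a non-negative width and height.
function normalizeRect(
  [x1, y1, x2, y2]: [number, number, number, number],
): Rectangle {
  return {
    x: Math.min(x1, x2),
    y: Math.min(y1, y2),
    width: Math.abs(x2 - x1),
    height: Math.abs(y2 - y1),
  };
}
```

Reading the four numbers individually with a `?? 0` fallback, as in the diff, would silently place a widget with swapped corners at a negative size, which is the case normalization guards against.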

github-actions bot added the needs-triage label Jan 31, 2026

Sharcoux requested changes Feb 17, 2026

Sharcoux mentioned this pull request Mar 5, 2026

How to remove the form fields and comments from pdf. #118

Open

2 tasks

horacio-penya wants to merge 1 commit into cantoo-scribe:master from horacio-penya:fix/flatten-merged-widgets
