ocr-story

## TL;DR

Why most computers can't properly parse this document? 

[A perfectly fine PDF page that cannot be read 

Why most text extraction methods are not optical character recognition. And how computer _reading_ a document is more complex than you think.

## Technical prowess (optional)

 - Extracting, structuring and validating PDF text extraction in 2025.