
Fix pararius parser for Cloudflare-blocked .nl domain #132

Merged: wtfloris merged 1 commit into master from fix/pararius-parser on Mar 13, 2026

Conversation

@wtfloris
Owner

Summary

  • Pararius added Cloudflare bot protection to their .nl domain, causing 403s. Switched scraper to pararius.com which serves identical listings without protection.
  • Added English property type prefix stripping (Flat, House, Room, Apartment) since .com uses English labels instead of Dutch.
  • Added sqm extraction from illustrated-features__item--surface-area element.
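The last two changes can be illustrated with a minimal, standard-library-only sketch. This is not the code merged in this PR: the function names (`strip_property_type`, `extract_sqm`), the sample markup, and the assumed "75 m²" text format are hypothetical. Only the four property type labels and the `illustrated-features__item--surface-area` class name come from the summary above.

```python
import re
from html.parser import HTMLParser
from typing import Optional

# English property-type labels named in the PR summary.
PROPERTY_TYPE_PREFIXES = ("Flat", "House", "Room", "Apartment")

# Match a label only when it is a whole leading word followed by whitespace.
_PREFIX_RE = re.compile(
    r"^(?:" + "|".join(PROPERTY_TYPE_PREFIXES) + r")\s+", re.IGNORECASE
)


def strip_property_type(title: str) -> str:
    """Remove a leading property-type label (hypothetical helper)."""
    return _PREFIX_RE.sub("", title.strip(), count=1)


class _SurfaceAreaParser(HTMLParser):
    """Collects the text of the first element with the surface-area class.

    Simplification: assumes the target element contains no unclosed void
    tags such as <br>, which would throw off the depth counter.
    """

    TARGET_CLASS = "illustrated-features__item--surface-area"

    def __init__(self) -> None:
        super().__init__()
        self._depth = 0      # > 0 while inside the target element
        self._done = False   # True once the first target element has closed
        self.text = ""

    def handle_starttag(self, tag, attrs):
        if self._depth:
            self._depth += 1
            return
        classes = (dict(attrs).get("class") or "").split()
        if not self._done and self.TARGET_CLASS in classes:
            self._depth = 1

    def handle_endtag(self, tag):
        if self._depth:
            self._depth -= 1
            if self._depth == 0:
                self._done = True

    def handle_data(self, data):
        if self._depth:
            self.text += data


def extract_sqm(html: str) -> Optional[int]:
    """Return the surface area as an int, or None if it cannot be found.

    Assumes the element text looks like '75 m²' (an assumption, not
    something confirmed by the PR).
    """
    parser = _SurfaceAreaParser()
    parser.feed(html)
    parser.close()
    match = re.search(r"(\d+)\s*m", parser.text)
    return int(match.group(1)) if match else None
```

With these assumptions, `strip_property_type("Flat Keizersgracht 1")` yields the bare address, and a title without a known label passes through unchanged. Requiring whitespace after the label keeps street names that merely begin with one of the labels intact.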

Test plan

  • All 195 tests pass (9 pararius-specific)
  • Verified live: 13 listings scraped with correct address, city, price, and sqm
  • Run migration SQL on prod: misc/sql/20260313_target_pararius_com_migration.sql.enc

🤖 Generated with Claude Code

Switch from pararius.nl (now behind Cloudflare managed challenge) to
pararius.com which serves the same listings. Add English property type
prefix stripping and sqm extraction.

Co-Authored-By: Claude Opus 4.6 <noreply@anthropic.com>
@wtfloris wtfloris merged commit 4041bb3 into master Mar 13, 2026
1 check passed