Skip to content

Fetch article URLs into a folder #9

Description

@njt

The RSS feeds only have heaadlines. We need the full text.
Ultimately we'll fetch and clean the article HTML.
For you to write the article HTML cleaner, I'll need to fetch articles and make them available to you in a later issue.
So write a new stage that fetches the URLs and stores the raw HTML in the database (no need for CSS or JS).
I'll check in and my database and push it back up.

Metadata

Metadata

Assignees

Labels

No labels
No labels

Projects

No projects

Milestone

No milestone

Relationships

None yet

Development

No branches or pull requests

Issue actions