Implement fuzzy matching for similar titled articles #16

@sjbitcode

Problem

As of 12/28/20, I have come across three similarly titled articles. The only difference is that some titles include the source, e.g.:

  • In the new year, take a new look at immigration – starting with DACA
  • In the new year, take a new look at immigration – starting with DACA | Charlotte Observer
  • In the new year, take a new look at immigration – starting with DACA | Raleigh News & Observer

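In this function of the article pipeline, I check for articles with the same title published up to 15 days before or after. Because that check looks for the same title, variants with the source appended are not caught as duplicates.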

Some solutions:
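
One possible direction, as a rough sketch rather than the pipeline's actual code: strip a trailing ` | Source` suffix, then compare the normalized titles with the standard library's `difflib.SequenceMatcher`. The names `normalize_title`, `is_similar_title`, and `find_similar`, the `(title, date)` input shape, and the 0.9 threshold are all assumptions made for illustration. The 15-day window comes from the description above.

```python
from datetime import date, timedelta
from difflib import SequenceMatcher


def normalize_title(title: str) -> str:
    """Drop a trailing ' | Source' suffix, then lowercase and collapse whitespace.

    Assumption: the source name, when present, is the last ' | '-separated part.
    A title that legitimately contains ' | ' would lose its final segment.
    """
    base = title.rsplit(" | ", 1)[0]
    return " ".join(base.lower().split())


def is_similar_title(a: str, b: str, threshold: float = 0.9) -> bool:
    """Return True when the normalized titles reach the similarity threshold."""
    ratio = SequenceMatcher(None, normalize_title(a), normalize_title(b)).ratio()
    return ratio >= threshold


def find_similar(title, published, existing, window_days=15, threshold=0.9):
    """Return titles from `existing` that fall inside the date window and match.

    `existing` is a hypothetical iterable of (title, published_date) pairs.
    """
    window = timedelta(days=window_days)
    return [
        other_title
        for other_title, other_date in existing
        if abs(other_date - published) <= window
        and is_similar_title(title, other_title, threshold)
    ]


titles = [
    "In the new year, take a new look at immigration – starting with DACA",
    "In the new year, take a new look at immigration – starting with DACA | Charlotte Observer",
    "In the new year, take a new look at immigration – starting with DACA | Raleigh News & Observer",
]
existing = [(titles[1], date(2020, 12, 28)), (titles[2], date(2020, 11, 1))]
matches = find_similar(titles[0], date(2020, 12, 20), existing)
```

Here `matches` contains only the Charlotte Observer variant, because the Raleigh entry in this made-up data falls outside the 15-day window. The threshold would need tuning against real titles, and a dedicated fuzzy-matching library could replace `difflib` if the standard-library ratio proves too coarse.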