Skip to content

Add generated data quality report#5

Merged
Bmowville merged 1 commit into
mainfrom
feature/data-quality-report
Jun 6, 2026
Merged

Add generated data quality report#5
Bmowville merged 1 commit into
mainfrom
feature/data-quality-report

Conversation

@Bmowville

Copy link
Copy Markdown
Owner

Adds a generated data quality report for the Titanic SQLite pipeline output.

Summary:

  • Add a Markdown quality report generator for row count, expected columns, duplicate PassengerId values, missing values, and numeric ranges.
  • Add validator checks for the generated quality report.
  • Update CI to generate and verify the quality report from a clean pipeline run.
  • Update README and pipeline contracts with the new report path and run step.

Validation:

  • python pipelines/01_ingest_to_sqlite.py
  • python scripts/generate_data_quality_report.py
  • python scripts/validate_outputs.py
  • python -m compileall pipelines scripts

Closes #4

@Bmowville Bmowville merged commit 35ae254 into main Jun 6, 2026
1 check passed
@Bmowville Bmowville deleted the feature/data-quality-report branch June 6, 2026 05:37
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

Add generated data quality report

1 participant