Skip to content

T1.4: Time-aligned comparison dataset #4

@w2naf

Description

@w2naf

Description

Write a function/module that takes a HamSCI DataFrame (from #2) and a GIRO DataFrame (from #3) and produces a time-aligned paired dataset for validation.

Requirements

  • Accept a HamSCI DataFrame and a GIRO DataFrame as inputs
  • Align measurements in time using nearest-neighbor matching (HamSCI sounds every 5 min; GIRO typically every 15 min)
  • Allow configurable maximum time gap for pairing (e.g., reject pairs where timestamps differ by > 5 minutes)
  • Output a single DataFrame where each row contains both HamSCI and GIRO measurements, clearly labeled (e.g., hamsci_vh_3p5, giro_fof2, etc.)
  • Include a column indicating the time difference between the paired measurements
  • Flag or exclude rows where one side has missing data

Acceptance Criteria

  • Produces a paired DataFrame from sample HamSCI and GIRO data
  • Time alignment is correct (verified by inspection)
  • Missing data is handled (NaN or excluded)
  • Basic test included

Timeline

Context

Part of Track 1 data infrastructure. This paired dataset is the input for all validation analysis. See plans/track1-workplan.md.

Metadata

Metadata

Assignees

Labels

data-infrastructureData loading, retrieval, and processing infrastructuretrack-1-validationTrack 1: Instrument Validation

Type

No type

Projects

Status

In review

Relationships

None yet

Development

No branches or pull requests

Issue actions