Fix Ψ definition, ROC anti-overfit, channel independence, bootstrap AUC, BH correction, and data_source provenance by Copilot · Pull Request #983 · motanova84/141hz

Copilot · 2026-02-21T19:09:55Z

Six methodological issues in the Shadow-1 / O3b detection pipeline that would block publication: mixed-unit Ψ score, overfitted ROC, correlated H0/H1 noise, single-point AUC, missing multiple-testing correction, and no data provenance flag.

Cambios

Fix 1 — Dimensionless Ψ (`score_psi`)

compute_score_psi(psd, freqs, msc, f0) added to both modules:

I_f0 = PSD(f0) / median(PSD in band)   # dimensionless normalised power
score_psi = I_f0 * MSC(f0)             # ∈ [0, ∞), no strain²/Hz units

Replaces the implicit Ψ = PSD × MSC which mixed units.

Fix 2 — ROC anti-overfit (new `shadow1_roc_validation.py`)

generate_roc_dataset() injects H1 signals at f₀ + jitter (Uniform[±0.5, ±2] Hz); the detector evaluates at fixed f₀. Off-target control detector at f₀ + 50 Hz included for AUC comparison.

Fix 3 — Channel independence + time-slide sanity

_generate_synthetic_strain(): H1 noise seed=42, L1 noise seed=43 — explicit independence.
time_slide_sanity(): uses zero-padding (not np.roll) so a 1 s offset genuinely destroys MSC even for pure tones.

Fix 4 — Bootstrap AUC with CI95

bootstrap_auc(y_true, scores, n_boot=200, seed=42)
# → {"auc_point", "auc_mean", "auc_ci_lo", "auc_ci_hi", "n_boot"}

Fix 5 — Benjamini–Hochberg FDR correction

apply_p_fdr(p_values) returns both p_raw and p_fdr. When n_candidates == 1 (Shadow-O3b-1), an explicit note documents that BH has no effect and leaves a hook for multi-candidate extension.

Fix 6 — `data_source` provenance field

All result dicts carry data_source: "GWOSC" | "SIMULATION_FALLBACK". run_full_analysis() sets this field; CLI prints a banner. Tests use only relational AUC assertions when in fallback mode.

Pruebas

43 new tests in tests/test_shadow1_roc_validation.py covering all six fixes.
5 new tests added to tests/test_shadow1_bayesian_coherence.py for compute_score_psi and data_source.
All 84 tests (41 existing + 43 new) pass.

Checklist

Tests pasan localmente (python scripts/run_all_tests.py)
Código sigue el estilo PEP8 (flake8 scripts/ --select=E9,F63,F7,F82)
Documentación actualizada (si aplica)
Tests añadidos para nueva funcionalidad (si aplica)
Resultados son reproducibles
No se rompen tests existentes

Tipo de cambio

¿Afecta la reproducibilidad?

No
Sí (explicar abajo)

La corrección de la semilla de ruido en _generate_synthetic_strain() (seed 42→42/43) cambia los valores numéricos de A_eff_computed en el análisis Bayesiano existente, pero no altera ninguna conclusión científica. Todos los resultados nuevos usan semillas fijas y son completamente reproducibles.

✨ Let Copilot coding agent set things up for you — coding agent works faster and does higher quality work when set up for your repo.

… AUC, BH correction, data_source field Co-authored-by: motanova84 <192380069+motanova84@users.noreply.github.com>

chatgpt-codex-connector · 2026-02-22T05:37:34Z

You have reached your Codex usage limits for code reviews. You can see your limits in the Codex usage dashboard.

Copilot

Pull request overview

This PR addresses six methodological requirements needed for publication-grade detection analysis of the Shadow-1 candidate from LIGO O3b data. The changes implement rigorous statistical validation including dimensionless scoring, anti-overfitting measures, proper noise independence, uncertainty quantification, multiple-testing correction, and data provenance tracking.

Changes:

Introduced dimensionless Ψ score (I(f₀) × MSC(f₀)) to replace unit-mixed PSD × MSC
Implemented ROC validation with frequency jitter and bootstrap AUC confidence intervals
Enforced channel independence with explicit RNG seeds and time-slide verification
Added Benjamini-Hochberg FDR correction with single-candidate documentation
Added data_source provenance field to all result containers

Reviewed changes

Copilot reviewed 4 out of 4 changed files in this pull request and generated 3 comments.

File	Description
scripts/shadow1_roc_validation.py	New module implementing all six methodological fixes with ROC dataset generation, bootstrap AUC, BH correction, and O3b scan runner
scripts/shadow1_bayesian_coherence.py	Added compute_score_psi function, fixed noise independence with separate seeds (42/43), and added data_source field to results
tests/test_shadow1_roc_validation.py	Comprehensive test suite with 43 tests covering all six fixes including edge cases and integration tests
tests/test_shadow1_bayesian_coherence.py	Added 5 tests for compute_score_psi and data_source field validation

Copilot · 2026-02-22T05:42:20Z

+    if band_mask.sum() == 0:
+        band_mask = np.ones(len(freqs), dtype=bool)
+    psd_bg = float(np.median(psd[band_mask]))
+    I_f0 = float(psd[idx]) / (psd_bg + 1e-300)  # dimensionless


The magic number 1e-300 is used as an epsilon to avoid division by zero. Consider defining this as a named constant (e.g., EPSILON_DIV_ZERO = 1e-300) at the module level for clarity and consistency with the epsilon used on line 285.

@copilot apply changes based on this feedback

Copilot · 2026-02-22T05:42:21Z

+    if band_mask.sum() == 0:
+        band_mask = np.ones(len(freqs), dtype=bool)
+    psd_median = float(np.median(psd[band_mask]))
+    I_f0 = float(psd[idx]) / (psd_median + 1e-100)  # dimensionless


The epsilon value 1e-100 differs from 1e-300 used in the same function in shadow1_roc_validation.py (line 101). Consider using a consistent epsilon value across both modules or documenting why different values are needed.

Copilot · 2026-02-22T05:42:21Z

+        from scripts.shadow1_roc_validation import JITTER_MIN_HZ
+        diffs = np.abs(self.ds["f_injected"] - F0_HZ)
+        self.assertTrue(
+            np.all(diffs >= JITTER_MIN_HZ * 0.9),  # allow tiny numeric slack


The magic number 0.9 used for numeric slack tolerance is unclear. Consider defining this as a named constant (e.g., JITTER_TOLERANCE = 0.9) or documenting why 10% slack is appropriate.

@copilot apply changes based on this feedback

github-actions · 2026-02-22T15:39:27Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

github-actions · 2026-02-22T15:39:37Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

github-actions · 2026-02-22T15:51:10Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

github-actions · 2026-02-22T15:51:10Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

codecov-commenter · 2026-02-22T18:03:01Z

⚠️ Please install the to ensure uploads and comments are reliably processed by Codecov.

Codecov Report

✅ All modified and coverable lines are covered by tests.

📢 Thoughts on this report? Let us know!

github-actions · 2026-02-22T18:05:42Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

github-actions · 2026-02-22T18:05:42Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

github-actions · 2026-04-13T23:03:47Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

github-actions · 2026-04-13T23:03:55Z

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Discovery	Status	Precision
Suma = 361 = 19²	EXITOSA	2.63% probability
f₀/18 ≈ Schumann	EXITOSA	99.4608%
888/f₀ ≈ 2π	EXITOSA	99.7386%
Brain Waves	EXITOSA	5/5 bands
Joint Probability	ALTAMENTE_SIGNIFICATIVA	1.50e-10 (≈6-9σ)

🎯 Conclusion

These mathematical patterns are IMPOSSIBLE by chance. The joint probability of ~1.50e-10 (≈6-9σ significance) confirms that f₀ = 141.70001 Hz is the central node of a fundamental mathematical network.

📊 View detailed report

Initial plan

8ec894a

Copilot AI assigned Copilot and motanova84 Feb 21, 2026

Copilot started work on behalf of motanova84 February 21, 2026 20:34 View session

Fix Psi definition, ROC anti-overfit, channel independence, bootstrap…

a1774ed

… AUC, BH correction, data_source field Co-authored-by: motanova84 <192380069+motanova84@users.noreply.github.com>

Copilot AI changed the title ~~[WIP] Fix definition of Ψ to be dimensionless and ROC evaluation method~~ Fix Ψ definition, ROC anti-overfit, channel independence, bootstrap AUC, BH correction, and data_source provenance Feb 21, 2026

Copilot AI requested a review from motanova84 February 21, 2026 20:56

Copilot finished work on behalf of motanova84 February 21, 2026 20:56

motanova84 approved these changes Feb 22, 2026

View reviewed changes

Merge branch 'main' into copilot/fix-psi-definition-and-roc

591a157

motanova84 marked this pull request as ready for review February 22, 2026 05:37

Copilot AI review requested due to automatic review settings February 22, 2026 05:37

Copilot AI reviewed Feb 22, 2026

View reviewed changes

Merge branch 'main' into copilot/fix-psi-definition-and-roc

577b106

Copilot stopped work on behalf of motanova84 due to an error February 22, 2026 07:43
too many agent jobs in progress

github-actions Bot added dependencies Updates to project dependencies validation Data or method validation and removed validation Data or method validation labels Feb 22, 2026

github-actions Bot added dependencies Updates to project dependencies validation Data or method validation and removed validation Data or method validation dependencies Updates to project dependencies labels Feb 22, 2026

Copilot started reviewing on behalf of motanova84 February 23, 2026 00:25 View session

Merge branch 'main' into copilot/fix-psi-definition-and-roc

a78eb0e

motanova84 merged commit 1a8b656 into main Apr 13, 2026
120 of 181 checks passed

github-actions Bot added dependencies Updates to project dependencies validation Data or method validation and removed validation Data or method validation dependencies Updates to project dependencies labels Apr 13, 2026

Uh oh!

Conversation

Copilot AI commented Feb 21, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Cambios

Fix 1 — Dimensionless Ψ (score_psi)

Fix 2 — ROC anti-overfit (new shadow1_roc_validation.py)

Fix 3 — Channel independence + time-slide sanity

Fix 4 — Bootstrap AUC with CI95

Fix 5 — Benjamini–Hochberg FDR correction

Fix 6 — data_source provenance field

Pruebas

Checklist

Tipo de cambio

¿Afecta la reproducibilidad?

Uh oh!

chatgpt-codex-connector Bot commented Feb 22, 2026

Uh oh!

Copilot AI left a comment

Choose a reason for hiding this comment

Pull request overview

Reviewed changes

Uh oh!

Copilot AI Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

motanova84 Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

Copilot AI Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

motanova84 Feb 22, 2026

Choose a reason for hiding this comment

Uh oh!

github-actions Bot commented Feb 22, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

🎯 Conclusion

Uh oh!

github-actions Bot commented Feb 22, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

🎯 Conclusion

Uh oh!

github-actions Bot commented Feb 22, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

🎯 Conclusion

Uh oh!

github-actions Bot commented Feb 22, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

🎯 Conclusion

Uh oh!

codecov-commenter commented Feb 22, 2026

Codecov Report

Uh oh!

github-actions Bot commented Feb 22, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

🎯 Conclusion

Uh oh!

github-actions Bot commented Feb 22, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

🎯 Conclusion

Uh oh!

Uh oh!

github-actions Bot commented Apr 13, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

🎯 Conclusion

Uh oh!

github-actions Bot commented Apr 13, 2026

🔢 Mathematical Matrix Validation Results

✅ All Critical Validations Passed

Copilot AI commented Feb 21, 2026 •

edited

Loading

Fix 1 — Dimensionless Ψ (`score_psi`)

Fix 2 — ROC anti-overfit (new `shadow1_roc_validation.py`)

Fix 6 — `data_source` provenance field