fix crash when debugging by weiyuan-jiang · Pull Request #173 · GEOS-ESM/GEOSldas_GridComp

weiyuan-jiang · 2026-05-10T01:40:48Z

This PR fixes runs in debug mode. It is 0-diff. Together with the related PR in NCEP_Shared, GEOSldas can run in debug mode with land assimilation.

Related PRs:
GEOS-ESM/NCEP_Shared#37

In debug mode, Intel Fortran raises error 65 (floating invalid) when comparing NaN values (e.g. NaN >= 0.). Input scaling parameter files use NaN as a no-data indicator, so comparisons like sclprm_std_obs >= 0. raise an FP exception before they can return false.

Fix: Add NaN guards using the standard Fortran idiom (val == val is .false. for NaN) before the existing range checks at all 3 guard sites in clsm_ensupd_read_obs.F90 (near lines 9755, 9989, and 10384).
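The guard described above can be sketched as a small standalone program. This is only an illustration, not the code in clsm_ensupd_read_obs.F90: the program name, the helper `in_range`, and the sample values are hypothetical. It relies on the IEEE 754 behavior that equality comparisons on a quiet NaN typically do not signal "invalid", whereas ordered comparisons such as `>=` do.

```fortran
program nan_guard_demo
  use, intrinsic :: ieee_arithmetic, only: ieee_value, ieee_quiet_nan
  implicit none

  real    :: std_obs(3)
  integer :: i

  ! Hypothetical sample values: a valid value, a negative marker, and NaN.
  std_obs(1) = 0.5
  std_obs(2) = -9999.0
  std_obs(3) = ieee_value(1.0, ieee_quiet_nan)

  do i = 1, size(std_obs)
     print '(a,i0,a,l1)', 'element ', i, ' passes the range check: ', &
          in_range(std_obs(i))
  end do

contains

  ! Returns .true. only for non-NaN values that are >= 0.
  logical function in_range(val)
    real, intent(in) :: val
    in_range = .false.
    if (val == val) then        ! .false. for NaN, so NaN never reaches ">="
       in_range = (val >= 0.0)
    end if
  end function in_range

end program nan_guard_demo
```

The sketch nests the two tests because the Fortran standard does not require short-circuit evaluation of `.and.`, so a compiler is allowed to evaluate the range check even when the NaN guard is false. Aggressive floating-point optimization settings may also fold `val == val` to `.true.`, which is worth keeping in mind for optimized builds.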

When N_files=1, lon_min_vec(2) and lon_max_vec(2) remain initialized to MAPL_UNDEF (1.0e15). Whole-array operations like

    start_ind = (lon_min_vec - CMG_ll_lon) / CMG_dlon

produce ~2e16, which overflows a 32-bit integer and causes forrtl: error (65): floating invalid in debug mode.

Fix: Restrict array operations to the (1:N_files) elements only:

    start_ind(1:N_files) = (lon_min_vec(1:N_files) - CMG_ll_lon)/CMG_dlon
    last_ind(1:N_files)  = (lon_max_vec(1:N_files) - CMG_ll_lon)/CMG_dlon
    N_lon_vec(1:N_files) = last_ind(1:N_files) - start_ind(1:N_files) + 1
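A minimal standalone sketch of the array-section fix follows. The variable names are taken from the description above, but the grid values (`CMG_ll_lon`, `CMG_dlon`), the longitudes, and the zero-initialization of the integer arrays are assumptions chosen for illustration.

```fortran
program array_section_demo
  implicit none

  real,    parameter :: MAPL_UNDEF = 1.0e15
  real,    parameter :: CMG_ll_lon = -180.0   ! hypothetical grid origin
  real,    parameter :: CMG_dlon   = 0.25     ! hypothetical grid spacing
  integer, parameter :: N_files    = 1

  real    :: lon_min_vec(2), lon_max_vec(2)
  integer :: start_ind(2), last_ind(2), N_lon_vec(2)

  ! Both slots start out undefined; only the first N_files are filled.
  lon_min_vec = MAPL_UNDEF
  lon_max_vec = MAPL_UNDEF
  lon_min_vec(1) = -10.0
  lon_max_vec(1) =  10.0

  start_ind = 0
  last_ind  = 0
  N_lon_vec = 0

  ! Only elements 1:N_files are converted to integer, so the
  ! MAPL_UNDEF entry in slot 2 is never part of a real-to-integer conversion.
  start_ind(1:N_files) = (lon_min_vec(1:N_files) - CMG_ll_lon)/CMG_dlon
  last_ind(1:N_files)  = (lon_max_vec(1:N_files) - CMG_ll_lon)/CMG_dlon
  N_lon_vec(1:N_files) = last_ind(1:N_files) - start_ind(1:N_files) + 1

  print *, 'start_ind =', start_ind
  print *, 'last_ind  =', last_ind
  print *, 'N_lon_vec =', N_lon_vec

end program array_section_demo
```

Replacing the three sectioned assignments with whole-array assignments would push the MAPL_UNDEF entry through the real-to-integer conversion, which is the out-of-range case the description reports.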

gmao-rreichle

@weiyuan-jiang, many thanks for the PR. I have a couple of comments/questions below.

gmao-rreichle · 2026-05-11T14:40:16Z

+       ! when in debug mode, nint(VEGCLS) with 1.0e15 may crash
+       allocate(tmpR(N_catl))
+       tmpR = VEGCLS(:)
+       where(tmpR > 1.0e10) tmpR = nodata_generic 
+       mwRTM_param(:)%vegcls    = nint(tmpR(:))
+       tmpR = SOILCLS(:)
+       where(tmpR > 1.0e10) tmpR = nodata_generic
+       mwRTM_param(:)%soilcls   = nint(tmpR(:))
+


@weiyuan-jiang : I'm not sure if we need this, and if we do, I'm not sure it's the best fix. First, I'm pretty confident that vegcls and soilcls here should never be no-data. So a better way of addressing this might be to check for (native) no-data-values first and stop if any are encountered.
If I'm wrong and no-data-values for the "integer" fields could happen, we might need to figure out a more robust solution. The current solution depends on "nodata_generic" being -9999. I was hoping that at some point in the future we can use MAPL_UNDEF instead of LDAS having its own nodata-value. But if nodata_generic turns into 1e15, the currently implemented solution won't work, I think.

weiyuan-jiang · 2026-05-11

When I debugged, I printed the values and did see 1.0e15. Converted to an integer, that value becomes the most negative integer. Here our nodata_generic is -9999.0.
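The reviewer's suggestion to check first and stop could look roughly like the sketch below. It is not taken from the PR: the program, the helper `fits_default_integer`, and the sample values are hypothetical. The check tests whether a value fits in a default integer, so it does not depend on whether the no-data value is -9999 or 1e15.

```fortran
program class_field_check
  implicit none

  real    :: vegcls_real(4)
  integer :: vegcls_int(4)

  ! Hypothetical input; the last entry mimics an undefined value of 1.0e15.
  vegcls_real = [1.0, 4.0, 2.0, 1.0e15]

  ! Stop before nint() is ever applied to a value it cannot represent.
  if (.not. all(fits_default_integer(vegcls_real))) then
     error stop 'vegcls contains values that cannot be converted to integer'
  end if

  vegcls_int = nint(vegcls_real)
  print *, vegcls_int

contains

  ! .true. when x is small enough in magnitude for a default integer.
  ! NaN is not handled here; it would need a separate guard.
  elemental logical function fits_default_integer(x)
    real, intent(in) :: x
    fits_default_integer = abs(x) < real(huge(1))
  end function fits_default_integer

end program class_field_check
```

With the sample data above the program stops at the check. Whether stopping is the right response, rather than substituting a no-data value, is the open question in this thread.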

gmao-rreichle · 2026-05-11T14:41:06Z

+          if ( sclprm_mean_obs(ind)==sclprm_mean_obs(ind) .and.          &
+               sclprm_mean_mod(ind)==sclprm_mean_mod(ind) .and.          &
+               sclprm_std_obs(ind) ==sclprm_std_obs(ind)  .and.          &
+               sclprm_std_mod(ind) ==sclprm_std_mod(ind)  .and.          &


Why do we need these four lines, which should always be true? Not sure where they're coming from and what they might be meant to do.
Or do they evaluate to "false" if a value is NaN? If this is the intent, we should at least add a comment to clarify. Although I don't think the reader for the "sclprm_*" values would produce NaNs.
Alternatively, perhaps check more explicitly for nodata values?
I guess all of this depends on the nodata-value here being -9999. If the nodata-value is 1e15, it wouldn't be identified by any of the if-conditions.

weiyuan-jiang · 2026-05-11

I think that comparison is a NaN check: NaN == NaN is always .false. In debug mode, comparing a NaN in the range checks may lead to a crash.
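One way to make the intent explicit, as the reviewer asks, is the standard `ieee_is_nan` inquiry from the intrinsic `ieee_arithmetic` module together with a direct no-data test. The sketch below is an assumption-laden illustration, not the PR's code: `is_usable`, `nodata`, `nodata_tol`, and the sample values are hypothetical, and -9999.0 is assumed as the no-data value.

```fortran
program explicit_nodata_check
  use, intrinsic :: ieee_arithmetic, only: ieee_is_nan, ieee_value, &
                                           ieee_quiet_nan
  implicit none

  real, parameter :: nodata     = -9999.0   ! assumed no-data value
  real, parameter :: nodata_tol = 1.0e-3    ! hypothetical tolerance

  real    :: sclprm(3)
  integer :: i

  ! Hypothetical sample values: valid, no-data, and NaN.
  sclprm(1) = 0.2
  sclprm(2) = nodata
  sclprm(3) = ieee_value(1.0, ieee_quiet_nan)

  do i = 1, size(sclprm)
     print '(a,i0,a,l1)', 'element ', i, ' usable: ', is_usable(sclprm(i))
  end do

contains

  ! A value is usable if it is neither NaN nor the no-data value.
  logical function is_usable(val)
    real, intent(in) :: val
    is_usable = .false.
    if (ieee_is_nan(val)) return                  ! test NaN before arithmetic
    if (abs(val - nodata) < nodata_tol) return    ! explicit no-data test
    is_usable = .true.
  end function is_usable

end program explicit_nodata_check
```

Compared with `val == val`, the named inquiry documents itself, which addresses the request for a clarifying comment. As noted in the review, a no-data value of 1e15 would need its own test.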

gmao-rreichle · 2026-05-11T14:41:38Z

+          if ( sclprm_mean_obs(j_ind, i_ind)==sclprm_mean_obs(j_ind, i_ind) .and.  &
+               sclprm_mean_mod(j_ind, i_ind)==sclprm_mean_mod(j_ind, i_ind) .and.  &
+               sclprm_std_obs(j_ind, i_ind) ==sclprm_std_obs(j_ind, i_ind)  .and.  &
+               sclprm_std_mod(j_ind, i_ind) ==sclprm_std_mod(j_ind, i_ind)  .and.  &


same as above

gmao-rreichle · 2026-05-11T14:44:01Z

+          if ( sclprm_mean_obs(ind)==sclprm_mean_obs(ind) .and.  &
+               sclprm_mean_mod(ind)==sclprm_mean_mod(ind) .and.  &
+               sclprm_std_obs( ind)==sclprm_std_obs( ind) .and.  &
+               sclprm_std_mod( ind)==sclprm_std_mod( ind) .and.  &


same as above

github-actions · 2026-05-11T14:49:43Z

This PR is being prevented from merging because you have added one of our blocking labels: Contingent - DNA, Needs Lead Approval, Contingent -- Do Not Approve. You'll need to remove it before this PR can be merged.

weiyuan-jiang added 3 commits May 8, 2026 16:44

work around compiler bug

554f786

weiyuan-jiang requested a review from a team as a code owner May 10, 2026 01:40

weiyuan-jiang added the 0-diff label May 10, 2026

change log

f97633d

weiyuan-jiang changed the title ~~Feature/wjiang/fix debug~~ fix crash when debugging May 11, 2026

gmao-rreichle reviewed May 11, 2026

gmao-rreichle added the bug fix and Contingent -- Do Not Approve labels May 11, 2026

gmao-rreichle marked this pull request as draft May 11, 2026 14:49

weiyuan-jiang wants to merge 4 commits into develop from feature/wjiang/fix_debug

weiyuan-jiang commented May 10, 2026 • edited by gmao-rreichle
