Integration of MEC workflow by andreaspauling · Pull Request #110 · MeteoSwiss/evalml

andreaspauling · 2026-02-12T14:42:42Z

Add the MEC workflow. The new parts are in green in the DAG: snakemake_dag.pdf

For each valid date a MEC case is set up and run. This includes:

creating the directory structure
adding the observations
organizing the model input including past runs depending on the config
rendering the MEC namelist
executing MEC for all dates with complete data for all leadtimes (excludes the first ones of the period)
storing the final feedback file in a separate place.

All MEC cases can be removed once the final feedback file is produced (removal not yet implemented).

Topics already raised by Francesco:
- put folder mec/ in data/mec in order not to mix up init and valid time (MEC is valid time oriented)
- check globbing options in MEC namelist with DWD (not documented, only FCR_TIME is supported afaik, * etc not). The aim is to avoid copying data.

… we want to factor it out of the rule

* Distinguish between primary runs ('candidates') and secondary runs * Docstrings

* Adopt forecast intervals including the end point * Fix parsing * Experiments work * Update config/forecasters.yaml * Align init times to availabiliy of COE * run pre-commit * Change README to COSMO-E availability --------- Co-authored-by: Jonas Bhend <jonasbhend@users.noreply.github.com> Co-authored-by: Jonas Bhend <jonas.bhend@meteoswiss.ch>

* draft changes * rename workspace resources dir * working for config/forecasters.yaml * improve logging * works for interpolators.yaml * re-add get_leadtime function * refactor run directives into script

* add region averages * add regions to config * Add regions to verification module, scripts, and rules * add stratification to forecaster config and fix typo * fix dict indexing * fix append error * read lon/lat from obs dataset * Add inner verification domain * Add missing dependency * add plots by region * Add regions to dashboard * Fix dashboard * Add region name and initializations to plot title (and remove header div) * Add support for multiple regions * Fix legend

…e-to-generate-namelist

…inference output)

dnerini

looking very nice @andreaspauling ! I have added a few initial thoughts form a quick look into your changes, but I plan to have a closer look soon!

workflow/rules/verif_obs.smk

dnerini · 2026-02-12T14:51:02Z

workflow/rules/verif_obs.smk

+# prepare_mec_input: setup run dir, gather observations and model data in the run dir for the actual init time
+rule prepare_mec_input:
+    input:
+        src_dir=OUT_ROOT / "data/runs/{run_id}/{init_time}/grib",


There is no rule giving this an output, so this should to the very list trigger some warnings from snakemake. You could specify it as a parameter instead.

dnerini · 2026-02-12T14:55:28Z

workflow/rules/verif_obs.smk

+        set -euo pipefail
+
+        # Run MEC inside sarus container
+        # Note: pull command currently needed only once to download the container


the pull command could then be factored out into a separate rule that is run only once before launching all the parallel MEC jobs

dnerini · 2026-02-12T14:56:45Z

workflow/Snakefile

 RESULTS_DIR = OUT_ROOT / "results" / EXPERIMENT_NAME

+# prefer one rule because snakemake complains about ambiguous rules (same output)
+ruleorder: prepare_inference_forecaster > prepare_inference_interpolator


need to have a closer look at this, I don0't understand why this problem would appear with your changes

Without it snakemake complains. Thanks for having a closer look at it.

Was this clarified?

workflow/Snakefile

frazane

Added some comments. The most important ones are those we already discussed (avoid copying data and directory structure) but I wanted to make sure we do not forget about them.

frazane · 2026-03-18T16:45:03Z

workflow/scripts/generate_mec_namelist.py

Could we get a "script summary log" at the beginning of the script? See other scripts for examples.

frazane · 2026-03-18T16:46:03Z

workflow/scripts/generate_mec_namelist.py

+    parser.add_argument(
+        "--namelist",
+        type=str,
+        help="Anything useful",


Missing an actual help message.

frazane · 2026-03-18T16:46:17Z

workflow/scripts/generate_mec_namelist.py

+    parser.add_argument(
+        "--template",
+        type=str,
+    )


Missing a help message.

frazane · 2026-03-18T16:46:29Z

workflow/scripts/generate_mec_namelist.py

+if __name__ == "__main__":
+    parser = ArgumentParser()
+
+    parser.add_argument("--steps", type=_parse_steps, default="0/120/6")


Missing a help message.

frazane · 2026-03-18T16:47:00Z

workflow/Snakefile

 RESULTS_DIR = OUT_ROOT / "results" / EXPERIMENT_NAME

+# prefer one rule because snakemake complains about ambiguous rules (same output)
+ruleorder: prepare_inference_forecaster > prepare_inference_interpolator


Was this clarified?

frazane · 2026-03-18T16:48:59Z

workflow/rules/verif_obs.smk

+    return list(range(start, end + 1, step))
+
+
+# TODO: merge with _ref_times from common.smk?


Not merged but perhaps could be moved to common.smk.

frazane · 2026-03-18T16:51:28Z

workflow/rules/verif_obs.smk

+        # concatenate all grib files in src_dir into a single file fc_file
+        echo "grib files processed:"
+        files=( "$src_dir"/20*.grib )
+        if (( ${{#files[@]}} )); then
+            printf '%s\n' "${{files[@]}}"
+            cat "${{files[@]}}" > "$fc_file"
+        else
+            echo "WARNING: no grib files found in $src_dir" >&2
+        fi


Is this really necessary? We are effectively duplicating the entire output data.

frazane · 2026-03-18T16:57:23Z

workflow/rules/verif_obs.smk

Could we move the mec directory from being inside each forecast run directory (output/data/runs/<run-id>/<init-time>/mec) to output/data/mec/<valid-time> so as to not mix up inittime-based directory structure and validtime-based directory structure? This is also the same approach adopted by osm in the operational archive.

dnerini and others added 30 commits October 7, 2025 14:01

Initial draft (pseudo code)

c1375ab

add namelist as resource

9f608f2

add verif_obs.smk to Snakefile

e82bd94

Add rules for observation data and namelist generation (using fake data)

c3ab651

add newline to namelist template

7512d96

somewhat working version of run_mec (with fake data)

13301a5

correct typo and add optional script for generating namelist, in case…

e722e5f

… we want to factor it out of the rule

fix: add localrule to inference_interpolator rule (#57)

3d9e3c1

Fix for interpolator rule

918913f

Consolidate multi packages into unique src/ dir (#58)

179eb4d

Update configs (#63)

e791a30

Adopt 'steps' instead of 'lead_time' (#62)

d197712

Update example config for experiment with interpolators (#70)

9568987

Distinguish between primary runs ('candidates') and secondary runs (#64)

128eb91

* Distinguish between primary runs ('candidates') and secondary runs * Docstrings

Mrb 550 inconcsistent forecast initializations in evalml (#72)

e028f59

Update vega-lite spec (#69)

5406777

Decouple inference preparation and execution (#68)

8d01490

* draft changes * rename workspace resources dir * working for config/forecasters.yaml * improve logging * works for interpolators.yaml * re-add get_leadtime function * refactor run directives into script

input data and namelist for MEC

04c4cf1

Merge remote-tracking branch 'origin/main' into MRB-534-Implement-rul…

b1959dc

…e-to-generate-namelist

Cleanup

23c9599

Refactor MEC namelist generation

804455a

setup MEC case

f793d85

add use of local MEC executable and cleaning

3839476

Support of mec in a sarus container

5b58b7a

target final feedback files

292878d

Fix linting

09f06da

Ensure newline at the end of MEC namelist

6776572

model data preparation for MEC

5d381f1

Andreas Pauling added 3 commits February 2, 2026 15:35

Merge branch 'main' into MRB-534-Implement-rule-to-generate-namelist

acce2f7

fix init_times_for_mec and add touch output/input (MEC waits for all …

f9a5889

…inference output)

Refactoring, bugfixes

99753e5

andreaspauling requested review from cosunae, dnerini and frazane February 12, 2026 14:42

Merge branch 'main' into MRB-534-Implement-rule-to-generate-namelist

5fa2a34

dnerini reviewed Feb 12, 2026

View reviewed changes

Andreas Pauling added 3 commits February 24, 2026 08:24

Formatting requirements

4d7191b

fix rule dependencies and feedback file naming

87a4d07

same feedback file naming as NWP

8e87ea2

frazane requested changes Mar 18, 2026

View reviewed changes

		return list(range(start, end + 1, step))


		# TODO: merge with _ref_times from common.smk?

Conversation

andreaspauling commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

dnerini left a comment

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

frazane left a comment

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

5 participants

andreaspauling commented Feb 12, 2026 •

edited

Loading