Multi observation type inversion by cpjordan · Pull Request #429 · thetisproject/thetis

cpjordan · 2025-11-10T09:55:18Z

Currently, the inversion tools are set up to deal with either a scalar or vector quantity (typically elevation or velocity) but we might want to optimise for both simultaneously.

Add a wrapper class which contains multiple StationObservationManagers
Add some more stations for the headland inversion example

For real cases this is a useful way of preventing control values heading to the extremes where one type of data is sparse. For example, near coastlines we might have some ADCP data, but in the open sea we're more likely to have elevation gauges. I have found that friction values can become exceedingly large in deeper areas which is reflected in gauge harmonic amplitudes.

TODO:

Fix a few small bugs I've noticed:
hanging in parallel again (fix no longer works with multiple control stations)
how the final control values are accessed for IPS/regions, exported fields are right but printed values are not
Some additional new things that will be needed:
weighting between observation types (or at least warnings!)
update the plotting
update the channel and tsunami examples

- Details are in the README - inversion of Manning field using different field representations: - Constant - Region-based - Independent Points Scheme - Free specification at all points in the domain (Hessian regularised)

- Also fix hanging in parallel when using weighting of stations in inversion examples

- Update examples accordingly

- The cost function scaling didn't make sense to be passed here and I think was possibly applied 2x so took it out here

- self.control_coeff_list does not get updated from initial values and is just for exporting purposes

- using weighting by station is essentially using NMSE instead of MSE, therefore you shouldn't weight elevation or velocity differently by default - warn if the user does - NMSE includes noise so you might have a smooth observed elevation signal and noisy velocity, so we do want to allow the user to weight it if they want to

- J_elev is ~ 1e-6 * J_vel, would need a better example for both to play a role

- should really add functionality to monitor more than just the observation parameter during optimisation

cpjordan · 2025-12-15T11:13:56Z

The only issue with the current code is that for real-world applications, we might want to make some datum corrections (as per this publication (Section 3.2) and in this code) which means we need to assess the cost function at the end of the forward run rather than at every timestep - I've got a branch for this as it's an easy fix but I would do this as a follow-up PR.

stephankramer

Just some comments/queries. Happy for this to go in as is as well

stephankramer · 2026-01-28T10:45:32Z

                    (f"[{fd.COMM_WORLD.rank}] ERROR: Check for NaNs. Found non-finite variances of "
                     f"observation data: {var.dat.data[:]}")
-            self.sta_manager.station_weight_0d.interpolate(1 / var)
+                # in parallel some mesh partitions will have no stations so we need a conditional to avoid div by 0


Why would that be a problem? Presumably local partitions with no stations, this expression isn't evaluated at all? The var is a variance per station, on a VOM, I think is that right? So in what location/node would we have var==0 where that conditional would apply?

stephankramer · 2026-01-28T11:08:19Z

+            J = som.eval_cost_function(t)  # individual manager will assert initialized
+            components[name] = float(J)
+            J_total += J
+        return J_total, components


Why is this returning a components dictionary? The reason I ask is that the only place it seems to expect a second output from eval_cost_function() is in a place where it explicitly checks for isinstance(sta_manager, MultiStationObservationManager) to deal with that - but then it actually throws away the extra components output.

Generally speaking having (many) of these

if isinstance(obj, ClassVersionA): # call obj.method is some way else: # call obj.method in slightly different way because the API is different

are a bit of a code smell. If you want to treat both StationObservationManager and MultiStationObservationManager as the same kind of thing in part of your code, it's better to think of a well defined common API, maybe using an abstract StationObservationManagerBase that both inherit from. It's fine to then extend beyond that common API, e.g. you could have a different named evaluation method, that does return a component dictionary - but the methods in the common API should have the same (required) input arguments and the same outputs.

cpjordan added 15 commits November 7, 2025 16:56

Add headland inversion example

883e970

- Details are in the README - inversion of Manning field using different field representations: - Constant - Region-based - Independent Points Scheme - Free specification at all points in the domain (Hessian regularised)

lint fix

0eadd9f

VTKFile path

a7196d3

Add DiagnosticCallback for inversion cost function

e74596f

- Also fix hanging in parallel when using weighting of stations in inversion examples

Provide uv as option for calibration in inversion tools

c0ac8cf

- Update examples accordingly

Remove unused import

5eedb29

Last fixes

11f83a3

Add multi-observation inversion

17a5fe2

Update README image

ae78445

Add new stations for plotting

44c0a02

Fix other examples

bf2fac6

- The cost function scaling didn't make sense to be passed here and I think was possibly applied 2x so took it out here

Return the right values!!!!!

d424bf4

- self.control_coeff_list does not get updated from initial values and is just for exporting purposes

Default to consistent scaling

cfb8a1a

- J_elev is ~ 1e-6 * J_vel, would need a better example for both to play a role

Update plotting

11cd365

- should really add functionality to monitor more than just the observation parameter during optimisation

cpjordan marked this pull request as ready for review November 10, 2025 17:05

cpjordan mentioned this pull request Nov 11, 2025

Inverse modelling (adjoint) improvements #413

Open

3 tasks

cpjordan and others added 3 commits December 4, 2025 11:24

Merge branch 'main' into multi-observation-type-inversion

ade2a8f

forward collect_time_series

917271a

Merge branch 'main' into multi-observation-type-inversion

08a1d44

cpjordan added 2 commits January 7, 2026 10:11

Merge branch 'main' into multi-observation-type-inversion

8ad856a

Merge branch 'main' into multi-observation-type-inversion

57803f8

stephankramer approved these changes Jan 28, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Multi observation type inversion#429

Multi observation type inversion#429
cpjordan wants to merge 20 commits intomainfrom
multi-observation-type-inversion

cpjordan commented Nov 10, 2025 •

edited

Loading

Uh oh!

cpjordan commented Dec 15, 2025

Uh oh!

stephankramer left a comment

Uh oh!

stephankramer Jan 28, 2026

Uh oh!

stephankramer Jan 28, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

Conversation

cpjordan commented Nov 10, 2025 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Uh oh!

cpjordan commented Dec 15, 2025

Uh oh!

stephankramer left a comment

Choose a reason for hiding this comment

Uh oh!

stephankramer Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

stephankramer Jan 28, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

2 participants

cpjordan commented Nov 10, 2025 •

edited

Loading