Implement the Stop-Gradient DPF #202
Closed
DanWaxman wants to merge 0 commits into state-space-models:main from
Conversation
Contributor
Author
This will currently fail the noop tests in
Contributor
#207 is merged @DanWaxman so you can proceed with implementing this as a decorator I guess!
2bbe624 to 7b1a120
Currently, the PF defined in the SMC classes is not differentiable, in the sense that estimators of the score (i.e., the gradient of the marginal log-likelihood with respect to the parameters) are biased. In fact, automatic differentiation is currently broken entirely on CPU, and the bias persists even after fixing that. This is unfortunate because it limits system-identification methods that rely on unbiased score estimates, such as SGMCMC.
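To make the terminology concrete, here is a minimal illustration of what "score estimate" means, using a trivial closed-form model rather than anything from this repo (assuming a JAX stack; the function names are illustrative only):

```python
import jax
from jax.scipy.stats import norm

# Marginal log-likelihood of a one-observation toy model y ~ N(theta, 1).
def mll(theta, y):
    return norm.logpdf(y, loc=theta, scale=1.0)

# The score is grad_theta of the MLL; analytically it equals (y - theta).
score = jax.grad(mll)(0.5, 1.5)
print(score)  # close to 1.0, since y - theta = 1.5 - 0.5
```

For a PF, `mll` is replaced by the (stochastic) particle estimate of the marginal log-likelihood, and the issue is that differentiating through that estimate gives a biased score.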
There are several ways to ameliorate this (cf. the PyDPF paper), but one of the simplest is the stop-gradient trick, which recovers some classical gradient estimators via a simple `+ term - term` surrogate: a term whose forward value cancels to zero but whose gradient restores the contribution lost in the discrete resampling step.
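A minimal sketch of the trick applied to multinomial resampling (again assuming JAX; this is illustrative, not the API of this PR):

```python
import jax
import jax.numpy as jnp

def resample_with_stop_gradient(key, particles, log_weights):
    """Multinomial resampling plus a zero-valued surrogate term.

    The surrogate `log_w - stop_gradient(log_w)` is identically zero in
    the forward pass, so all estimates are unchanged, but its gradient
    restores the score contribution that the discrete ancestor draw
    otherwise blocks from autodiff.
    """
    n = log_weights.shape[0]
    probs = jax.nn.softmax(log_weights)
    ancestors = jax.random.choice(key, n, shape=(n,), p=probs)
    resampled = particles[ancestors]
    lw = log_weights[ancestors]
    surrogate = lw - jax.lax.stop_gradient(lw)  # value 0, gradient nonzero
    return resampled, surrogate

key = jax.random.PRNGKey(0)
particles = jnp.arange(4.0)
log_w = jnp.log(jnp.array([0.1, 0.2, 0.3, 0.4]))
res, sur = resample_with_stop_gradient(key, particles, log_w)
print(bool(jnp.allclose(sur, 0.0)))  # True: forward pass unchanged
```

The surrogate terms are accumulated into the MLL estimate, which is why the DPF's forward-pass estimates match the plain PF exactly while its gradients differ.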
This PR implements the stop-gradient version of the DPF and compares the resulting MLL and score estimates of a bootstrap PF against the analytically known values in a linear-Gaussian SSM. The results are as expected: both the PF as-is and the DPF produce good MLL estimates, but the score estimates of the PF as-is are highly biased:
This bias persists under many Monte Carlo draws, even at the true parameters:
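For reference, the analytic baseline in such a comparison is the exact MLL from a Kalman filter, whose gradient gives the exact score. A self-contained 1D sketch (illustrative only, not the repo's implementation):

```python
import jax
import jax.numpy as jnp
from jax.scipy.stats import norm

def kalman_mll(params, ys):
    """Exact MLL of a 1D linear-Gaussian SSM:
    x_t = a * x_{t-1} + N(0, q),  y_t = x_t + N(0, r),  x_0 ~ N(0, 1)."""
    a, q, r = params
    def step(carry, y):
        m, p = carry
        m_pred, p_pred = a * m, a**2 * p + q        # predict
        s = p_pred + r                              # innovation variance
        ll = norm.logpdf(y, loc=m_pred, scale=jnp.sqrt(s))
        k = p_pred / s                              # Kalman gain
        return (m_pred + k * (y - m_pred), (1 - k) * p_pred), ll
    _, lls = jax.lax.scan(step, (0.0, 1.0), ys)
    return jnp.sum(lls)

ys = jnp.array([0.3, -0.1, 0.5])
params = jnp.array([0.9, 0.5, 0.4])
exact_score = jax.grad(kalman_mll)(params, ys)  # ground truth for the PF/DPF
```

Averaging the PF and DPF score estimates over many seeds and comparing to `exact_score` is what exposes the bias of the PF as-is.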
The implementation adds minimal overhead (which is also in line with the paper):
See #201 for the design discussion.
N.b., there is currently a notebook illustrating the results in docs/examples. It should probably be moved or deleted eventually, but it seemed useful for review and discussion purposes.