release: To Prod by suisuss · Pull Request #1581 · KeeperHub/keeperhub

suisuss · 2026-06-18T00:52:10Z

No description provided.

The Runs tab now shows a 'v{n}' chip on each run, resolved server-side from the run's executed_workflow_hash to its workflow_history version (timestamp- aware so a reverted content hash maps to the version in effect at run time). Clicking the chip opens the History tab and highlights that version (via ?version=N, which History already rings/expands/scrolls to).

feat(runs): show the workflow version each run executed, linkable to History

keeperhub_workflow_errors_by_workflow was an all-time cumulative count filtered only on status='error'. At prod scale that query matched every error execution across all orgs and joined them before narrowing to managed orgs, tripping the 8s metrics statement_timeout (Postgres 57014) on most scrapes. The catch returned [], so reset() emptied the gauge and it flapped (present ~9% of scrapes), which (a) starved the managed-client alert of data and (b) combined with the alert's offset-delta guard to read the full cumulative count as a 1h delta -> false-positive pages. Change the gauge to a rolling-1h count: add completed_at >= now() - interval '1 hour' to the query, backed by a new partial index idx_workflow_executions_error_completed_at (status='error') so the lookup is an index range scan that finishes in ms and never times out. Metric name and labels are unchanged; only the value semantics move from all-time cumulative to last-hour count, which matches the alert's rolling-60-minute definition and lets the alert read the gauge directly (the companion infra change drops the offset/guard math). Migration is @requires-db-prep: an operator applies the index CONCURRENTLY out-of-band; the in-file CREATE INDEX IF NOT EXISTS is a no-op on prod and a fast build on dev/PR DBs. Refs TECH-48

The per-workflow managed-client error gauge was never added to METRICS_REFERENCE.md when it was introduced. Document it with the windowed (last-1h, not cumulative) semantics and its role in the Sky/Ajna managed-client alerts. Refs TECH-48

Comment cited a non-existent idx_workflow_executions_status_completed_at; the migration creates idx_workflow_executions_error_completed_at. Align the comment with the actual index name (PR #272 review, item 4). Refs TECH-48

…tric fix(metrics): window per-workflow error gauge to last 1h

joelorzet and others added 4 commits June 17, 2026 13:24

Merge pull request #1579 from KeeperHub/feat/execution-version-link

a4607eb

feat(runs): show the workflow version each run executed, linkable to History

suisuss temporarily deployed to staging June 18, 2026 00:52 — with GitHub Actions Inactive

chong-techops and others added 2 commits June 18, 2026 11:38

docs(metrics): fix index name in query rationale comment

c7eda3b

Comment cited a non-existent idx_workflow_executions_status_completed_at; the migration creates idx_workflow_executions_error_completed_at. Align the comment with the actual index name (PR #272 review, item 4). Refs TECH-48

Merge pull request #1580 from KeeperHub/fix/tech-48-windowed-error-me…

f06b368

…tric fix(metrics): window per-workflow error gauge to last 1h

suisuss temporarily deployed to staging June 18, 2026 02:25 — with GitHub Actions Inactive

suisuss added db-prepped-prod Operator applied lock-free DDL to prod DB; safe to merge metrics-db-reviewed Reviewer sign-off: metrics aggregate queries optimised + tables indexed (KEEP-680) labels Jun 18, 2026

suisuss temporarily deployed to staging June 18, 2026 02:28 — with GitHub Actions Inactive

suisuss temporarily deployed to staging June 18, 2026 02:39 — with GitHub Actions Inactive

suisuss merged commit 49ce37f into prod Jun 18, 2026
58 of 60 checks passed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

release: To Prod#1581

release: To Prod#1581
suisuss merged 6 commits into
prodfrom
staging

suisuss commented Jun 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

suisuss commented Jun 18, 2026

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants