Skip to content

PS-11179: Add ARM health probe + durability init scripts to ps3-k8s#101

Merged
nogueiraanderson merged 1 commit into
mainfrom
PS-11179-ps3k8s-arm-health-probe
Jun 5, 2026
Merged

PS-11179: Add ARM health probe + durability init scripts to ps3-k8s#101
nogueiraanderson merged 1 commit into
mainfrom
PS-11179-ps3k8s-arm-health-probe

Conversation

@nogueiraanderson
Copy link
Copy Markdown
Collaborator

Bug

  • The in-cluster jenkins-ps3-k8s controller's init ConfigMap held only a neutered ec2FleetCloud.groovy; the hetznerArmHealth.groovy probe and durability.groovy from the EC2 ps3 set were never ported. The HETZNER_ARM64_HEALTHY probe never ran (gen=null, flag frozen ~4.8 days), so resolveArmWorker's arm64 Hetzner-to-Graviton fallback was inert on ps3.

Fix

  • Add both scripts to the jenkins-ps3-k8s-clouds-init ConfigMap; JCasC cannot express a PeriodicWork or setDurabilityHint(), so they stay as init.groovy.d keys.
  • init.groovy.d is boot-only and this ConfigMap is not sidecar-watched, so it activates on the next controller restart, not via hot-reload; both were restored live via Script Console so the probe is active now.

Tickets

  • PS-11179: ARM Graviton fallback (probe)
  • PS-11173: ps3 init.groovy.d parity / durability

- Port the two imperative init.groovy.d scripts the in-cluster ps3
  controller was missing into the jenkins-ps3-k8s-clouds-init ConfigMap:
  hetznerArmHealth.groovy (publishes HETZNER_ARM64_HEALTHY that drives
  resolveArmWorker's arm64 Hetzner-to-Graviton fallback) and
  durability.groovy (anchors MAX_SURVIVABILITY at boot).
- JCasC cannot express a PeriodicWork or setDurabilityHint(), so these
  stay as init.groovy.d keys beside the now-neutered ec2FleetCloud.groovy.
- Without the probe its gen stayed null and the flag was frozen ~4.8d,
  so the arm64 fallback never engaged on ps3.
@nogueiraanderson nogueiraanderson merged commit cb02a43 into main Jun 5, 2026
4 checks passed
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

None yet

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant