Add experiment signals to fleet remote config by khewonc · Pull Request #2872 · DataDog/datadog-operator

khewonc · 2026-04-07T13:31:03Z

What does this PR do?

Adds experiment signals: start, stop, and promote. This depends on #2838. The latest commit contains the relevant changes

Motivation

https://datadoghq.atlassian.net/browse/CONTP-1424
https://datadoghq.atlassian.net/browse/CONTP-1425
https://datadoghq.atlassian.net/browse/CONTP-1426

Additional Notes

Merge after #2838

Minimum Agent Versions

Are there minimum versions of the Datadog Agent and/or Cluster Agent required?

Agent: vX.Y.Z
Cluster Agent: vX.Y.Z

Describe your test plan

TBD

Checklist

PR has at least one valid label: bug, enhancement, refactoring, documentation, tooling, and/or dependencies
PR has a milestone or the qa/skip-qa label
All commits are signed (see: signing commits)

chatgpt-codex-connector

💡 Codex Review

Here are some automated review suggestions for this pull request.

Reviewed commit: 2e153f00f4

ℹ️ About Codex in GitHub

Codex has been enabled to automatically review pull requests in this repo. Reviews are triggered when you

Open a pull request for review
Mark a draft as ready
Comment "@codex review".

If Codex has suggestions, it will comment; otherwise it will react with 👍.

When you sign up for Codex through ChatGPT, Codex can also answer questions or update the PR, like "@codex address that feedback".

codecov-commenter · 2026-04-07T13:39:47Z

Codecov Report

❌ Patch coverage is 72.84595% with 104 lines in your changes missing coverage. Please review.
✅ Project coverage is 40.54%. Comparing base (a646370) to head (a816a2a).
⚠️ Report is 1 commits behind head on main.

Files with missing lines	Patch %	Lines
pkg/fleet/daemon.go	69.13%	58 Missing and 17 partials ⚠️
pkg/fleet/experiment.go	83.67%	7 Missing and 1 partial ⚠️
internal/controller/datadogagent/experiment.go	82.50%	4 Missing and 3 partials ⚠️
internal/controller/datadogagent/revision.go	79.41%	4 Missing and 3 partials ⚠️
pkg/remoteconfig/updater.go	0.00%	4 Missing ⚠️
cmd/main.go	0.00%	3 Missing ⚠️

Additional details and impacted files

@@            Coverage Diff             @@
##             main    #2872      +/-   ##
==========================================
+ Coverage   40.03%   40.54%   +0.51%     
==========================================
  Files         319      321       +2     
  Lines       28066    28514     +448     
==========================================
+ Hits        11235    11560     +325     
- Misses      16008    16107      +99     
- Partials      823      847      +24

Flag	Coverage Δ
unittests	`40.54% <72.84%> (+0.51%)`	⬆️

Flags with carried forward coverage won't be shown. Click here to find out more.

Files with missing lines	Coverage Δ
pkg/fleet/remote_config.go	`100.00% <100.00%> (ø)`
cmd/main.go	`6.66% <0.00%> (ø)`
pkg/remoteconfig/updater.go	`0.00% <0.00%> (ø)`
internal/controller/datadogagent/experiment.go	`85.15% <82.50%> (+0.37%)`	⬆️
internal/controller/datadogagent/revision.go	`78.12% <79.41%> (-3.39%)`	⬇️
pkg/fleet/experiment.go	`83.67% <83.67%> (ø)`
pkg/fleet/daemon.go	`65.45% <69.13%> (+65.45%)`	⬆️

... and 4 files with indirect coverage changes

Continue to review full report in Codecov by Sentry.

Legend - Click here to learn more
Δ = absolute <relative> (impact), ø = not affected, ? = missing data
Powered by Codecov. Last update a646370...a816a2a. Read the comment docs.

🚀 New features to boost your workflow:

❄️ Test Analytics: Detect flaky tests, report on failures, and find test suite problems.

* Add cluster_name as tag * Add updater_type

datadog-prod-us1-6 · 2026-04-15T17:26:14Z

✨ Fix all issues with BitsAI

🛑 Gate Violations

🎯 1 Code Coverage issue detected

A Patch coverage percentage gate may be blocking this PR.

• Patch coverage: 71.83% (threshold: 80.00%)

ℹ️ Info

🎯 Code Coverage (details)
• Patch Coverage: 71.83%
• Overall Coverage: 40.61% (+0.50%)

_{This comment will be updated automatically if new data arrives.

🔗 Commit SHA: a816a2a | Docs | Datadog PR Page | Was this helpful? React with 👍/👎 or give us feedback!}

coignetp · 2026-04-16T07:56:18Z

+
+	// Apply the spec patch.
+	if err := retryWithBackoff(ctx, func() error {
+		return d.client.Patch(ctx, dda, client.RawPatch(types.MergePatchType, op.Config))


❓ question: ‏Is this a blocking call? Else, any UPDATER_TASK instruction will block any further UPDATER_TASK until the first one is done.
This means we can't stop a change from the backend

This is indeed a blocking call. Both spec patch and status update are needed to start the experiment before stoping it though. If we patch the spec, then cancel the start signal to stop, stopExperiment would see there's no active experiment (since the status wasn't updated yet) and fail. The patched spec would still be applied, but with no experiment to track it. If we cancel the start signal before patching, then there wouldn't be an experiment to stop either

coignetp · 2026-04-16T07:59:54Z

 				continue
 			}

 			seen[req.ID] = struct{}{}


❓ question: ‏Should it be before the h(req)?

I'm not entirely certain on this, but I think either would be fine

khewonc added this to the v1.26.0 milestone Apr 7, 2026

khewonc added the enhancement New feature or request label Apr 7, 2026

khewonc requested a review from a team April 7, 2026 13:31

khewonc requested review from a team as code owners April 7, 2026 13:31

github-actions bot added team/container-platform team/container-autoscaling team/fleet labels Apr 7, 2026

chatgpt-codex-connector bot reviewed Apr 7, 2026

View reviewed changes

Comment thread internal/controller/datadogagent/experiment.go

Comment thread internal/controller/datadogagent/controller_reconcile_v2.go

initial experiment signals

1542192

khewonc force-pushed the khewonc/experiment-signals branch from 2e153f0 to e686774 Compare April 7, 2026 19:36

github-actions bot removed the team/container-autoscaling label Apr 7, 2026

Review suggestions

d0ff10a

khewonc force-pushed the khewonc/experiment-signals branch from e686774 to d0ff10a Compare April 7, 2026 20:03

coignetp and others added 12 commits April 8, 2026 11:14

[FA] Add cluster_name as tag (#2878)

120e0c5

* Add cluster_name as tag * Add updater_type

add test logs

70edfda

more logs

e0d5b37

use installer config

9162303

Logging + race

6a42afc

More edge cases

b6d8b7f

Add package state pop

bc752d3

Fix lint

1d84dc2

package state

acb73f4

Check expected state

c7fffcd

no nsn for stop/promote

475d416

Tweak logging

8397e6c

github-actions bot added the team/container-autoscaling label Apr 15, 2026

Merge branch 'main' into khewonc/experiment-signals

eb7b351

coignetp reviewed Apr 16, 2026

View reviewed changes

Review suggestions + 2x promote fix

a816a2a

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add experiment signals to fleet remote config#2872

Add experiment signals to fleet remote config#2872
khewonc wants to merge 16 commits intomainfrom
khewonc/experiment-signals

khewonc commented Apr 7, 2026

Uh oh!

chatgpt-codex-connector bot left a comment

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Apr 7, 2026 •

edited

Loading

Uh oh!

datadog-prod-us1-6 bot commented Apr 15, 2026 •

edited by datadog-prod-us1-3 bot

Loading

🎯 1 Code Coverage issue detected

Uh oh!

coignetp Apr 16, 2026

Uh oh!

khewonc Apr 16, 2026

Uh oh!

Uh oh!

Uh oh!

coignetp Apr 16, 2026

Uh oh!

khewonc Apr 16, 2026

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

Conversation

khewonc commented Apr 7, 2026

What does this PR do?

Motivation

Additional Notes

Minimum Agent Versions

Describe your test plan

Checklist

Uh oh!

chatgpt-codex-connector bot left a comment

Choose a reason for hiding this comment

💡 Codex Review

Uh oh!

Uh oh!

Uh oh!

codecov-commenter commented Apr 7, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Codecov Report

Uh oh!

datadog-prod-us1-6 bot commented Apr 15, 2026 • edited by datadog-prod-us1-3 bot Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

🛑 Gate Violations

🎯 1 Code Coverage issue detected

ℹ️ Info

Uh oh!

coignetp Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

khewonc Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

coignetp Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

khewonc Apr 16, 2026

Choose a reason for hiding this comment

Uh oh!

Uh oh!

Uh oh!

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

codecov-commenter commented Apr 7, 2026 •

edited

Loading

datadog-prod-us1-6 bot commented Apr 15, 2026 •

edited by datadog-prod-us1-3 bot

Loading