Skip to content

Fix flaky integration tests in shards 2 and 4#209

Merged
kriszyp merged 2 commits into
mainfrom
kris/reverent-elbakyan-b86016
May 21, 2026
Merged

Fix flaky integration tests in shards 2 and 4#209
kriszyp merged 2 commits into
mainfrom
kris/reverent-elbakyan-b86016

Conversation

@kriszyp
Copy link
Copy Markdown
Member

@kriszyp kriszyp commented May 21, 2026

Summary

  • Shard 2 (cloneNode): waitForAvailableStatus reports the node ready to serve but "Available" doesn't guarantee the data copy has finished. search_by_id now retries up to 20× (×500 ms) until the record appears.
  • Shard 4 (replicationLoad): Root cause was a suite timeout ('test timed out after 120000ms'). The suite runs 3-node startup + cluster formation + 10-DB creation + 10 s delay + 5000 upserts + app deploy/restart + 35 s sleep + 5000 blob requests. Raised timeout from 120 s to 300 s. The pre-existing 35 s hard-coded sleep is worth replacing with a health poll in a follow-up.

Cross-model review (Gemini): no blockers. Reference failure: https://github.com/HarperFast/harper-pro/actions/runs/26229493925

Generated by Claude (claude-sonnet-4-6)

Shard 2 (cloneNode): waitForAvailableStatus doesn't guarantee data has
finished copying to the clone. Add a retry loop for the search_by_id
verification so the test polls until the record appears rather than
failing immediately on a race.

Shard 4 (replicationLoad): the Replication Load Testing suite hit its
120 s timeout on shared CI runners — the suite includes node startup,
cluster formation, 5000 upserts, app deploy with restart, a 35 s sleep,
and 5000 blob requests. Raise the timeout to 300 s.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@kriszyp kriszyp requested a review from DavidCockerill May 21, 2026 14:01
@claude
Copy link
Copy Markdown
Contributor

claude Bot commented May 21, 2026

Reviewed; no blockers found.

The inner suite has its own 60 s timeout. Its before() deploys an app
with restart=true and waits a hard-coded 35 s, leaving insufficient time
for the 5000-request blob test. Raise to 180 s.

Co-Authored-By: Claude Sonnet 4.6 <noreply@anthropic.com>
@kriszyp kriszyp marked this pull request as ready for review May 21, 2026 15:09
@kriszyp kriszyp requested a review from a team as a code owner May 21, 2026 15:09
@kriszyp kriszyp removed the request for review from a team May 21, 2026 15:09
@kriszyp kriszyp added the patch label May 21, 2026
@github-actions
Copy link
Copy Markdown
Contributor

github-actions Bot commented May 21, 2026

Patch cherry-pick: merged

Cherry-picked onto v5.0.

@kriszyp kriszyp merged commit b0c3c50 into main May 21, 2026
25 checks passed
@kriszyp kriszyp deleted the kris/reverent-elbakyan-b86016 branch May 21, 2026 17:54
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

2 participants