Skip to content

Add ECK cert mismatch causing stack connection failure#9342

Merged
pebrc merged 6 commits intomainfrom
kunisen-docpr-stl-1776
Apr 13, 2026
Merged

Add ECK cert mismatch causing stack connection failure#9342
pebrc merged 6 commits intomainfrom
kunisen-docpr-stl-1776

Conversation

@kunisen
Copy link
Copy Markdown
Contributor

@kunisen kunisen commented Apr 10, 2026

Description

During troubleshooting, we (support) found that,

  • During or after the upgrade of the Elastic Cloud on Kubernetes (ECK) operator to 3.3.x, HTTP or transport certificate issues can arise due to mismatched Authority Key Identifier (AKI) and Subject Key Identifier (SKI) values. This results in SSL handshake failures and prevents ES nodes from joining the cluster or Kibana/Fleet/other HTTP clients from connecting to it.
  • KB article: https://support.elastic.co/knowledge/5198af8e (internal view: https://support.elastic.dev/knowledge/view/5198af8e)

We would like to raise a doc PR to record this as ECK known issue for version 3.3.0-3.3.2.

The fix is tentatively scheduled for release in ECK 3.4.0 per #9197.

View / preview

After merge:
https://www.elastic.co/docs/release-notes/cloud-on-k8s/known-issues

Before merge:
docs/release-notes/known-issues.md


cc @jeanfabrice

@kunisen kunisen self-assigned this Apr 10, 2026
@kunisen kunisen added >docs Documentation known_issue labels Apr 10, 2026
@kunisen kunisen requested a review from a team as a code owner April 10, 2026 11:38
@prodsecmachine
Copy link
Copy Markdown
Collaborator

prodsecmachine commented Apr 10, 2026

Snyk checks have passed. No issues have been found so far.

Status Scan Engine Critical High Medium Low Total (0)
Open Source Security 0 0 0 0 0 issues
Licenses 0 0 0 0 0 issues

💻 Catch issues earlier using the plugins for VS Code, JetBrains IDEs, Visual Studio, and Eclipse.

@kunisen kunisen requested review from pebrc and pkoutsovasilis April 10, 2026 11:38
@github-actions
Copy link
Copy Markdown

github-actions bot commented Apr 10, 2026

🔍 Preview links for changed docs

@github-actions
Copy link
Copy Markdown

github-actions bot commented Apr 10, 2026

✅ Vale Linting Results

No issues found on modified lines!


The Vale linter checks documentation changes against the Elastic Docs style guide.

To use Vale locally or report issues, refer to Elastic style guide for Vale.

@jeanfabrice
Copy link
Copy Markdown
Contributor

Thanks for writing the PR @kunisen
Is this going to appear for all 3.3.x version, or only for 3.3.2? IIUC, all 3.3.x versions are affected.

Copy link
Copy Markdown
Collaborator

@pebrc pebrc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Code Review — PR #9342

Thanks for documenting this known issue! A few suggestions to improve the entry:

Findings

1. Missing entries for 3.3.0 and 3.3.1 (medium)
The cert mismatch affects versions 3.3.0–3.3.2, but the entry is only added under the 3.3.2 section. The established pattern in this file (see the FIPS BoringCrypto issue) is to duplicate the known issue entry under every affected version. The same dropdown should also appear under the 3.3.1 and 3.3.0 sections.

2. Workaround should be self-contained (medium)
The workaround currently says "follow the KB article to regenerate relevant certificates without upgrading," but the KB article likely requires Elastic support access — external users would hit a login wall. The actual workaround is simple enough to inline directly:

Delete the transport certificate secret (<cluster>-es-<nodeset>-es-transport-certs) and the HTTP certificate secret (<cluster>-es-http-certs-internal) to force ECK to regenerate all certificates. Alternatively, upgrade to ECK 3.4.0 or later.

The KB link can remain as a "for more details" reference, but the actionable steps should not be gated behind it.

3. "Upgrade to ECK 3.4.0" reads as already released (nit)
The PR description says the fix is "tentatively scheduled for release in ECK 3.4.0." If 3.4.0 hasn't shipped yet when this merges, the wording could be softened (e.g., "once available").

4. Trailing whitespace (nit)
Line 30 has a trailing space.

@kunisen
Copy link
Copy Markdown
Contributor Author

kunisen commented Apr 13, 2026

@jeanfabrice thanks

Is this going to appear for all 3.3.x version, or only for 3.3.2? IIUC, all 3.3.x versions are affected.

Yes. I wrote it in the expanded text During or after upgrading the ECK operator to 3.3.0–3.3.2 originally, but per your suggestion and also Peter's, I added the wording blocks to all 3.3 patch versions respectively.


@pebrc thanks.

  1. Missing entries for 3.3.0 and 3.3.1 (medium)

Done by adding the wording blocks to all 3.3 patch versions respectively.

  1. Workaround should be self-contained (medium)

Done by amending the wording per the suggestion.

  1. "Upgrade to ECK 3.4.0" reads as already released (nit)

Done by amending the wording per the suggestion.

  1. Trailing whitespace (nit)

Done by removing the trailing space.


@pebrc thanks for your review. could you please check in advance?

Copy link
Copy Markdown
Collaborator

@pebrc pebrc left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

LGTM

@kunisen
Copy link
Copy Markdown
Contributor Author

kunisen commented Apr 13, 2026

Thanks @pebrc can you help me merge this since I don't have permission -
image

Or if you could grant me the permission to merge, that's also fine (I have such permission for some internal repos).
Thanks!

@pebrc pebrc merged commit dc5b594 into main Apr 13, 2026
9 checks passed
@pebrc pebrc deleted the kunisen-docpr-stl-1776 branch April 13, 2026 12:10
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

Projects

None yet

Development

Successfully merging this pull request may close these issues.

4 participants