Skip to content

fix: remove Glue RCA Check 2 causing 100% false-positive failures#60

Merged
dwsmith1983 merged 1 commit intomainfrom
fix/glue-rca-false-positive
Mar 8, 2026
Merged

fix: remove Glue RCA Check 2 causing 100% false-positive failures#60
dwsmith1983 merged 1 commit intomainfrom
fix/glue-rca-false-positive

Conversation

@dwsmith1983
Copy link
Copy Markdown
Owner

Summary

  • Remove verifyGlueRCA Check 2 which queried /aws-glue/jobs/error with a filter pattern that matched benign JVM startup output in every Glue job's stderr
  • Every SUCCEEDED Glue job was being reclassified as FAILED due to classpath entries like -XX:OnOutOfMemoryError and Glue's internal AnalyzerLogHelper messages containing "Error" as substrings
  • Check 1 (GlueExceptionAnalysisJobFailed in the RCA log stream) remains — this is Glue's purpose-built mechanism for detecting false successes
  • Post-run validation provides the application-level safety net for data quality issues
  • Adds v0.7.1 changelog entry

The verifyGlueRCA Check 2 filtered /aws-glue/jobs/error for error
indicators (?Exception ?Error ?FATAL ...), but every Glue job's stderr
contains benign JVM startup output with "Error" in classpath entries
(e.g., -XX:OnOutOfMemoryError) and Glue's internal AnalyzerLogHelper
messages. This caused every SUCCEEDED job to be reclassified as FAILED.

Check 1 (GlueExceptionAnalysisJobFailed in the RCA log stream) is
Glue's purpose-built mechanism for detecting false successes and is
sufficient. Post-run validation provides the application-level safety
net for data quality issues.
@github-actions github-actions bot added tests Test changes docs Documentation triggers Trigger types labels Mar 8, 2026
@dwsmith1983 dwsmith1983 self-assigned this Mar 8, 2026
@dwsmith1983 dwsmith1983 merged commit f046124 into main Mar 8, 2026
6 checks passed
@dwsmith1983 dwsmith1983 deleted the fix/glue-rca-false-positive branch March 8, 2026 14:48
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment

Labels

docs Documentation tests Test changes triggers Trigger types

Projects

None yet

Development

Successfully merging this pull request may close these issues.

1 participant