observability: backend panic and maestro alerts #4869
[APPROVALNOTIFIER] This PR is APPROVED. This pull-request has been approved by: geoberle
Force-pushed from 1b9471b to 6277628.
* Added the BackendControllerPanic alert to detect panics caught by the runtime panic handler
* Added Maestro alerts: MaestroGRPCSourceClientExcessConnections, MaestroRESTAPIErrorRate, MaestroGRPCServerErrorRate, MaestroSpecControllerReconcileErrors
* Increased the `for` duration of PrometheusOperatorRejectedResources from 5m to 20m to reduce noise during infra provisioning
* Known issues take up less screen real estate in the gather-observability rendering
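For reviewers less familiar with the `for` field, here is a minimal sketch of what the PrometheusOperatorRejectedResources change might look like in a Prometheus rule file. The group name, expression, severity label, and annotation text are assumptions for illustration (loosely modeled on the upstream prometheus-operator alert) and are not copied from this PR; only the alert name and the 5m to 20m change come from the description above.

```yaml
# Illustrative only: group name, expr, labels, and annotations are assumed,
# not taken from this PR's diff.
groups:
  - name: prometheus-operator-example   # hypothetical group name
    rules:
      - alert: PrometheusOperatorRejectedResources
        # Assumed expression: fires while any managed resource is in the
        # "rejected" state.
        expr: min_over_time(prometheus_operator_managed_resources{state="rejected"}[5m]) > 0
        # The condition must hold continuously for this long before the alert
        # fires. Raised from 5m to 20m so that short-lived rejections during
        # infra provisioning do not page anyone.
        for: 20m
        labels:
          severity: warning              # assumed severity
        annotations:
          summary: Resources rejected by Prometheus operator
```

With a longer `for` window the alert stays in the pending state while provisioning settles and only fires if rejections persist past 20 minutes.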
Force-pushed from 6277628 to 2ef9aad.
@geoberle: The following test failed, say
/test e2e-parallel
What
Why
Testing
Special notes for your reviewer