-
Couldn't load subscription status.
- Fork 1.2k
[router] make a distinction between fatal errors, warnings and unknown as healthcheck result #10710
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[router] make a distinction between fatal errors, warnings and unknown as healthcheck result #10710
Conversation
Codecov Report❌ Patch coverage is Additional details and impacted files@@ Coverage Diff @@
## main #10710 +/- ##
============================================
+ Coverage 17.36% 17.39% +0.03%
- Complexity 15237 15281 +44
============================================
Files 5888 5890 +2
Lines 525741 526155 +414
Branches 64164 64233 +69
============================================
+ Hits 91274 91528 +254
- Misses 424167 424282 +115
- Partials 10300 10345 +45
Flags with carried forward coverage won't be shown. Click here to find out more. ☔ View full report in Codecov by Sentry. 🚀 New features to boost your workflow:
|
546c094 to
73f40ee
Compare
eb83516 to
c5157a6
Compare
|
This pull request has merge conflicts. Dear author, please fix the conflicts and sync your branch with the base branch. |
|
We probably need to raise this against 4.19? |
732ef96 to
f3848c0
Compare
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
code LGTM
There was a problem hiding this comment.
Choose a reason for hiding this comment
The reason will be displayed to describe this comment to others. Learn more.
clgtm
|
this has been (dev-)tested but needs an upgrade test as most important validation. |
|
Packaging result [SF]: ✔️ el8 ✔️ el9 ✔️ el10 ✔️ debian ✔️ suse15. SL-JID 15063 |
|
[SF] Trillian test result (tid-14379)
|
|
Packaging result [SF]: ✖️ el8 ✖️ el9 ✖️ debian ✖️ suse15. SL-JID 15111 |
|
@DaanHoogland is this failure related "test_01_deployVMInSharedNetwork" ? |
maybe just a intermittent failure @blueorangutan package |
|
@weizhouapache a [SL] Jenkins job has been kicked to build packages. It will be bundled with KVM, XenServer and VMware SystemVM templates. I'll keep you posted as I make progress. |
|
Packaging result [SF]: ✔️ el8 ✔️ el9 ✔️ el10 ✔️ debian ✔️ suse15. SL-JID 15123 |
|
@blueorangutan test |
|
@weizhouapache a [SL] Trillian-Jenkins test job (ol8 mgmt + kvm-ol8) has been kicked to run smoke tests |
|
[SF] Trillian test result (tid-14410)
|
|
@vishesh92 @harikrishna-patnala @weizhouapache , there is a slight backwards incompatibility with this change; success can now be true, false or not present. In addition to result there is status, which can be “success”, “failure”, “warning” or “unknown”. So the incompatibilty is that result may not be present. I think this is not grave as it would be a false positive in the past. What do you think? Other than this we are good to merge (@vladimirpetrov tested but forgot to give his opinion here;) |
|
[SF] Trillian test result (tid-14415)
|
as we are only updating the value of the response parameter properly I think this is fine and moreover more detailed 'status' is now present. |
…n as healthcheck result (apache#10710) * [routers] distiction between fatal failure and warning or unknown on healthchecks * UI status for router health checks * status from scripts varied * automation signalled errors * revert removal of update sql * upgradeversion * move config item and further cleanup * handling services better * backwards compatible response --------- Co-authored-by: Daan Hoogland <[email protected]>
Description
This PR is a small functional change inspired by #9800, #9852 and #9473. More related issues may exist. The intention is to introduce the concept of non fatal failures in the router health check scripts. In the past only
successandfailurewere reported. Now also the statuseswarningandunknownare available.The scripts,
were changed, but the machanism may be applied to other scripts as well.
monitor.py has been changed to relay the extra possible statuses, and the backend and UI have been altered to store them and display them to the user.
Types of changes
Feature/Enhancement Scale or Bug Severity
Feature/Enhancement Scale
Bug Severity
Screenshots (if appropriate):
How Has This Been Tested?
UNKNOWNinstead ofFAILEDHow did you try to break this feature and the system with this change?
added some fake tests to test the UI:
