-
Notifications
You must be signed in to change notification settings - Fork 4
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
monitoring guidelines #13
Comments
In GitLab by @wsalembi on Jun 28, 2018, 09:15
|
In GitLab by @wsalembi on Jun 28, 2018, 10:49 After review
|
In GitLab by @pvdbosch on Dec 21, 2018, 12:17 mentioned in December WG meeting:
|
In GitLab by @pvdbosch on Jan 21, 2019, 15:41 mentioned in merge request !3 |
In GitLab by @pvdbosch on Jan 22, 2019, 10:48 Discussed on WG of January: Http status code for failing health checks: 200 or 503?
Conclusion: use status code 503. For 'degraded' status, use 200. 'unknown' status can be removed, a common use case can't be found to use it Steve mentioned VAS uses variants of the /health resource for OpenShift, e.g.:
|
In GitLab by @wsalembi on May 10, 2019, 09:36 mentioned in merge request !4 |
In GitLab by @pvdbosch on Jun 18, 2019, 21:51 How the microprofile health standard compares:
I'd like to discuss our options:
|
In GitLab by @wsalembi on Dec 11, 2019, 10:21 Logging at the specs, I think we should just formalize the common parts. Top-level is UP (200) or DOWN (503). |
I agree with @wsalembi , adoption of only the common parts in gCloud REST guide. |
A draft of an upcomming agreement: |
latest working version: https://inadarei.github.io/rfc-healthcheck/ We'll check if this RFC has some (planned) support in implementations. If not, we'll use the microprofile Health/spring-boot based one. |
About the RFC, I found following related github issues
So Spring Boot rejected it because lack of apparent traction on the RFC. Issues in microprofile are still open, but discussion focuses more on the health check details (not the global state), which we were thinking to not regulate in the styleguide. The global state ("status": "UP" / "DOWN") is considered compliant for the RFC, because the RFC allows it as "acceptable aliases". The RFC is quite lenient in order to be compatible with Spring Boot and microprofile, but this makes it weaker as a standard IMO. For instance, multiple status value aliases are accepted and they are case-insensitive, and for http status codes only 2xx-3xx (pass) or 4xx-5xx (fail) are mentioned. |
decision on WG: continue based on common parts of the microprofile Health/spring-boot standards |
FYI, Spring Boot 2.3 will add specific support for liveness and readiness health checks: https://spring.io/blog/2020/03/25/liveness-and-readiness-probes-with-spring-boot |
degraded/warn status is being discussed in microprofile health: eclipse/microprofile-health#130 |
Hi! I was happy to stumble across this thread about monitoring guidelines for REST standards within Belgium! Everyone is still tending to hand roll their own Health systems, and common elements beyond HTTP status codes and liveness/readiness groups (thanks, k8s!) have been hard to come by or slow to emerge. There is some momentum to standardize Health check responses, and I think it is much needed, so I would be happy to join this conversation from the United States! I think the RFC could be important, if people can have their interoperability/alignment conversations there to get appropriate adjustments made for a more viable standard. |
Hi @derekm , we're seeing three use cases requiring different types of health checks:
Our REST guide mainly focuses on the third case. |
I updated the PR #51 a bit:
The PR for the Swagger/OpenAPI files is in belgif/openapi-common#2 If OK on WG tomorrow, we can merge. |
#13 health check guidelines Co-authored-by: wisa <[email protected]> Co-authored-by: wsalembi <[email protected]>
both PRs are merged. Will be published in next version of guide; probably by end or this week or beginning of next week |
Hello @pvdbosch , |
Hi @smals-chlo, thanks for reporting. The links for all swagger files were broken because the directories "swagger-2.0" were renamed to "swagger" (#27). I've updated and published the rest guide to fix this. |
Hi @pvdbosch,
|
I think we can generalize in a broader scope than Health: any API operation can always fail unexpectedly and return 500 response. Not being able to generate a health response is just a specific occurrence of this. And then we can discuss if such unexpected failures should or shouldn't be represented in OpenAPI/Swagger => created #67 on this. |
In GitLab by @wsalembi on Feb 23, 2018, 09:04
Because of the low priority, monitoring.adoc is removed from the styleguide at the moment, but still exists in the git repo.
eHealth: HealthCheck
CBSS:
The text was updated successfully, but these errors were encountered: