Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Spike: OpenShift monitoring to notify us when batches fail. #698

Open
1 task
ianliuwk1019 opened this issue Sep 16, 2024 · 1 comment
Open
1 task

Spike: OpenShift monitoring to notify us when batches fail. #698

ianliuwk1019 opened this issue Sep 16, 2024 · 1 comment

Comments

@ianliuwk1019
Copy link
Collaborator

ianliuwk1019 commented Sep 16, 2024

Describe the task
There was an incident recently that the FOM batch jobs failed and we were not aware until production and reported by business.
It is better to have a monitoring on OpenShift to notify us when batches failed.

Discuss what possible monitoring tools or implementation can we have to improve our awareness of the batch failure.

Acceptance Criteria

  • [TO BE REFINED]

Additional context

@ianliuwk1019
Copy link
Collaborator Author

You are right @basilv , the very early ticket #217 is probably the one you mentioned.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

1 participant