Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[Bug] Yarn resourcemanagergc Alarms But the rm gc curve does not exceed the threshold #563

Open
3 tasks done
mengbaba3316 opened this issue May 21, 2024 · 2 comments
Open
3 tasks done
Labels
bug Something isn't working

Comments

@mengbaba3316
Copy link

Search before asking

  • I had searched in the issues and found no similar issues.

What happened

Version ddp1.2.1
Yarn resourcemanagergc Alarms But the rm gc curve does not exceed the threshold
b12d06b87586bf22c357c51cec4f8d0
I suspect that the UI page status update is not timely, and I feel that other components will also have this problem

What you expected to happen

Hopefully, we can resolve this issue and check the other components

How to reproduce

When the threshold is exceeded and the gc time of the restart service decreases, this alarm is occasionally displayed

Anything else

No response

Version

dev

Are you willing to submit PR?

  • Yes I am willing to submit a PR!

Code of Conduct

@mengbaba3316 mengbaba3316 added the bug Something isn't working label May 21, 2024
@datasophon
Copy link
Member

The ResourceManagerGC indicator of resourcemanager is incorrect, you can turn it off

@hawk9821
Copy link
Contributor

应该是告警时效性的问题, 在告警发触发的时候产生了告警, 告警记录的状态并没有更新导致的。 重启yarn 服务告警就没有了
告警的计算逻辑应该是没问题的 。
我发现从 alertmanager 发送的告警信息 status 都是 firing , 没有 resolved , 导致告警记录的状态不会更新
image
@datasophon

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug Something isn't working
Projects
None yet
Development

No branches or pull requests

3 participants