High CPU load on Debian 12 VM caused by LKRG #301
Comments
Thank you for reporting this @gnd. My main two guesses as to what could be causing this are:
Per your analysis so far, this is more likely issue 2 above. It's puzzling that PHP causes this. It's also puzzling that a "jump label" would presumably be switching back and forth: normally, these are switched only once or very infrequently (on changes to kernel runtime configuration via sysctl or similar). This could indicate a minor kernel bug, where what was meant to be an optimization ended up having the opposite effect, since updating the kernel code has some performance cost even without LKRG.
Is it possible to see the list of all processes while you have such a spike of CPU usage? If the problem is related to JUMP_LABEL, we should see spikes related to kernel worker threads.
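One way to collect that information is to take two snapshots of per-process CPU time from /proc and print the processes whose CPU time grew the most in between. The following is only a rough sketch, not something from the LKRG project: the helper names (`parse_stat`, `snapshot`, `busiest`, `report`) are made up for illustration, and it assumes a Linux /proc with the usual `stat` field layout (state, flags, utime and stime as fields 3, 9, 14 and 15) and that the `PF_KTHREAD` flag bit marks kernel threads.

```python
import os
import time

PF_KTHREAD = 0x00200000  # bit in the task flags field that marks a kernel thread


def parse_stat(line):
    """Parse one /proc/<pid>/stat line into (pid, comm, state, flags, cpu_ticks)."""
    # comm is wrapped in parentheses and may contain spaces or ')',
    # so split on the first '(' and the last ')' instead of on whitespace.
    lparen = line.index("(")
    rparen = line.rindex(")")
    pid = int(line[:lparen])
    comm = line[lparen + 1:rparen]
    rest = line[rparen + 1:].split()  # rest[0] is field 3 (state)
    state = rest[0]
    flags = int(rest[6])              # field 9
    utime = int(rest[11])             # field 14, user-mode clock ticks
    stime = int(rest[12])             # field 15, kernel-mode clock ticks
    return pid, comm, state, flags, utime + stime


def snapshot(proc="/proc"):
    """Return {pid: (comm, is_kernel_thread, cpu_ticks)} for all visible processes."""
    result = {}
    for entry in os.listdir(proc):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc, entry, "stat")) as f:
                pid, comm, _state, flags, ticks = parse_stat(f.read())
        except (OSError, ValueError, IndexError):
            continue  # the process exited while we were reading it
        result[pid] = (comm, bool(flags & PF_KTHREAD), ticks)
    return result


def busiest(before, after, top=10):
    """Return up to `top` rows (delta_ticks, pid, comm, is_kernel_thread), busiest first."""
    rows = []
    for pid, (comm, kthread, ticks) in after.items():
        if pid in before:
            delta = ticks - before[pid][2]
            if delta > 0:
                rows.append((delta, pid, comm, kthread))
    rows.sort(reverse=True)
    return rows[:top]


def report(interval=5.0):
    """Print the processes that used the most CPU time during `interval` seconds."""
    first = snapshot()
    time.sleep(interval)
    second = snapshot()
    for delta, pid, comm, kthread in busiest(first, second):
        kind = "kernel thread" if kthread else "user process"
        print(f"{delta:8d} ticks  pid {pid:<8d} {comm} ({kind})")
```

Calling `report()` once with LKRG unloaded and once during a spike would show whether the extra CPU time lands on kernel worker threads or on the PHP processes themselves.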
Hello, unfortunately, if you mean the number of
I think Adam meant not the number of those processes, but whether they're the ones actively running on CPU (e.g. per
Hello, attached are two files: one from before enabling LKRG, and a second one from after LKRG was enabled, when the load reached > 100.
Thanks @gnd. This is puzzling. We really need the
This might be because of some custom sysctl settings. Let me check.
Hello,
we recently upgraded some of our VMs to Debian 12. They run php8.2 for some web apps. However, as soon as we recompiled LKRG for the new kernel and started it, we noticed the CPU reaching 100% very fast. This leads to machine lockups, and the apps become slow and unresponsive.
We tried many things, but LKRG seems to be the issue. Once it is started, the load reaches 100% very fast; once it is turned off, the load falls back to normal within a minute. We run LKRG on dozens of machines, but only the ones running both Debian 12 and PHP have this issue. Older Debian machines with PHP and LKRG have no problem, and neither do machines that do not run PHP workloads.
We tried adjusting the module's parameters, e.g. lkrg.profile_validate, setting it via sysctl all the way down to 0, but this didn't help. We also tried running older LKRG releases, with the same result (specifically, 7db7483). In the current state we can't run LKRG, even though we would like to have it :(
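For reference, changing such a parameter at runtime amounts to writing to a file under /proc/sys, since a dotted sysctl name maps onto a path below that directory. Here is a small sketch of that mapping. The helper names are hypothetical, and it assumes the profile_validate setting mentioned above is exposed as lkrg.profile_validate while the module is loaded.

```python
from pathlib import Path


def sysctl_path(name, root="/proc/sys"):
    """Map a dotted sysctl name to its file under /proc/sys."""
    return Path(root).joinpath(*name.split("."))


def read_sysctl(name, root="/proc/sys"):
    """Return the current value of a sysctl as a stripped string."""
    return sysctl_path(name, root).read_text().strip()


def write_sysctl(name, value, root="/proc/sys"):
    """Set a sysctl value; on a real system this needs root privileges."""
    sysctl_path(name, root).write_text(f"{value}\n")
```

With the module loaded, `write_sysctl("lkrg.profile_validate", 0)` followed by `read_sysctl("lkrg.profile_validate")` would confirm that the value 0 actually took effect before the load is measured again.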
Do you have any ideas what might be wrong, or how we could help you debug this issue? Thanks!
Attached is a screenshot from Grafana, showing the effect of re-enabling LKRG three times in a row.