-
Notifications
You must be signed in to change notification settings - Fork 69
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
KeyError when adding to stack #6
Comments
Hmm. Plop should be thread-safe; I initially developed it in a multithreaded app. statck_counts is normally a defaultdict so it should never raise KeyErrors (barring threading issues). Collector.filter() converts stack_counts into a regular dict (effectively making the object read-only until it is reset()). Could you be calling filter before the collector has stopped or reusing a collector without resetting it? |
Not that I know of; I'm controlling the Collector from the Twisted reactor loop (main thread), and I've put a simple lock to avoid it starting or stopping twice (only starts if not started already, only stops if running). The filter is run right after the collector is stopped, in the same function, also in the reactor loop. Do you have visibility enough to the signal handler, in order to know when there's an error while collecting, in your applications? |
If the signal handler fails, the exception will keep going into whatever code was running at the time of the signal, probably just getting caught and logged in the event loop as we've seen here. If you have a lot of threads and are using a small sampling interval I think it's possible that the signal handler is taking more than its allotted time and a second signal is interrupting the first. Consider increasing the collector's interval and see if the problems still occur. |
So, I increased the interval to 0.1 seconds, and it did improve the stability, but I'm still getting those errors:
|
How many threads do you typically have? Was the application still responsive while the profile was running? |
15 threads per server. In this second test, the application seemed to keep responsive after a small time using, but I didn't test it thoroughly to see if it went down like it happened in my first test. |
Hmm, 15 threads shouldn't be enough to cause signal handler overrun problems, although if you're also having GIL contention it's a possibility. My next move would be to explicitly guard against overrun and see if the problems persist, with a patch like this:
|
I think this is somehow related to the fact that I'm using plop with a multithreaded application (which runs under Twisted), but I'm getting these errors every once in a while, any ideas what it might be?:
The text was updated successfully, but these errors were encountered: