Skip to content
This repository has been archived by the owner on Feb 8, 2018. It is now read-only.

post-mortem DHH crash #1712

Closed
chadwhitacre opened this issue Dec 3, 2013 · 3 comments
Closed

post-mortem DHH crash #1712

chadwhitacre opened this issue Dec 3, 2013 · 3 comments

Comments

@chadwhitacre
Copy link
Contributor

Right after https://www.youtube.com/watch?v=p1E-svVd9Xc we crashed. Manifested as a drained Aspen thread pool:

screen shot 2013-12-03 at 3 18 47 pm

CPU load isn't bad, response times spike.

@chadwhitacre
Copy link
Contributor Author

If 40 were too many threads I would expect CPU load to reflect that. Spiking response times make me think that we hit some slow queries which backed up our threads. What other parameters are we missing? Network I/O on the box?

@chadwhitacre
Copy link
Contributor Author

I'm not going to debug this. :-(

At least it's noted, alongside #1541.

@zbynekwinkler
Copy link
Contributor

I think we should strive to fail faster (200s is way too long). I do not think it is possible with python threads - I do not know of a way how to cancel a running thread in python from "outside" after a certain time :(. Logging requests that took too much time would make sense in this context but this info is in some way already available in papertrail logs.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants