Lwt-async-exn: Unix.ENOMEM error #462

Open
arahuja opened this issue Aug 12, 2016 · 1 comment

@arahuja
Contributor

arahuja commented Aug 12, 2016

I see this very consistently when submitting many epidisco pipelines at once. It tends to happen when they all hit the parallelized variant-calling stage simultaneously.

Lwt-async-exn: Unix.Unix_error(Unix.ENOMEM, "fork", "")

The danger here is that I have to restart the server while tasks are in progress. Many tasks create their output file before they complete, so if I resubmit a task it might be viewed as done, and the workflow moves on to the next stage using incomplete output. (Perhaps this is only true if I remove the database, which I often do, since after submitting ~50 or so 1000-node jobs the 'Building' stage of a job is pretty long. Usually there is a pretty long lag between submitting the task (and observing the submission in the log at the bottom of the UI) and seeing it displayed as 'Building' in the table.)
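One common way to keep a half-written output from being mistaken for a finished one is to write to a temporary file and rename it into place only on success. The sketch below illustrates the idea in Python; it is not Ketrew or epidisco code, and `write_output_atomically` and `produce` are hypothetical names chosen for the example. It assumes the temporary file and the final path live on the same filesystem, where the rename is atomic on POSIX.

```python
import os
import tempfile


def write_output_atomically(final_path, produce):
    """Call produce(file_obj) on a temp file, then rename it to final_path.

    final_path only ever appears once produce() has finished without error,
    so an interrupted task never leaves a partial file under the final name.
    (Hypothetical helper for illustration, not part of Ketrew.)
    """
    directory = os.path.dirname(os.path.abspath(final_path))
    # Create the temp file next to the target so the rename stays on one filesystem.
    fd, tmp_path = tempfile.mkstemp(dir=directory, prefix=".partial-")
    try:
        with os.fdopen(fd, "w") as tmp:
            produce(tmp)
            tmp.flush()
            os.fsync(tmp.fileno())  # push the data to disk before publishing it
        os.replace(tmp_path, final_path)  # atomic rename on POSIX
    except BaseException:
        # On failure, remove the partial file and leave final_path untouched.
        if os.path.exists(tmp_path):
            os.unlink(tmp_path)
        raise
```

With this pattern, a "does the output file exist?" check is only satisfied by complete output. If the process is killed outright, a leftover `.partial-*` file may remain, but the final path still does not exist.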

@smondet
Member

smondet commented Aug 13, 2016

What else was on the machine?

I had that one before, but Ketrew wasn't the actual problem; other jobs on the same host had exhausted the memory (picard-mark-dups / mutect).
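If the `ENOMEM` from `fork` is transient because other jobs on the host are holding the memory, one possible mitigation on the caller's side is to retry the process launch with a growing delay. The following is a minimal Python sketch of that idea, not how Ketrew or Lwt handle the error; `run_with_enomem_retry`, `launch`, and the default values are assumptions made for illustration.

```python
import errno
import subprocess
import sys
import time


def run_with_enomem_retry(launch, attempts=5, initial_delay=1.0, sleep=time.sleep):
    """Call launch(); if it fails with ENOMEM, wait and try again.

    The delay doubles after each failed attempt. Any other OSError, or an
    ENOMEM on the last attempt, is re-raised unchanged.
    (Hypothetical helper for illustration.)
    """
    delay = initial_delay
    for attempt in range(1, attempts + 1):
        try:
            return launch()
        except OSError as exc:
            if exc.errno != errno.ENOMEM or attempt == attempts:
                raise
            sleep(delay)
            delay *= 2


if __name__ == "__main__":
    # Example: start a trivial child process, retrying if fork runs out of memory.
    child = run_with_enomem_retry(
        lambda: subprocess.Popen([sys.executable, "-c", "pass"])
    )
    child.wait()
```

Retrying only helps when the memory pressure is temporary; if the host stays exhausted, the error is re-raised after the last attempt.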

@ihodes ihodes added the bug label Sep 19, 2016
3 participants