Failures in Distributed Algorithms Appear to Hang forever in CCClient #45

Open
coltonbh opened this issue Jan 16, 2024 · 0 comments
Labels
bug Something isn't working

Comments

@coltonbh
Collaborator

While Jan was running a distributed frequency analysis on a TS structure, the BigChem workers logged some failures, and his client hung forever without receiving a response. Why is the failed result not returned to the client? How does Celery handle failures in these distributed algorithms?

We need to test some failure cases, see how they are handled, and confirm that the behavior is correct. I think "correct" means returning a "ProgramFailure" object if any of the gradients fails.
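
To make the proposed behavior concrete, here is a minimal sketch of "return a failure object if any gradient fails". It is not BigChem's or Celery's actual code: it uses a thread pool from the standard library in place of Celery workers, and `GradientResult`, `ProgramFailure`, `safe_gradient`, and `distributed_gradients` are all hypothetical stand-ins defined here for illustration. The idea is that each gradient task catches its own exception and returns a failure object, so the aggregation step always runs and the caller always receives a response.

```python
from concurrent.futures import ThreadPoolExecutor
from dataclasses import dataclass
from typing import Callable, List, Union


@dataclass
class GradientResult:
    """Hypothetical stand-in for a successful gradient output."""
    index: int
    gradient: List[float]


@dataclass
class ProgramFailure:
    """Hypothetical stand-in for a failure object (not the real BigChem class)."""
    index: int
    error: str


def safe_gradient(
    index: int, compute: Callable[[int], List[float]]
) -> Union[GradientResult, ProgramFailure]:
    """Run one gradient; turn any exception into a ProgramFailure value."""
    try:
        return GradientResult(index, compute(index))
    except Exception as exc:
        return ProgramFailure(index, f"{type(exc).__name__}: {exc}")


def distributed_gradients(
    n: int, compute: Callable[[int], List[float]]
) -> Union[List[GradientResult], ProgramFailure]:
    """Run n gradients in parallel (threads stand in for workers here).

    Returns the full list of results if every gradient succeeds, otherwise
    the failure with the lowest index, so the caller never waits on a
    result that will not arrive.
    """
    with ThreadPoolExecutor(max_workers=4) as pool:
        # pool.map preserves input order, so results[i] belongs to index i.
        results = list(pool.map(lambda i: safe_gradient(i, compute), range(n)))

    for result in results:
        if isinstance(result, ProgramFailure):
            return result
    return results
```

A failure test could then pass a `compute` function that raises for one index and assert that the return value is a `ProgramFailure` rather than a hang or an unhandled exception. Whether the real fix should catch exceptions inside each task or rely on Celery's own error propagation is an open design question that this sketch does not settle.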

@coltonbh coltonbh added the bug Something isn't working label Jan 16, 2024
@coltonbh coltonbh self-assigned this Jan 16, 2024