-
-
Notifications
You must be signed in to change notification settings - Fork 20
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Add retry/backoff to get diagnostics data #112
Comments
@vpetersson can we just add a depends-on flag to the docker compose? |
If balena had a newer version of docker compose that would have worked. However the version they are using is rather primitive unfortunately and won't guarantee start order. |
Depends on does work... You can even add a healthcheck command as well (Although it says "Only array form is supported" for Other possibility is using |
@vpetersson this might be a better way to handle the dbus-wait and diagnostics wait in the miner and config containers actually rather than shell code in the start scripts |
Strange. I'm pretty sure I looked for health check before in Balena and determined it wasn't supported. Looks like I was wrong (or they added it later). Yes, a combination of depends_on and healthcheck is the way to go here as we delegate this logic to docker instead https://docs.docker.com/compose/compose-file/compose-file-v2/ |
The only thing I'm not sure of is where it says "Only array form is supported" for depends on. Does that mean we can't pass a healthcheck to it? Also I'm pretty sure I read somewhere that the supervisor automatically kills and restarts a container if a healthcheck fails. But can't seem to find where I read that. But worth keeping in mind anyway |
Don't worry about this. I'm sorting it in https://github.com/NebraLtd/hm-diag/pull/171/files |
Superseeded by NebraLtd/helium-miner-software#177 |
The hm-config container currently crashes with the following error if it is launched before the hm-diag container has been fully initiated:
This is not a big deal, as it is recovering automatically. However, we should clean this up with some better retry logic in the near future.
The text was updated successfully, but these errors were encountered: