Show more granular OrcaHello status on dashboard #19

Open
dthaler opened this issue Jun 5, 2024 · 4 comments
Labels
enhancement New feature or request

Comments

@dthaler
Collaborator

dthaler commented Jun 5, 2024

Any way to get a pulse on the inference containers on Azure? (Ask Patrick? Or Michelle on GitHub?)

@dthaler
Collaborator Author

dthaler commented Jun 12, 2024

@pastorep @micya Scott suggested pinging you two. Any way to tell which nodes OrcaHello is actively monitoring? The status endpoint is not node-specific, and the detections endpoint doesn't give any information on what is actively being monitored.

@micya
Member

micya commented Jun 12, 2024

> Any way to get a pulse on the inference containers on Azure? (Ask Patrick? Or Michelle on GitHub?)

The inference system is running on the AKS cluster inference-system-AKS in the resource group LiveSRKWNotificationSystem. https://github.com/orcasound/aifororcas-livesystem/tree/main/InferenceSystem#deploying-an-updated-docker-build-to-azure-kubernetes-service is still accurate. Each location has its own namespace.
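As a minimal sketch (not part of the repo), this is roughly how one could enumerate the per-location namespaces and see which ones currently have running inference pods, assuming your kubectl context is already authenticated against the inference-system-AKS cluster:

```python
# Sketch: list namespaces on the current cluster and count running pods in each.
# Assumes kubectl is already configured for the inference-system-AKS cluster.
from kubernetes import client, config

config.load_kube_config()  # uses the active kubectl context
v1 = client.CoreV1Api()

for ns in v1.list_namespace().items:
    name = ns.metadata.name
    pods = v1.list_namespaced_pod(name).items
    running = [p.metadata.name for p in pods if p.status.phase == "Running"]
    print(f"{name}: {len(running)} running pod(s)")
```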

> @pastorep @micya Scott suggested pinging you two. Any way to tell which nodes OrcaHello is actively monitoring? The status endpoint is not node-specific, and the detections endpoint doesn't give any information on what is actively being monitored.

Not programmatically. Currently, we have a separate container image per location (refer to the yaml files here). The code in all the images is the same, but there is a different config file in each image (config files here). I'm not sure which config goes to which image, but you could probably poke through the images in Azure Container Registry to figure it out.

Suggestions:

  1. Unify the Docker images into one and inject per-location config at runtime.
  2. Consider having each location deployment report its own location. This could be an endpoint that returns a string (either in the inference service or as a sidecar); see the sketch after this list.
  3. Consider having each location deployment send a heartbeat. A monitoring solution could subscribe to the heartbeats to figure out which locations are active. The heartbeat might simply send the location string.
  4. You could also consider a more comprehensive Kubernetes cluster monitoring solution, but since our usage is fairly simple and our cluster is treated as cattle rather than pets, I suggest skipping this in favor of the heartbeat system proposed above.
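A rough sketch of suggestions 2 and 3: a tiny sidecar that reports its location and can be polled as a heartbeat. The LOCATION environment variable, the /location route, and the port are hypothetical names for illustration, not part of the existing inference service.

```python
# Sketch of a per-location sidecar: expose the deployment's location string
# and let a 200 response double as a liveness heartbeat for that location.
import os
from flask import Flask, jsonify

app = Flask(__name__)
LOCATION = os.environ.get("LOCATION", "unknown")  # injected per deployment (assumed)

@app.route("/location")
def location():
    return jsonify({"location": LOCATION, "status": "alive"})

if __name__ == "__main__":
    app.run(host="0.0.0.0", port=8080)
```

A dashboard could then poll each namespace's /location endpoint (or subscribe to pushed heartbeats) to decide which locations to show as actively monitored.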

@dthaler dthaler added the enhancement New feature or request label Oct 30, 2024
@dthaler dthaler changed the title Any way to get pulse of inference containers on Azure? Show more granular OrcaHello status on dashboard Dec 21, 2024
@dthaler
Collaborator Author

dthaler commented Dec 21, 2024

Where are the .wav files Scott mentioned? Can they be attributed to hydrophones to provide any info on inference engine results?

@micya
Member

micya commented Dec 21, 2024

I'm not sure of the relevance of the wav files to this issue, but they are in the livemlaudiospecstorage storage account > audiowavs container. They are associated with individual hydrophones via the aifororcasmetadatastore Cosmos DB > predictions > metadata. Please check with Mike Cowan on the schema & API to retrieve info from Cosmos DB.
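For illustration only, a sketch of pulling recent prediction metadata from Cosmos DB to map wav files back to hydrophones. The database and container names come from the comment above; the field names (e.g. "location", "audioUri", "timestamp") and connection settings are assumptions, so confirm the actual schema with Mike Cowan as noted.

```python
# Sketch: query recent prediction metadata records from the
# aifororcasmetadatastore Cosmos DB account (field names are assumed).
import os
from azure.cosmos import CosmosClient

client = CosmosClient(
    url=os.environ["COSMOS_URL"],        # Cosmos DB account endpoint
    credential=os.environ["COSMOS_KEY"], # account key or token credential
)
container = client.get_database_client("predictions").get_container_client("metadata")

query = "SELECT TOP 10 c.id, c.location, c.audioUri FROM c ORDER BY c.timestamp DESC"
for item in container.query_items(query=query, enable_cross_partition_query=True):
    print(item)
```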
