Describe the feature you'd like to have.
The CSI pod should be able to monitor the fuse mount processes and report on abnormal conditions.
What is the value to the end user? (why is it a priority?)
The CSI driver will spawn a FUSE process as a part of each volume mount. Currently, these extra processes are just "fire and forget." However, it is possible for one of these fuse processes to crash. Even with #3 this may not be detectable. A daemon within the pod should watch the fuse processes and ensure that if one crashes, a suitable error is logged so that an admin can easily diagnose why a pod has lost access to storage.
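A minimal sketch of what such a watcher could look like, assuming the driver is written in Go and keeps the os/exec handle of the FUSE client it spawns. The function name `watchFuseProcess`, the `volumeID` argument, and the `sleep` stand-in are hypothetical illustrations, not the driver's actual API:

```go
package main

import (
	"log"
	"os/exec"
	"time"
)

// watchFuseProcess replaces "fire and forget": it waits on the spawned FUSE
// client in the background and logs to stdout if it exits abnormally.
func watchFuseProcess(fuseCmd *exec.Cmd, volumeID string) {
	go func() {
		// Wait returns a non-nil error for a non-zero exit code or a fatal signal.
		if err := fuseCmd.Wait(); err != nil {
			log.Printf("ERROR: fuse process for volume %s exited abnormally: %v", volumeID, err)
			return
		}
		// Exit code 0 is what a normal unmount looks like; nothing to report.
		log.Printf("fuse process for volume %s exited cleanly", volumeID)
	}()
}

func main() {
	// "sleep" stands in for the real mount helper the driver would exec.
	cmd := exec.Command("sleep", "1")
	if err := cmd.Start(); err != nil {
		log.Fatalf("failed to start fuse process: %v", err)
	}
	watchFuseProcess(cmd, "pvc-1234")
	time.Sleep(2 * time.Second) // give the watcher time to observe the exit
}
```

Because the log goes to the CSI container's stdout, it would show up in `kubectl logs` alongside the driver's other output, which matches the acceptance criteria below.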
How will we know we have a good solution? (acceptance criteria)
An error message is logged to stdout (and visible via `kubectl logs <container>`) when a fuse process abnormally exits
No error is logged if the fuse process exits due to an unmount (one way to tell the two cases apart is sketched below)
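One hedged way to satisfy the "no error on unmount" criterion: after the fuse process exits, check whether its target path is still listed in /proc/self/mountinfo. A deliberate unmount removes the entry, while a crash typically leaves a stale ("transport endpoint is not connected") mount behind. The helper name `isStillMounted` and the kubelet path are illustrative only:

```go
package main

import (
	"fmt"
	"os"
	"strings"
)

// isStillMounted reports whether targetPath still appears as a mount point in
// /proc/self/mountinfo. After a fuse process exits, a missing entry suggests a
// deliberate unmount (log nothing), while a lingering entry usually means the
// process crashed and left a stale mount behind (log an error).
func isStillMounted(targetPath string) (bool, error) {
	data, err := os.ReadFile("/proc/self/mountinfo")
	if err != nil {
		return false, err
	}
	for _, line := range strings.Split(string(data), "\n") {
		fields := strings.Fields(line)
		// The fifth field of each mountinfo line is the mount point.
		if len(fields) > 4 && fields[4] == targetPath {
			return true, nil
		}
	}
	return false, nil
}

func main() {
	// Hypothetical kubelet target path, just to show usage.
	mounted, err := isStillMounted("/var/lib/kubelet/pods/example/volume-target")
	if err != nil {
		fmt.Println("could not read mountinfo:", err)
		return
	}
	fmt.Println("still mounted:", mounted)
}
```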
Additional context
Bonus points for detecting and logging loss of connection to the server (which may reconnect in the future). <== I suspect this will be visible once the mount flag to log to stdout is provided by #3
It would be nice if there was a way to automatically remedy the crash, but I'm currently at a loss.