-
Notifications
You must be signed in to change notification settings - Fork 67
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
DRBD Connection Failure in Kubernetes Environment with External LINSTOR Controller #683
Comments
SOS Report: sos_2024-07-09_23-37-58.tar.gz |
Have you checked for firewall settings on both nodes? Looks like the k8s node is struggling to receive anything from the proxmox node. |
For Proxmox, I actually don't use iptables at all and just rely on OPNsense to do it's job. On OPNsense, I can look at my live firewall rules, and I see the packets being allowed back and fourth to each node. Are there any other rules I should look for/add? Note, after updating the log level to "TRACE", I see this: 2024_07_10 16:11:20.777 [DrbdEventService] TRACE LINSTOR/Satellite - SYSTEM - DRBD 'events2': change connection name:pvc-ac80db5a-c34b-4fe1-b8ec-333dbbf13989 peer-node-id:0 conn-name:pve-epyc-01 connection:Connecting Edit: I should mention that seeing the packets in OPNsense only happened when I was connecting across vlans. I did iperf3 tests from both nodes when on the same vlan on port 7000 and I'm getting >10 gbits per second and jitter below .02ms. No lost packets. I think this may be a drbd issue and I should probably post this there instead. |
Just to make extra sure, did you test it in both directions? I.e. running the iperf3 "server" once on a k8s host and once on a proxmox host? What is interesting is that from the logs it looks like the proxmox nodes does not see any connection attempts at all, while the k8s worker seems to "talk" to something which does not look like DRBD, hence it timing out eventually. Other than that, yeah, might be better to open an issue on DRBD. You may want to upgrade to DRBD 9.2.10 on the k8s nodes first just to make sure, even if there does not seem to be any relevant change in the latest release. |
Environment
Setup Description
Description
When creating a resource using a PVC that uses the configured storageclass, the DRBD connection between the Kubernetes worker node and the Proxmox node (acting as the external LINSTOR controller) fails to establish. The connection attempts are continuous but unsuccessful.
Observed Behavior
DRBD Connection Failures:
DRBD Status:
Kubernetes Worker Node:
Proxmox Node:
Note: On both nodes, the connections cycle between Unconnected and Connecting states.
Additional data:
Configuration
Additional Information
Troubleshooting Steps Taken
Questions
The text was updated successfully, but these errors were encountered: