Ceph with libvma crashed in both V6.9.1 and tag 7.0.5 #30
Comments
From the trace I see that a low-level libibverbs API call, 'ibv_create_cq()', has failed. Please verify that netperf or sockperf works as expected with VMA. You can follow the example in the VMA User Manual, which can be downloaded from http://www.mellanox.com/downloads/Accelerator/VMA_7.0.4_package.zip
Yes, both sockperf and iperf work, and the 2us latency seems pretty good. Here is the output:
And here are some local configurations for your reference. Please let me know if you need more. Thanks!
Hi all, an update on this issue: the crash disappeared after I upgraded mlnx_ofed from 3.0.1 to 3.0.2. This time I am using VMA version 7.0.4. However, it now hangs instead of crashing. Below is the trace. Any ideas? It seems VMA sent out the connection request but never got an ACK back. Thanks very much!
We have not tested VMA with Ceph until now, and it will take some time to reach that goal. Can you try running without the high polling factor: VMA_RX_POLL=0 VMA_SELECT_POLL=0
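For anyone reproducing this, here is a minimal sketch of launching a process with those two settings applied. It assumes VMA is loaded the usual way via LD_PRELOAD=libvma.so; the helper names `vma_env` and `run_with_vma` are made up for illustration and are not part of VMA or Ceph.

```python
import os
import subprocess


def vma_env(base=None, preload="libvma.so"):
    """Return a copy of the environment with VMA preloaded and polling set to 0.

    `base` defaults to the current process environment; the input mapping
    is never modified. The preload library name is an assumption and may
    need to be a full path on some systems.
    """
    env = dict(os.environ if base is None else base)
    env["LD_PRELOAD"] = preload
    # The two settings suggested in the comment above.
    env["VMA_RX_POLL"] = "0"
    env["VMA_SELECT_POLL"] = "0"
    return env


def run_with_vma(cmd):
    """Run `cmd` (a list of arguments) under the environment built above."""
    return subprocess.run(cmd, env=vma_env(), check=False)


# Example call (the daemon command line is only an illustration):
# run_with_vma(["ceph-osd", "-i", "0", "-f"])
```

Building the environment in one place makes it easy to confirm that both variables actually reach the daemon, which is worth ruling out before reading too much into the hang.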
For now I'll update the issue title to show Ceph as the target application.
Thanks for your response. The hang is still there after setting VMA_RX_POLL=0 VMA_SELECT_POLL=0. The process had the page-locked flag (L) set. This flag was also there in my last run.
And the hang trace is:
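As a side note on the L flag mentioned above: in ps output it means the process has pages locked into memory. A rough way to see how much is locked is to read the VmLck field from /proc/<pid>/status on Linux. The sketch below does only that; the function names are invented for this example, and it does not by itself say anything about why the process hangs.

```python
def locked_kb(status_text):
    """Return the VmLck value in kB from /proc/<pid>/status text, or None."""
    for line in status_text.splitlines():
        if line.startswith("VmLck:"):
            # The line looks like "VmLck:        128 kB".
            return int(line.split()[1])
    return None


def locked_kb_for_pid(pid):
    """Read /proc/<pid>/status (Linux only) and return locked memory in kB."""
    with open(f"/proc/{pid}/status") as f:
        return locked_kb(f.read())
```

Calling `locked_kb_for_pid` with the PID of the hung process returns the locked size in kB, or None if the field is absent.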
Hi,
I tried to use libvma in my Ceph cluster recently. However, I hit the same crash in version 6.9.1 (installed via RPM) and 7.0.5 (installed from source with debug enabled). The core trace is pasted below. Could someone help look at this issue? Thanks very much!