Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[upstream] Segmentation fault at origin - Kmeans multi-threaded #101

Open
JackChuang opened this issue Mar 27, 2020 · 3 comments
Open

[upstream] Segmentation fault at origin - Kmeans multi-threaded #101

JackChuang opened this issue Mar 27, 2020 · 3 comments
Labels

Comments

@JackChuang
Copy link
Member

I ran "./kmeans -d 3 -c 20 -p 50000 -n 2 -t 4" twice. The first execution completed w/o any problem (awesome!). However, the 2nd execution gave me a segmentation fault on the origin node as shown below:
The execution log, dmesg logs on node0/1, and kmeans binary are attached.
kmeans_logs.zip

My setup is 2 VM using old x86.img downloaded from this repository's wiki page. Each VM has 8 vCPU with 40Gb RAM.

Please take a look. Thank you.

@jnarf
Copy link
Collaborator

jnarf commented Mar 30, 2020

I was able to reproduce this Jack on my setup. I am taking a look at it. And will update you asap.

@jnarf
Copy link
Collaborator

jnarf commented Mar 31, 2020

Jack I was able to triage the problem. I am going to have an updated version of the upstream kernel later today for you to test. It may roll back some changes until I figure out the full fix.

jnarf added a commit that referenced this issue Apr 1, 2020
process_remote_futex was missing pcn_kmsg_done before error exit.
@jnarf
Copy link
Collaborator

jnarf commented Apr 1, 2020

Fixed issue. ran with command: ./kmeans -d 1000 -c 10 -p 2000 -s 1000 -n 2 -t 8

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants