seqwish crashes on chrom21 for 3 hgsvc samples #18

glennhickey · 2019-07-08T13:23:03Z

This ran through fine on one sample (HG00514), but when I scaled up to 3 it crashed. The input sequences can be found here:

https://transfer.sh/SZ5pU/hgsvc-chr21-seqs.tar.gz

# runs in 40min
./pan-minimap2 hg38_chr21.fa HG00514_chr21_0.fa HG00514_chr21_1.fa HG00733_chr21_0.fa HG00733_chr21_1.fa NA19240_chr21_0.fa NA19240_chr21_1.fa | fpa drop -l 10000 > hgsvc_seqwish_fpa10000.paf

# (hgsvc_chr21.fa is the above sequences catted together with hg38 first)
seqwish -s hgsvc_chr21.fa -p hgsvc_seqwish_fpa10000.paf -t 16 -b work/x -g hgsvc_seqwish_fpa10000.gfa

# crashes after 7.5 hours
seqwish: /ebs1/seqwish/src/links.cpp:23: void seqwish::derive_links(seqwish::seqindex_t&, size_t, m\
mmulti::map<long unsigned int, long unsigned int>&, mmmulti::map<long unsigned int, long unsigned i\
nt>&, mmmulti::map<long unsigned int, long unsigned int>&): Assertion `v1.size() == v2.size() == 1'\
 failed.
Command terminated by signal 6

Is it possible that 126G of RAM is not enough?

The text was updated successfully, but these errors were encountered:

ekg · 2019-07-08T13:25:26Z

It should be more than enough. I wonder if you ran out of disk space though? This is the test case you sent?

…

On Mon, Jul 8, 2019, 15:23 Glenn Hickey ***@***.***> wrote: This ran through fine on one sample (HG00514), but when I scaled up to 3 it crashed. The input sequences can be found here: https://transfer.sh/SZ5pU/hgsvc-chr21-seqs.tar.gz # runs in 40min ./pan-minimap2 hg38_chr21.fa HG00514_chr21_0.fa HG00514_chr21_1.fa HG00733_chr21_0.fa HG00733_chr21_1.fa NA19240_chr21_0.fa NA19240_chr21_1.fa | fpa drop -l 1000 > hgsvc_seqwish_fpa10000.paf # (hgsvc_chr21.fa is the above sequences catted together with hg38 first) seqwish -s hgsvc_chr21.fa -p hgsvc_seqwish_fpa10000.paf -t 16 -b work/x -g hgsvc_seqwish_fpa10000.gfa # crashes after 7.5 hours seqwish: /ebs1/seqwish/src/links.cpp:23: void seqwish::derive_links(seqwish::seqindex_t&, size_t, m\ mmulti::map<long unsigned int, long unsigned int>&, mmmulti::map<long unsigned int, long unsigned i\ nt>&, mmmulti::map<long unsigned int, long unsigned int>&): Assertion `v1.size() == v2.size() == 1'\ failed. Command terminated by signal 6 Is it possible that 126G of RAM is not enough? — You are receiving this because you are subscribed to this thread. Reply to this email directly, view it on GitHub <#18?email_source=notifications&email_token=AABDQEOFAZWDTGPA5E5VSILP6M5TPA5CNFSM4H63U6T2YY3PNVWWK3TUL52HS4DFUVEXG43VMWVGG33NNVSW45C7NFSM4G53HNHQ>, or mute the thread <https://github.com/notifications/unsubscribe-auth/AABDQEPMQ2RYY4ZSHEQDH5LP6M5TPANCNFSM4H63U6TQ> .

ekg · 2019-07-08T13:27:21Z

You did the name prefixing awk thing to make sure the sequences are all uniquely named?

…

On Mon, Jul 8, 2019, 15:25 Erik Garrison ***@***.***> wrote: It should be more than enough. I wonder if you ran out of disk space though? This is the test case you sent? On Mon, Jul 8, 2019, 15:23 Glenn Hickey ***@***.***> wrote: > This ran through fine on one sample (HG00514), but when I scaled up to 3 > it crashed. The input sequences can be found here: > > https://transfer.sh/SZ5pU/hgsvc-chr21-seqs.tar.gz > > # runs in 40min > ./pan-minimap2 hg38_chr21.fa HG00514_chr21_0.fa HG00514_chr21_1.fa HG00733_chr21_0.fa HG00733_chr21_1.fa NA19240_chr21_0.fa NA19240_chr21_1.fa | fpa drop -l 1000 > hgsvc_seqwish_fpa10000.paf > > # (hgsvc_chr21.fa is the above sequences catted together with hg38 first) > seqwish -s hgsvc_chr21.fa -p hgsvc_seqwish_fpa10000.paf -t 16 -b work/x -g hgsvc_seqwish_fpa10000.gfa > > # crashes after 7.5 hours > seqwish: /ebs1/seqwish/src/links.cpp:23: void seqwish::derive_links(seqwish::seqindex_t&, size_t, m\ > mmulti::map<long unsigned int, long unsigned int>&, mmmulti::map<long unsigned int, long unsigned i\ > nt>&, mmmulti::map<long unsigned int, long unsigned int>&): Assertion `v1.size() == v2.size() == 1'\ > failed. > Command terminated by signal 6 > > Is it possible that 126G of RAM is not enough? > > — > You are receiving this because you are subscribed to this thread. > Reply to this email directly, view it on GitHub > <#18?email_source=notifications&email_token=AABDQEOFAZWDTGPA5E5VSILP6M5TPA5CNFSM4H63U6T2YY3PNVWWK3TUL52HS4DFUVEXG43VMWVGG33NNVSW45C7NFSM4G53HNHQ>, > or mute the thread > <https://github.com/notifications/unsubscribe-auth/AABDQEPMQ2RYY4ZSHEQDH5LP6M5TPANCNFSM4H63U6TQ> > . >

glennhickey · 2019-07-08T13:41:30Z

The test case I sent the other day was just one sample (hg38 + 2 sequences). This one (I put a new link to the data above) contains those, plus another 4 sequences. I'm working on a disk with 1.6T free space.

I don't do any particular awking, but my sequences have unique names

grep '>' *.fa
HG00514_chr21_0.fa:>HG00514_chr21_0_0
HG00514_chr21_0.fa:>HG00514_chr21_0_1
HG00514_chr21_0.fa:>HG00514_chr21_0_2
HG00514_chr21_1.fa:>HG00514_chr21_1_0
HG00514_chr21_1.fa:>HG00514_chr21_1_1
HG00733_chr21_0.fa:>HG00733_chr21_0_0
HG00733_chr21_1.fa:>HG00733_chr21_1_0
hg38_chr21.fa:>chr21
NA19240_chr21_0.fa:>NA19240_chr21_0_0
NA19240_chr21_1.fa:>NA19240_chr21_1_0

ekg · 2019-07-16T15:25:04Z

@glennhickey I'm not sure that the fasta reader is going to be OK with the sequences named that way. But I can't be sure that this is the problem. I'll see if I can reproduce with a simpler test.

glennhickey mentioned this issue Sep 21, 2019

Option for acyclic reference path #19

Closed

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

seqwish crashes on chrom21 for 3 hgsvc samples #18

seqwish crashes on chrom21 for 3 hgsvc samples #18

glennhickey commented Jul 8, 2019 •

edited

Loading

ekg commented Jul 8, 2019 via email

ekg commented Jul 8, 2019 via email

glennhickey commented Jul 8, 2019

ekg commented Jul 16, 2019

seqwish crashes on chrom21 for 3 hgsvc samples #18

seqwish crashes on chrom21 for 3 hgsvc samples #18

Comments

glennhickey commented Jul 8, 2019 • edited Loading

ekg commented Jul 8, 2019 via email

ekg commented Jul 8, 2019 via email

glennhickey commented Jul 8, 2019

ekg commented Jul 16, 2019

glennhickey commented Jul 8, 2019 •

edited

Loading