Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Mapping quality and ploidy inference #17

Open
mason-linscott opened this issue Mar 8, 2022 · 0 comments
Open

Mapping quality and ploidy inference #17

mason-linscott opened this issue Mar 8, 2022 · 0 comments

Comments

@mason-linscott
Copy link

Hi @clwgg,

I am using nQuire to determine the ploidy of a highly repetitive metazoan genome (85% repeat content) using paired end data at 60x coverage. When I run nQuire using the defaults and denoise the bin file, the output of lrdmodel indicates the genome is a tetraploid. However, when I restrict the sites to those that have a mapping quality greater than 30 and coverage less than 200 the diploid model has the lowest delta likelihood.

In your manuscript "The exact coverage and number of positions needed for a reliable estimation of ploidy will however depend on the complexity and repetitiveness of the genome. Additionally, it is possible to obtain high quality positions by using BED files to define regions of low repetitiveness, where base frequencies can be more confidently assessed." Could you please elaborate on this? Any advice for choosing between these results or for a different approach would be appreciated.

lrmodel results below and denoised histos attached.

BASE denoised:
nQuire/nQuire lrdmodel base_denoised.bin file free dip tri tet d_dip d_tri d_tet base_denoised.bin 8828893.543759 -109585.312918 3402379.079818 7818904.804943 8938478.856677 5426514.463940 1009988.738816

q30 max cov 200
nQuire/nQuire lrdmodel c20q30m200_denoised.bin file free dip tri tet d_dip d_tri d_tet c20q30m200_denoised.bin 3851483.940194 3626143.067490 2214822.767299 1718518.690290 225340.872705 1636661.172895 2132965.249904

base_histo.txt
q30m200_histo.txt

Thank you!
-M

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant