Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[FEATURE EPIC] Call SNPs, Indels & SVs with iGenVar #36

Open
5 of 6 tasks
Irallia opened this issue Oct 20, 2020 · 0 comments
Open
5 of 6 tasks

[FEATURE EPIC] Call SNPs, Indels & SVs with iGenVar #36

Irallia opened this issue Oct 20, 2020 · 0 comments
Assignees
Labels

Comments

@Irallia
Copy link
Collaborator

Irallia commented Oct 20, 2020

This is an overview over all epics.

As a geneticist for rare diseases, I would like to analyze the deletions in a patient's genome so that the events characterized by the disease can be detected with the help of databases. This helps to narrow down the diagnosis and to initiate tailored therapies corresponding to the genon type.

This includes the following aspects:


Input

Differentiating between the inputs will be processed in the course of Issue seqan/product_backlog#17.
Create a Structure for BAM Indexing seqan/product_backlog#88

Algorithms

Call SNPs & Indels:

-> generating candidate haplotypes
-> local realignment using the pair HMM Model against the candidate haplotypes -> matrix of likelihoods for each read
-> local assembly: assemble these window aligned reads into an assembly graph of local variation
-> infer variants from assembled haplotypes: "Despite its name, HaplotypeCaller does not actually call haplotypes. Rather, it generates haplotypes as an intermediate step to discover variants at individual loci. Here we describe how the GATK engine determines which alt alleles exist in locally assembled haplotypes." (-> variant qualtiy score model)

Call SVs:

Call Deletions from long reads seqan/product_backlog#32

Call Insertions from long reads seqan/product_backlog#93

Add all Methods of Vaquita seqan/product_backlog#84

Call SVs in short reads seqan/product_backlog#17

Cluster SVs: seqan/product_backlog#26

Refinement

TODO... (sViper, ...)

Output

We need to decouple the output from the functionality #6 so that we can write it to an output file #8 with an output option seqan/product_backlog#21.
Then a VCF parser has to be developed in SeqAn3 #9 #10, which we want to use for iGenVar #11.

Testing ✅

We want to check the code with CLI seqan/product_backlog#4 and API tests seqan/product_backlog#12 seqan/product_backlog#13 and cover it completely.
-> We now have a codecoverage of > 85%! seqan/product_backlog#116
In order to implement the CodeCoverage, we are waiting for an update in the app template: seqan/app-template#30.
Update: CLI tests are implemented. 🎉

Refinements, bugs, and requests

@Irallia Irallia added the Epic label Oct 20, 2020
@Irallia Irallia self-assigned this Oct 20, 2020
@Irallia Irallia changed the title [FEATURE EPIC] Call Deletions with iGenVar [FEATURE EPIC] Call SNPs, Indels & SVs with iGenVar Mar 30, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant