Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Update reference proteome analysis to consider patient genotype #948

Open
malachig opened this issue Apr 7, 2023 · 3 comments
Open

Update reference proteome analysis to consider patient genotype #948

malachig opened this issue Apr 7, 2023 · 3 comments
Labels

Comments

@malachig
Copy link
Member

malachig commented Apr 7, 2023

Currently we compare peptide sequences against a reference proteome from Ensembl.

It would be more correct to create a proteome that incorporated SNPs that impact protein sequence and then look for matches between our peptide sequence and THAT personalized proteome reference.

We could take our germline VCF, use it to update a reference genome fasta with het/hom SNPs, extract CDS sequences based on all annotated protein coding transcripts from Ensembl and then build the peptide reference for comparison from that.

@malachig
Copy link
Member Author

At least two complexities to this:

  • Insertions and deletions
  • Diploid representation

@malachig malachig added the grant label Apr 19, 2023
@malachig
Copy link
Member Author

For a description of a graph based approach to tackling this problem refer to section: "Derivation of Splicing-Derived Peptides" from Kahles et al. "Comprehensive Analysis of Alternative Splicing Across Tumors from 8,705 Patients."

https://www.sciencedirect.com/science/article/pii/S1535610818303064?via%3Dihub#sec5

@susannasiebert
Copy link
Contributor

As a stop-gap to at least incorporate single nucleotide variants we could investigate using pyfaidx FastaVariant (https://github.com/mdshw5/pyfaidx):

The FastaVariant class provides a way to integrate single nucleotide variant calls to generate a consensus sequence.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

2 participants