add some function to extract cdna and/or protein sequences. #494

Open
xiekunwhy opened this issue Nov 18, 2024 · 3 comments

Comments

@xiekunwhy

Hi,

I hope seqkit can extract transcript/cDNA/gene/protein sequences from a reference FASTA and a GFF/GTF file. gffread can do this, but it cannot handle large chromosomes correctly.

Best,
Kun

@shenwei356
Owner

Is this what you want? https://bioinf.shenwei.me/seqkit/usage/#subseq

How about other tools, such as bedtools or bedops?

@xiekunwhy
Author

Not exactly. bedtools and bedops also lack a function that matches what gffread (https://github.com/gpertea/gffread, gpertea/gffread#125) does. gffio (https://github.com/lh3/gffio) works, but it outputs an empty file when extracting protein sequences from a GFF and a reference sequence.

@shenwei356
Owner

> can extract transcript/cDNA/gene/protein sequences from a reference FASTA and a GFF/GTF file

Can seqkit subseq do that? It supports GTF files if I remember correctly.
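
For illustration, the request in this thread (spliced CDS and protein sequences from a reference FASTA plus a GTF file) can be sketched in a few lines of Python. This is only a minimal sketch under stated assumptions, not how seqkit, gffread, or gffio work: the function names (`read_fasta`, `parse_gtf_cds`, `spliced_cds`, `translate`) are hypothetical, the whole genome is held in memory, only GTF `CDS` rows with a `transcript_id` attribute are used, the frame column is ignored, and the standard genetic code is assumed.

```python
import re
from collections import defaultdict

# Complement table for reverse-complementing minus-strand transcripts.
_COMPLEMENT = str.maketrans("ACGTNacgtn", "TGCANtgcan")

# Standard genetic code, codons enumerated in TCAG order (assumption: no
# alternative translation tables are needed).
_BASES = "TCAG"
_AMINO = ("FFLLSSSSYY**CC*W" "LLLLPPPPHHQQRRRR"
          "IIIMTTTTNNKKSSRR" "VVVVAAAADDEEGGGG")
CODON_TABLE = {
    a + b + c: _AMINO[16 * i + 4 * j + k]
    for i, a in enumerate(_BASES)
    for j, b in enumerate(_BASES)
    for k, c in enumerate(_BASES)
}


def read_fasta(text):
    """Parse FASTA text into {sequence_id: sequence}."""
    genome, name = {}, None
    for line in text.splitlines():
        line = line.strip()
        if not line:
            continue
        if line.startswith(">"):
            name = line[1:].split()[0]  # ID is the first word of the header
            genome[name] = []
        elif name is not None:
            genome[name].append(line)
    return {k: "".join(v) for k, v in genome.items()}


def parse_gtf_cds(text):
    """Collect CDS segments per transcript: {id: [(chrom, start, end, strand)]}."""
    segments = defaultdict(list)
    for line in text.splitlines():
        if not line.strip() or line.startswith("#"):
            continue
        cols = line.split("\t")
        if len(cols) < 9 or cols[2] != "CDS":
            continue
        match = re.search(r'transcript_id "([^"]+)"', cols[8])
        if match:
            segments[match.group(1)].append(
                (cols[0], int(cols[3]), int(cols[4]), cols[6])
            )
    return dict(segments)


def spliced_cds(genome, segments):
    """Join CDS segments in genomic order; reverse-complement if on '-'."""
    ordered = sorted(segments, key=lambda s: s[1])
    # GTF coordinates are 1-based and inclusive.
    seq = "".join(genome[chrom][start - 1:end] for chrom, start, end, _ in ordered)
    if ordered and ordered[0][3] == "-":
        seq = seq.translate(_COMPLEMENT)[::-1]
    return seq


def translate(cds):
    """Translate a CDS up to (not including) the first stop codon."""
    protein = []
    for i in range(0, len(cds) - len(cds) % 3, 3):
        aa = CODON_TABLE.get(cds[i:i + 3].upper(), "X")  # X = unknown codon
        if aa == "*":
            break
        protein.append(aa)
    return "".join(protein)
```

Typical use would be `genome = read_fasta(open("ref.fa").read())`, then `spliced_cds(genome, segs)` and `translate(...)` for each transcript returned by `parse_gtf_cds` (the file name is a placeholder). A real tool would stream or index the FASTA rather than load it whole, and would honor the frame column for partial CDS records.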
