-
Notifications
You must be signed in to change notification settings - Fork 125
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
--mirna_gtf for organism with no miRBase GFF file #329
Comments
Tried again on latest version (2.3.1) and getting the same error. |
Hi @OliverH96,
You have to check if |
Apologies for getting back to you so late. This did seem to advance the pipeline slightly, but am now getting a different error:
|
Hi @OliverH96, this one is a bit tough to debug without being able to run the pipeline with the exact setting that you used. Would it be possible for you to share a tar.gz file with the input files or a truncated version of the files here? |
Please also use the latest dev version because this issue might have been solved when updating to latest mirtop. |
I recently tried to debug this, I used this command: nextflow run nf-core/smrnaseq -r dev -latest -profile docker --input https://github.com/nf-core/test-datasets/raw/smrnaseq/samplesheet/v2.0/samplesheet-full.csv --outdir results --with_umi false --mirtrace_species oar --fasta ../files/Ovis_aries_rambouillet.ARS-UI_Ramb_v2.0.dna.toplevel.fa.gz --mirna_gtf ../files/rumimir_sheep.gff --mature https://github.com/nf-core/test-datasets/raw/smrnaseq/miRBase/mature.fa --hairpin https://github.com/nf-core/test-datasets/raw/smrnaseq/miRBase/hairpin.fa --filter_contamination false --skip_mirdeep true -c ../files/rumimir.config -resume fasta was downloaded from https://ftp.ensembl.org/pub/release-112/fasta/ovis_aries_rambouillet/dna/Ovis_aries_rambouillet.ARS-UI_Ramb_v2.0.dna.toplevel.fa.gz rumimir.config config file contains the following:
But I encountered that I'm still getting the missing database issue, I opened a ticket for this in mirtop: miRTop/mirtop#90 |
I created a PR to fix the issue with the database argument, but I spotted that even if the database argument is properly parsed, there is no parser capable of reading the gff file from rumimir. So currently any databases besides mirgenedb and mirbase are not supported by the pipeline. |
Description of the bug
I'm using sheep miRNA data. miRBase contains a few entries for sheep miRNAs but does not provide a gff file on it's download page. I instead used a gff of sheep miRNAs from the RumimiR database (https://rumimir.sigenae.org/), but reach an error at the mirtop_quant step.
I've uploaded the gff file used, but appended the file extension to .txt to allow for uploading.
My params file:
input: '/gpfs01/home/sbzoh/F1_Seminal_Plasma_RNA/rawData/Fastq/F1_SeminalPlasma_Samplesheet.csv'
outdir: '/gpfs01/home/sbzoh/F1_Seminal_Plasma_RNA/smrnaseq_output'
with_umi: false
mirtrace_species: 'oar'
fasta: '/gpfs01/home/sbzoh//refGenome/Ovis_aries_rambouillet.ARS-UI_Ramb_v2.0.dna.toplevel.fasta'
mirna_gtf: '/gpfs01/home/sbzoh//refGenome/rumimir_sheep.gff'
mature: '/gpfs01/home/sbzoh//refGenome/mature.fa'
hairpin: '/gpfs01/home/sbzoh//refGenome/hairpin.fa'
filter_contamination: false
skip_mirdeep: true
Command used and terminal output
Relevant files
nextflow.log
rumimir_sheep.txt
System information
Nextflow version (23.10.1)
Hardware (HPC)
Executor (slurm)
Container engine: (Singularity)
OS (CentOS Linux)
Version of nf-core/smrnaseq (2.3.0)
The text was updated successfully, but these errors were encountered: