You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Currently, we need to manually specify in the config.yaml (or references config) that we want bowtie2 or hisat or star or whatever indexes, and that we want the different conversions for refflat, gffutils db, etc.
One option is to add these all to the included reference configs; another option is to hard-code them into the references workflow so that there is always a rule available for them (even though a particular workflow may not need all of them, in which case they will not be built).
A nice side effect is that when running the references workflow, it will create all the files needed so they will always be on hand.
The text was updated successfully, but these errors were encountered:
After working on this a bit in #209, I realized that we should keep it the way it is for now, for the following reasons:
gffutils db often requires custom kwargs to handle all the idiosyncracies of a GTF file. There's not a good "default" way of running this. And there are cases like GENCODE human GTF where the wrong settings cause it take forever to create a db.
genome fastas and transcriptome fastas both fall under the "fasta" field in the references config. It doesn't make sense to build a salmon index for a genome fasta or a star index for a transcriptome fasta.
While we want to retain the ability to pass kwargs, it's not clear that these should be overrides.
Possible solutions to these issues;
Don't do the gffutils conversion. Not sure how useful it is anyway.
Change the config specification to use "genome_fasta" and "transcriptome_fasta" rather than just "fasta". This has a nice side-effect in that it helps tie together gtf/genome/transcriptome in a way that they are not tied together now.
Change the "conversions" key to be "conversions_overrides" or something.
Currently, we need to manually specify in the config.yaml (or references config) that we want bowtie2 or hisat or star or whatever indexes, and that we want the different conversions for refflat, gffutils db, etc.
One option is to add these all to the included reference configs; another option is to hard-code them into the references workflow so that there is always a rule available for them (even though a particular workflow may not need all of them, in which case they will not be built).
A nice side effect is that when running the references workflow, it will create all the files needed so they will always be on hand.
The text was updated successfully, but these errors were encountered: