Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add two parameters for taxonomic classification with DADA2 #681

Conversation

d4straub
Copy link
Collaborator

@d4straub d4straub commented Dec 19, 2023

  1. Added two parameters:
  • --dada_addspecies_allowmultiple: Defines the behavior when multiple exact matches against different species are returned. By default only unambiguous identifications are returned. If TRUE, a concatenated string of all exactly matched species is returned.
  • --dada_taxonomy_rc: Reverse-complement of each sequences will be used for classification if it is a better match to the reference sequences than the forward sequence.
  1. Updated usage.md to explain that --dada_taxonomy_rc is recommended with --dada_ref_taxonomy gtdb according to https://zenodo.org/records/4735821:

Another concern that was raised was the orientation of the DB sequences, to get past this problem please use the tryRC = TRUE argument in the assignTaxonomy command within DADA2, this will search your ASVs in the reverse complement as well.

  1. Updated the default of --kraken2_confidence because it was always showing up as changed even if it was default.

Closes #670

PR checklist

  • This comment contains a description of changes (with reason).
  • If you've fixed a bug or added code that should be tested, add tests!
  • If you've added a new tool - have you followed the pipeline conventions in the contribution docs
  • If necessary, also make a PR on the nf-core/ampliseq branch on the nf-core/test-datasets repository.
  • Make sure your code lints (nf-core lint).
  • Ensure the test suite passes (nextflow run . -profile test,docker --outdir <OUTDIR>).
  • Usage Documentation in docs/usage.md is updated.
  • Output Documentation in docs/output.md is updated.
  • CHANGELOG.md is updated.
  • README.md is updated (including new tool citations and authors/contributors).

Copy link

github-actions bot commented Dec 19, 2023

nf-core lint overall result: Passed ✅ ⚠️

Posted for pipeline commit d0786b7

+| ✅ 154 tests passed       |+
#| ❔   3 tests were ignored |#
!| ❗   2 tests had warnings |!

❗ Test warnings:

  • readme - README did not have a Nextflow minimum version badge.
  • schema_lint - Parameter input is not defined in the correct subschema (input_output_options)

❔ Tests ignored:

✅ Tests passed:

Run details

  • nf-core/tools version 2.10
  • Run at 2023-12-19 14:05:27

@d4straub
Copy link
Collaborator Author

Hi @erikrikarddaniel , I added here a parameter to allow multiple exact matches, I could not see anywhere trouble. However, could there be any consequences to SBDI?

Copy link
Member

@erikrikarddaniel erikrikarddaniel left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

👍
Since the default is to keep the old behaviour, I can't see any problem with this.

@d4straub
Copy link
Collaborator Author

Thanks!

@d4straub d4straub merged commit 71ee3c1 into nf-core:dev Dec 20, 2023
18 checks passed
@d4straub d4straub deleted the add-dada_addspecies_allowmultiple-dada_taxonomy_rc branch December 20, 2023 07:15
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants