ITSx chimera detection #9
catnipping
started this conversation in
General
Replies: 1 comment 3 replies
-
This is interesting, thank you. It sounds like we do need more chimera detection. Having said that, why you prefer --uchime_denovo seems very relevant for this discussion before I implement! |
Beta Was this translation helpful? Give feedback.
3 replies
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
I was wondering if the chimera detection function embedded in ITSx was robust enough to be reliable without the need of an additional chimera detection step (e.g., uchime_ref or uchime_denovo in vsearch).
Looking into ITSx manual, this is their definition of chimeric sequence: "[chimeric sequences] are sequences with### domains located in the wrong order. This is useful on full-length or near full length data sets, but should not be used on short reads as it could increase the number of false negatives when run on short sequences." Moreover, it states "Note that this is not a robust measure against chimeras of all kinds". Lastly, regarding the option --allow_reorder: "If turned off, a file of potentially chimeric sequences (with profile matches in the wrong order) is written, allowing for, e.g., rudimentary assembly chimera detection."
My understanding is that ITSx would not detect a sequence as chimeric if the chimera resulted from the hybridisation of two different organisms, but the amplicon maintained the correct order (e.g. SSU-ITS1-5.8S from Fungus1 hybridised with ITS2-LSU from Fungus2).
During a PCR reaction, a chimera is generated by the extension of an aborted amplicon that functions as a primer in the next PCR cycle; this generates amplicons that are "half something and half something else", rather than an amplicon with a complete structural rearrangement (e.g., ITS2 before ITS1, etc). As a consequence, I believe the results of ITSx chimera detection function should be handled with care (as the developers themselves suggest).
I think it would be a good idea to add the --uchime_denovo function of vsearch rather than relying on ITSx only. The reason why I tend to prefer it to the --uchime_ref function (at least if my data include loads of poorly annotated sequences) is a topic for another discussion :)
Beta Was this translation helpful? Give feedback.
All reactions