already demultiplexed fastqs: mapping file + prep of fastq files #3

elpape · 2017-02-10T13:38:55Z

Hi Taruna or Holly!

I just wanted to make sure I understood how the mapping file for already demultiplexed fastqs should be set up.

BarcodeSequence: not needed in my case (the reads are already demultiplexed), so this column can either be left empty (according to the description on the QIIME website) or contain a dummy string (e.g. NNNN).
LinkerPrimerSequence: is this the sequence of the forward gene-specific primer? (If so, why is it called LinkerPrimerSequence? Very confusing.) The QIIME website says: "each value in that column corresponds to the primer used to amplify that sample". In my case the primers were extended in a 2-step PCR, where the gene-specific primer was added in the first step with a special tag attached. Do I also need to include the sequence of the tag? My forward gene-specific primer should be SSU_F04, so I should just put the sequence of this primer in this column, right?
ReversePrimer: is this the reverse gene-specific primer? Should I include the tag? (In my case, the primer should be SSU_R22.)

In addition, I was wondering about step 1c in the tutorial: why do you only truncate the reverse primer? Should you not truncate the forward primer as well?

Thanks!
Ellen

tarunaaggarwal · 2017-02-14T18:56:15Z

Hi Ellen,

I'd keep the dummy string in the barcode column. Sometimes QIIME complains if a cell is empty.
The LinkerPrimerSequence is a little confusing. Yes, just put the sequence down in the LinkerPrimerSequence column. If it helps, here is a sample mapping file from Holly. You will notice that she has both forward and reverse primer seqs in the LinkerPrimerSequence column but has an additional column with the primer orientation info. You can create a mapping file like that as well.
We truncate the reverse primer only because the forward primer should have already been removed during the demultiplexing step. You can check for the presence of your forward primer in your seqs by typing the following. The result should be 0.

grep -c "^primer-seq" file.fasta
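
If you want to run the same check on a FASTQ file instead, note that a plain grep can also hit quality lines, since those may contain the letters A, C, G and T. Here is a minimal sketch that only looks at the sequence line of each record. It assumes uncompressed, four-line FASTQ records and a primer without degenerate bases; the function name count_primer_starts is made up for illustration.

```shell
# count_primer_starts PRIMER FASTQ
# Prints the number of reads in FASTQ whose sequence begins with PRIMER.
# Only line 2 of every 4-line record (the sequence line) is inspected,
# so quality strings that happen to start with the same letters are ignored.
# The match is literal: IUPAC codes such as N or R are not expanded.
count_primer_starts() {
    awk -v p="$1" 'NR % 4 == 2 && index($0, p) == 1 { n++ } END { print n + 0 }' "$2"
}
```

Call it with the primer sequence and the FASTQ file as its two arguments. As with the grep above, the result should be 0 if the forward primer has already been removed.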

Hope this helps. Let me know if you have more questions. Thanks!
Taruna

elpape · 2017-02-28T14:17:49Z

Hi Taruna,

Thanks for your feedback and my apologies for my late reply (I will hopefully find the time to continue working on this in April).

I am confused about putting both the forward and reverse primers in the same column (under LinkerPrimerSequence). As can be seen from the example mapping file, this means that you have two different sample IDs for the same sample (one with suffix F04 and another one with suffix R22). Does that not complicate later processing/analyses, as they are in fact the same sample? Probably you can just merge these two later on (making use of pattern recognition), I guess.

Thanks,
Ellen

tarunaaggarwal · 2017-02-28T18:09:10Z

Hi Ellen,

Right! I see how that is confusing. When I posted that reply, I guess my understanding of the dataset was a little lacking. These data of ours contain non-overlapping amplicons, which makes things complicated. I agree with you that the sample IDs are two different IDs.

So here is my new answer:

LinkerPrimerSequence is the forward primer and ReversePrimer is the reverse primer. Given your primers, I believe I have the correct sequences for them. This (https://www.dropbox.com/s/tpdyj9xzqz0l9i4/epape_mapping_file_Feb2017.txt?dl=0) is what your mapping file should look like. Please double check the primer sequences.
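
The column layout described here can be sketched roughly as follows. This is only a minimal illustration, not the actual file referred to above: the sample ID, the description, the output file name and both primer strings are placeholders to replace with your own values, and NNNN is the dummy barcode mentioned earlier in the thread. Columns are tab-separated, with #SampleID first and Description last.

```shell
# Write a tab-separated QIIME 1 style mapping file for one already
# demultiplexed sample.
# FWD_PRIMER and REV_PRIMER are placeholders, NOT the real SSU_F04 / SSU_R22
# sequences; substitute your own, verified primer sequences.
FWD_PRIMER="ACGTACGTACGT"
REV_PRIMER="TGCATGCATGCA"
MAP_FILE="example_mapping_file.txt"   # hypothetical output name

{
    # Header row: forward primer under LinkerPrimerSequence,
    # reverse primer under ReversePrimer.
    printf '#SampleID\tBarcodeSequence\tLinkerPrimerSequence\tReversePrimer\tDescription\n'
    # One row per sample; NNNN is a dummy barcode.
    printf '%s\t%s\t%s\t%s\t%s\n' "Sample1" "NNNN" "$FWD_PRIMER" "$REV_PRIMER" "already_demultiplexed"
} > "$MAP_FILE"
```

With this layout each sample keeps a single sample ID, since the two primers sit in separate columns instead of separate rows. QIIME 1 also ships a validate_mapping_file.py script that may be worth running on the result before using it.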

elpape · 2017-03-20T13:28:58Z

Hi Taruna,

It seems the mapping file you are referring to has been removed. Can you put it back? Thank you!!

Ellen

tarunaaggarwal · 2017-03-20T17:54:08Z

It is fixed now... Sorry about that.

elpape · 2017-03-22T09:21:46Z

Hi Taruna!

Sorry, I don’t want to be difficult, but the link still does not work :/

Cheers,
Ellen

tarunaaggarwal · 2017-03-22T14:48:03Z

Weird! Okay, here it is (https://www.dropbox.com/s/y1z4woakzzjvvmf/epape_mapping_file_March2017.png?dl=0). Hopefully this one works!

elpape · 2017-03-22T15:26:51Z

Yes! This works! ☺

jianshu93 · 2018-08-09T07:28:53Z

What if the forward primer has not been removed in the per-sample situation? How can I remove the forward primer in that case? How should I change the split_libraries_fastq.py parameters to remove the forward primer? Add the 'mapping_fps' parameter?
