-
Notifications
You must be signed in to change notification settings - Fork 0
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Applied Genome Analysis Weekly Update #1
Comments
This week I inspected the output files from Get Organelle & discussed the results with my advisor. The output file was incomplete, and I tried troubleshooting by changing some parameters based on the FAQ (https://github.com/Kinggerm/GetOrganelle/wiki/FAQ#how-to-assemble-a-target-organelle-genome-using-my-own-reference) but got stuck. A lab mate recommended I try NovoPlasty (https://github.com/ndierckx/NOVOPlasty) in place of GetOrganelle. I've been troubleshooting that portion of the script this week and finally got it to run. I ran into an issue with Novoplasty -- even when I specified the directory output in the configuration file it only saves files to the directory where the job submission script was submitted. I had to manually move to /scratch to get it to run. I also ran into the same issue with Meraculous. Both scripts are running currently, so I hope to have results to check next week! |
I visualized read coverage in IGV & there was pretty even ~40x coverage, which puzzled my lab & I considering our largest contig from the assembly was ~13,000bp (which is only ~10% of the plastome based on the reference sequence). There were a few gaps that may have affected the assembly, but it seems unlikely that this is the main reason since the coverage shows regions of ~30,0000bp without gaps. I pulled a gff from ncbi to visualize which genes the coverage gaps are located & plan to discuss with my lab & PI next week (see images below). I also re-ran Novoplasty with a reference AND seed to see if this improves the assembly. I also am testing one more chloroplast assembler, FastPlast. I reached out to the GACRC to troubleshoot meraculous, and it turns out that there is a version loaded that is missing dependencies. They told me to use a different version, and the script is running fine now! I also changed the "min depth" parameter because I still had an error after fixing the dependency issue. It's currently running now. |
I've been spending this week writing up my report & trying to make sense of our plastome results! |
Plan for your class project:
We recently generated Illumina shotgun sequence data for the non-model plant organism (my study system), Geranium maculatum. I plan to evaluate genome structure with a K-mer analysis, and assemble the plastome & nuclear genome.
Scripts developed: https://github.com/mellamosummer/G_maculatum_novogene/blob/b1b33c844cd09fc139e22747883da07875612990/scripts/G_maculatum.sh
complete
complete
complete
complete
Testing script now
The text was updated successfully, but these errors were encountered: