Skip to content

Releases: sanger-tol/treeval

1.2.0 - Ancient Destiny

14 Jan 10:18
577d30b
Compare
Choose a tag to compare

[1.2.0] - Ancient Destiny [14/01/2025]

  • Added Steps - subworkflows are now split into configurable steps.
  • Added a .bin output - this will later be used to create modified pretext maps in downstream processes.
  • Added .ktab as an output.
  • All modules have been updated to remove any Anaconda dependency e.g. the defaults channel.
  • Some module versions have been updated too.
  • Replaces PyFasta split with seqtk split - fasta split is severely out of date now.
  • CICD has been reverted to near the original template - this was changed, originally, to mitigate an error with nextflow being unable to download the large number of containers TreeVal has.
  • MerquryFK Module patch to remove memory arg in module as it is unnessecary.
  • Modules updated to ensure that version information is being saved.
  • BUSCO module now includes the arg --metaeuk by default.
  • Changing GENESET to be more userfriendly.
  • Adds a JBrowse Only workflow (this will lead to an update to the FULL workflow which can now call JBROWSE_ONLY and RAPID).
  • Removing dp24's liberal use of spaces.
  • Resource review.

Full Changelog: 1.1.1...1.2.0

v1.1.1 - Ancient Aurora (H1)

04 Jul 08:53
b7475b7
Compare
Choose a tag to compare

[1.1.1] - Ancient Aurora H1 - [2024-07-04]

Our first hotfix for TreeVal, including changing the name of some outputs, correcting some code and fixing an issue with cram files containing multiple RG lines. This will require further fixes in the future, this will include the splitting of the cram file depending on the number of RG lines.

What's Changed

Full Changelog: 1.1.0...1.1.1

v1.1.0 - Ancient Aurora

09 Apr 09:00
dd32db5
Compare
Choose a tag to compare

[1.1.0] - Ancient Aurora - [2024-04-26]

The second release for sanger-tol, created with the nf-core template.

This builds on the initial release by adding subworkflows which generate Kmer-based coverage tracks and a Kmer spectra graph. There are also a number of updates to the logic used throughout the pipeline, as well as to the resources required by a significant number of modules.

What's Changed

  • Updates to the resource allocation methods used by a number of modules in the base.config.
  • Added a flag to stop the usage of Juicer.
  • Subworkflow to generate a kmer-based coverage track.
  • Subworkflow to generate/update a kmer spectra graph.
  • Subworkflow to use minimap2 for HiC mapping, if selected.
  • Subworkflow to use BWAmem2 for HiC mapping, if selected.
  • Subworkflow to ingest Pretext accessory files into the Pretext file, simplifying post-TreeVal data manipulation.
  • Updated the logic in use throughout the pipeline.
  • Updated the modules.config to include some of the logic, cleaning the code.
  • Updated the HiC subworkflow to include subsampling the HiC data for Juicer due to resource requirements with large amounts of data.
  • Updated the YAML_INPUT subworkflow, this now contains "flags" to change some software options.
  • Updated the data names in the input YAML to reduce confusion.
  • Updated software (Pretext{View, Snapshot, Graph}) to allow for use on large genomes with big data.
    • Added associated patch files and CPU architecture files.
  • Updated the minimap2 align module to remove samtools view in preference of paftools for our use case.
  • Updated the test.yml in line with the above changes.
  • Updated the SELFCOMP subworkflow to allow for the parallelisation of the work on large genomes.
  • Updated the READ_COVERAGE subworkflow to produce the scaffold-based AVG coverage and STND coverage
  • Updated Modules from NF-Core - mostly relates to module structure rather than software.
  • Updated the SummaryStats output to include HiC container counts.
  • Added -T / -t flags where possible to minimise the use of the /tmp directory.
  • Replaced CONCAT_MUMMER with CATCAT for simplicity.
  • Removed JUICER from the RAPID entry point.
  • Removed the CSI or TBI logic. CSI is now used by default, this simplifies the workflow and enlarges the capacity to handle much larger genomes. The logic block previously required was then moved.
  • Added NF-DOWNLOAD to the CI-CD due to an error that causes incomplete downloaded when downloading a number of images at the same time.
  • Added the RAPID_TOL entry point which is more geared towards the requirements of Sanger.
  • Fix a bug in build_alignment_blocks.py to avoid indexing errors happening in large genomes.
  • Change output BEDGRAPH from EXTRACT_TELO module.
  • Fix to the telomere output.
  • Fix to the SelfComp scripts.

New Contributors

  • @gq1 made their first contribution in #186

Contributors

Full Changelog: 1.0.0...v1.1.0

v1.0.0 – Ancient Atlantis

27 Sep 13:21
2413fe5
Compare
Choose a tag to compare

[1.0.0] - Ancient Atlantis - [2023-09-27]

Initial release of sanger-tol/treeval, created with the nf-core template.

This pipeline allows for the generation of data needed for the high-quality manual curation of genomic assemblies.

What's Changed

New Contributors

Full Changelog: https://github.com/sanger-tol/treeval/commits/v1.0.0