4.4.0.0 #8244
Replies: 2 comments
-
I met a problem with run CFSAN SNP pipeline. Error: CLASSPATH is not configured with the path to GenomeAnalysisTK.jar. I have checked the path and everything is fine in my sense. Can you help me? It is an urgency. Thank you. |
Beta Was this translation helpful? Give feedback.
-
The docker image for 4.4.0.0 is 2.81 GB, which is 1.2 GB larger (75% larger) than the 4.3.0.0 tag, which was 1.63 GB. (See dockerhub for confirmation.) Is there a reason why 4.4.0.0 is so much larger than 4.3.0.0? |
Beta Was this translation helpful? Give feedback.
-
Download release: gatk-4.4.0.0.zip
Docker image: https://hub.docker.com/r/broadinstitute/gatk/
Highlights of the 4.4.0.0 release:
We've moved to Java 17, the latest long-term support (LTS) Java release, for building and running GATK! Previously we required Java 8, which is now end-of-life.
Significant enhancements to
SelectVariants
, including arguments to enableGVCF
filtering support and to work with genotype fields more easily.A new tool
SVConcordance
, that calculates SV genotype concordance between an "evaluation" VCF and a "truth" VCFBug fixes and enhancements to the support for the Ultima Genomics flow-based sequencing platform introduced in GATK 4.3.0.0
Full list of changes:
Flow-based Variant Calling
FlowFeatureMapper
: added surrounding-median-quality-size feature (FlowFeatureMapper: added surrounding-median-quality-size feature #8222)Mutect2
parameters to make them consistent with theHaplotypeCaller
parameters (Updated M2 parameters to make them consistent with HC #8186)SelectVariants
SelectVariants
(Enabled GVCF type filtering support in SelectVariants #7193)--ignore-non-ref-in-types
to support correct handling of VariantContexts that contain a NON_REF allele. This is necessary because every variant in a GVCF file would otherwise be assigned the type MIXED, which makes it impossible to filter for e.g. SNPs.SelectVariants
: added new arguments for controlling genotype JEXL filtering (SelectVariants JEXL filter fixes and refactor #8092)-select-genotype
: with this new genotype-specific JEXL argument, we support easily filtering by genotype fields with expressions like 'GQ > 0', where the behavior in the multi-sample case is 'GQ > 0' in at least one sample. It's still possible to manually access genotype fields using the old-select
argument and expressions such asvc.getGenotype('NA12878').getGQ() > 0
.--apply-jexl-filters-first
: This flag is provided to allow the user to do JEXL filtering before subsetting the format fields, in particular the case where the filtering is done on INFO fields only, which may improve speed when working with a large cohort VCF that contains genotypes for thousands of samples.SV Calling
SVConcordance
, that calculates SV genotype concordance between an "evaluation" VCF and a "truth" VCF (Implement SVConcordance tool #7977)SVAnnotate
(Recognize MEI DELs with ALT format <DEL:ME> in SVAnnotate #8125)AnalyzeSaturationMutagenesis
(don't sort output #8053)Notable Enhancements
GenotypeGVCFs
: added an--keep-specific-combined-raw-annotation
argument to keep specified raw annotations (Adding argument to GenotypeGVCFs to keep only RAW_GT_COUNT #7996)VariantAnnotator
now warns instead of fails when the variant contains too many alleles (VariantAnnotator fails when the variant contains too many alleles #8075)GenomicsDB
arguments to theCreateSomaticPanelOfNormals
tool (Expose GenomicsDBArgumentCollection with CreateSomaticPanelOfNormals #6746)DeprecatedFeature
annotation and a process for officially marking GATK tools as deprecated (Add DeprecatedFeature annotation. #8100)close()
methods from hiding underlying errors (Prevent close from hiding causal exceptions #528 #7764)Bug Fixes
VariantRecalibrator
to sometimes fail if user provided duplicate -an options (Fixed issue causing VariantRecalibrator to sometimes fail if user provides duplicate-an
options #8227)ReblockGVCF
: remove A,R, and G length attributes whenReblockGVCF
subsets an allele (removing A,R, and G length attributes when ReblockGVCF subsets an allele #8209)ReblockGVCF
would not remove all of them at sites where an allele was dropped. This makes the output gVCF invalid since the annotation length no longer matches the length described in the header at those sites. Now we fix up F1R2, F2R1, and AF annotations and remove any other annotations that are not already handled that are defined as A, R, or G length in the header.gCNV
bug that breaks the inference when only 2 intervals are provided (Number of intervals edge case gCNV fix #8180)GenotypingEngine
(Fix NPE from unintialized logger in GenotypingEngine #8159)StreamingPythonExecutor
/CNNScoreVariants
(Fix asynchronous Python exception propagation in StreamingPythonExecutor/CNNScoreVariants. #7402)ShiftFasta
where the interval list output was never written (Fixing issue in ShiftFasta where the intervals aren't written #8070)MergeAnnotatedRegions
now requires a reference as asserted in its documentation (require reference as asserted in documentation #8067)Miscellaneous Changes
VariantRecalibrator
argument and an oldReblockGVCF
argument that produced invalid GVCFs (Deprecate some args #8140)GnarlyGenotyper
code with a diploid assumption to prepare for adding haploid support toGnarlyGenotyper
(Deprecate some args #8140)ReblockGVCF
: add error message for when tree-score-threshold is set but the TREE_SCORE annotation is not present (add error message for when tree-score-threshold is set in ReblockGVCF… #8218)TransferReadTags
: allow empty unaligned bams as input (Allow empty input in TransferReadTags #8198)JointVcfFiltering
WDL and expanded tests. (Refactored JointVcfFiltering WDL and expanded tests. #8074)#carrot_pr
to trigger branch vs master comparison runs (Updated carrot publish github action version #8084)File.createTempFile()
withIOUtils.createTempFile()
to ensure that temp files are deleted on shutdown (Replace File.createTempFile with IOUtils.createTempFile #6780)CNNScoreVariants
tool classes. (Don't require python to just instantiate tool classes. #8128)Funcotator
methods and fields protected so it is easier to extend the tool (Make several Funcotator methods and fields protected instead of private #8124) (Make several Funcotator methods and fields protected #8166)ProcessControllerAckResult
API (Test for presence of ack result message and simplify ProcessControllerAckResult API #7816)DirichletAlleleDepthAndFractionIntegrationTest
(Fixed the broken test in DirichletAlleleDepthAndFractionIntegrationTest #7963)HaplotypeCaller
test files that are no longer needed (Removed GATK3.5 VCFs from HC integration test files. #7634)Documentation
OMP_NUM_THREADS
andMKL_NUM_THREADS
toGermlineCNVCaller
andDetermineGermlineContigPloidy
(Added documentation on OMP_NUM_THREADS and MKL_NUM_THREADS to GermlineCNVCaller and DetermineGermlineContigPloidy. #8223)PileupDetectionArgumentCollection
documentation (Small improvements to the pileup arguments #8050)VariantAnnotator
(Fix minor documentation mistake in VariantAnnotator #8145)Dependencies
Java 17
, the latest LTS Java release, for building/running GATK (Java 17 and Spark 3.3.0. #8035)Gradle
to 7.5.1 (Update Gradle to 7.5.1 #8098)HTSJDK
to 3.0.5 (Java 17 and Spark 3.3.0. #8035)Picard
to 3.0.0 (Java 17 and Spark 3.3.0. #8035)Barclay
to 5.0.0 (Java 17 and Spark 3.3.0. #8035)GenomicsDB
to 1.4.4 (Move to GenomicsDB 1.4.4 #7978)Spark
to 3.3.1 (Java 17 and Spark 3.3.0. #8035)Hadoop
to 3.3.1. (Use Java 11.0.11+9 for testing, upgrade hadoop to v3.3.1. #8102)commons-text
1.10.0 to fix a security vulnerability (Add constraint to transitive dependency commons-text #8071)This discussion was created from the release 4.4.0.0.
Beta Was this translation helpful? Give feedback.
All reactions