- [bugfix] --
ALL.reformatGeneFamilies.py
doesn't need the ancestral species to have genes - [bugfix] -- NHX trees can now miss internal labels, as AGORA can generate names by itself.
- [bugfix] -- Species names in the gene trees are not transformed by AGORA anymore, and simply have to match the names in the species tree.
- [new] --
misc.compareGenomes.py
now defaults to colouring the chromosomes according to their size
- [new] -- The
agora-*.py
scripts now accept existing orthology groups as an alternative to gene trees. - [new] -- New
-compress
option to change the compression algorithm (or skip compression).
- [major change] -- Migrated to Python 3.
- [change] -- Settled on using "blocks" for all reconstructions: no contigs / scaffolds any more.
- [change] -- Removed the scripts used to convert to our own tree format since AGORA now natively supports Newick and NHX.
- [change] --
buildSynteny.integr-scaffolds.py
now outputs all the support scores, incl. the ones from the input blocks. - [new] -- Up-to-date help messages for all scripts.
- [new] -- Option to sort the output genomes by decreasing block size.
- [new] -- Workflow steps are now skipped if they have already been run (plus an option to force rerunning an entire workflow).
- [new] -- Dropped the dependency on joblib (Python's
multiprocessing
now works well enough for our needs). - [bugfix] -- Translated some French comments.
- [change] -- Changed the parameters of gene-families filtering in the Vertebrates workflow to the values used for Genomicus production.
- [new] -- Support for macOS (Catalina).
- [new] -- Added the ability to do the second reconstruction pass
(scaffolds) in a multi-integration fashion in the library
myAgoraWorkflow
. - [new] -- New preset to run reconstructions for Plants
(
agora-plants.py
). - [new] -- New script to test all reconstruction parameters and
automatically select the best. It should apply to any clade
(
agora-generic.py
).
- [major change] -- Renamed all scripts and most options to match the upcoming publication.
- [bugfix] -- Added the missing
+onlySingletons
option to thebuildSynteny.integr-extend.py
call inagora2.py
. - [change] -- New
-extantSpeciesFilter
parameter to control which extant species can be used to run pairwise comparisons (buildSynteny.pairwise-conservedPairs.py
andbuildSynteny.integr-groups.py
). The option is exposed in all workflow scripts. InbuildSynteny.integr-groups.py
it replaces the third positional command-line argument. - [change] -- Removed the multithreading option of
buildSynteny.integr-copy.py
as it does not give any benefits. - [change] -- Run the workflow steps sequentially by default in the agora scripts, with an option to enable their parallelisation.
- [new] -- Print memory usage stats when running workflows.
- [new] -- Decreased the memory usage of
ALL.filterGeneFamilies-size.py
,buildSynteny.pairwise-conservedPairs.py
, andbuildSynteny.integr-groups.py
.
- [bugfix] -- Avoid
ALL.extractGeneFamilies.py
crashing because of aValueError
in certain conditions. - [bugfix] -- Some mitochondrial genomes were excluded from the reconstructions.
- [bugfix] --
buildSynteny.integr-refine.py
doesn't crash with aZeroDivisionError
when there isn't a previous score to compare against. - [change] --
buildSynteny.integr-refine.py
andbuildSynteny.integr-groups.py
can now run across multiple cores. - [change] -- The path evaluation parameter of
buildSynteny.integr-refine.py
now defaults to the one used in the paper. - [change] -- Removed two Python dependencies:
numpy
andenum
. - [change] -- Unary nodes are now allowed in the species-tree.
- [new] -- New library (
myAgoraWorkflow
) to simplify the creation of workflows around AGORA scripts. - [new] -- Print CPU usage stats when running workflows.
- [new] -- New scripts (
agora1.py
andagora2.py
) to run the default workflows.
- [change] -- Moved the LibsDyoGen inside
scripts/utils/
- [change] --
buildSynteny.integr-groups.py
now accepts_
as a value for theusedSpecies
parameter and interprets it as the same value as thetarget
parameter. - [change] --
ALL.extractGeneFamilies.py
now outputs trees in the same format as the input trees
- [bugfix] --
agora.py
now makes the script it launches follow thenbThreads
option. - [bugfix] -- Can now have multiple ancGenes size instructions in the configuration file.
- [bugfix] --
agora.py
now applies correct dependencies for the pairwise comparisons - [change] -- Moved and renamed the scripts that convert file formats.
- [new] -- All scripts now natively support Newick species tree and NHX gene trees.
- [new] -- New
publish
instruction to convert the ancestral genomes from the diags format to ancGenomes. - [new] --
agora.py
can now generate the first set of ancGenes (all)
- [bugfix] --
agora.py
was not detecting task failures. - [bugfix] --
agora.py
was always creating an ancGenes-filter task even when it doesn't need one. - [change] -- Added a
-LOG.ancGraph
option to all thebuildSynteny.integr
scripts (instead of hardcoded paths). TheintegrOutput
option of the configuration files now refer to this per-ancestor log file, rather than the standard output of the scripts (which is now empty). - [change] -- Relative paths for input genes and species-tree are now evaluated from the location of the configuration file.
- [change] -- Following a rewrite of the HowTo, the structure of the output
directories as defined in the configuration files has changed:
diags/pairwise
→pairwise
,diags/integr
→integrDiags
.
Initial release