Skip to content

Latest commit

 

History

History
332 lines (259 loc) · 17.3 KB

CHANGELOG.md

File metadata and controls

332 lines (259 loc) · 17.3 KB

Changelog

All notable changes to this project will be documented in this file.

The format is based on Keep a Changelog (+ the Migration Guide), and this project adheres to Semantic Versioning.

[Unreleased]

SCIENTIFIC CHANGES

  • The default ICC calculation was changed from ICC1,1 to ICC2,1 (#176)
  • The GsdIluz algorithm was reworked. This fixes some discrepancies with the original implementation and should improve the results in many cases. We now also provide a version of the original peak detection algorithm that was used in the matlab implementation. This can be used by setting `GsdIluz(**GsdIluz.PredefinedParameters.original). (#182, #187)

Added

  • We added first scripts for the revalidation. This includes a script to extract the old results and put them into files the new evaluation pipeline can read. Further, we have a first sceleton for what the gsd-revalidation could look like. (#182)
  • All pipelines now pass much more metadata to the algorithms action methods. This should make it easier to implement algorithms that need more context. At the moment this is only used in the DummyAlgorithm to load existing results based on the participant and test. For this to work across all algorithms, we had to change some logic in the LrcUllrich algorithm to selectively parse the kwargs that it needs. (#182)
  • When using Custom Aggregations, it is now easier to pass arguments to the underlying aggregation functions. This is a breaking change, as we require you to now pass the desired name of the output column as part of the column_name. CustomOperation(... func=A.icc, column_name="my_column") -> CustomOperation(... func=A.icc, column_name=("icc", "my_column")) This removes the requirement that func needs to be a proper function object with a __name__ attribute. Hence, you can now use lambdas and partials without issue. (#176)
  • It is now possible to split custom aggregations into multiple columns. For example, A.icc returned two values, the ICC and the confidence intervals. Before, both values where put into the same column. Now you can specify two column names in Custom Aggregation to split the values. CustomOperation(... func=A.icc, column_name=[("icc", "my_column"), ("icc_ci", "my_ci_column")]) (#176)
  • Confidence intervals where added as a aggregation function. (#176)

Removed

  • In anticipation of the tpcp>2.0 update, we removed the score methods from all pipelines. You now need to provide the scorer directly to the MetaPipelines/Optimizers. (#182)

[0.10.0] - 2024-10-22

Added

  • Added a simple way to add performance (as in "time algo needs to run") tracking to algorithms
  • Performance values are now reported by the GSD algorithms and the full pipelines via the perf_ attribute.
  • The TVS datasets now have two new columns {test/recording}_name and {test/recording}_name_pretty that have easier to read names for the tests and recordings.

Changed

  • Swapped out the peak detection per window in GSD Iluz to a custom vectorized one that can be jit compiled. This provides a 2-3x speedup for large inputs.
  • Critical path in GSD-Ionesco is now jit compiled.

For Developers

  • We now support adding a .env file to the root of the project to set environment variables and have them loaded automatically when running the tests or examples for the TVS dataset. When test depend on env vars, use the mobgap.misc.get_env_var function to access them.

[0.9.0] - 2024-10-21

Changed

  • The calculate_matched_gsd_performance_metrics function now always return the error metrics that depend on TN samples, not just when TN samples exist. This way the output structure is consistent, and we can avoid bugs in scorer functions, where some datapoints might unexpectedly return a different set of error metrics.
  • The official script to aggregate DMOs, now correctly converts stride length values to cm and variance values to cm^2. Note, that the internal function within mobgap, does not do this conversion automatically, as we use different units in mobgap internally. If you need to match the official script, you have to do this conversion manually. see examples/aggregation/_99_cvs_agg_pipeline_no_exc.py.

Fixed

  • The zero division hint for error functions did not properly replace the value with NaN, when the result of the operation was -inf instead of inf.

[0.8.0] - 2024-09-13

Scientific Changes

  • BREAKING: Using "high precision" standard value of gravity everywhere (9.80665 m/s^2). Before we were using 9.81 m/s^2 during the loading of MobiliseD matlab files and when calculating the stride length validity threshold. This should have a small impact on the results of most algorithms, hence, we marked it as a breaking change.

Added

  • The apply_aggregations and apply_transformation functions now have an option (on by default) to ignore transformations that expect columns that don't exist. This should allow the use of the default aggregations and transformations in more situations.

Changed

  • The matlab loader now accepts infoForAlgo files that are missing some of the optional fields.

[0.7.0] - 2024-09-13

Scientific Changes

  • CRITICAL: The default thresholds for both variants of the GSDIonescu algorithms were not correctly adapted to the input data being in m/s2 instead of g. With the old threshold far to many gait sequences were detected. This fix should have a substantial positive impact on the performance of the Impaired pipeline.

Fixed

  • as_samples now correctly preserves the index of a dataframe.

[0.6.0] - 2024-07-26

Added

  • Evaluation Examples for CAD and Stride Length (#174)

Changed

  • The reference data parser now allows missing information. In case either no turns, no ICs or no strides are available, the respective attribute of the reference data is set to None. Before this threw an error.

[0.5.1] - 2024-07-24

  • Correctly specified minimal pandas version as >=2.2.0

[0.5.0] - 2024-07-16

Changed

  • IMPORTANT:All algorithms now require the input to be in the body frame. This ensures that the correct axis is used in all cases. (#170, #171)
  • The Coordinate Sytem guide was reworked to cover all different coordinate systems and hopefully prevent common mistakes. (#170)
  • BaseMobilisedPipeline is renamed to GenericMobilisedPipeline. BaseMobilisedPipeline is now the "actual" baseclass for Mobilise-D pipelines, without any implementation.
  • MobilisedMetaPipeline has been renamed to MobilisedPipelineUniversal. This should make the naming more consistent (compare MobilisedPipelineHealthy, MobilisedPipelineImpaired).
  • MobilisedPipelineUniversal now has proxy attributes for all the primary results expected in BaseMobilisedPipeline and can be used as a drop-in replacement for GenericMobilisedPipeline.
  • Gaitmap has been removed as a dependency. All used functions are now vendored. This should reduce the number of dependency mobgap pulls in. (#166)
  • Significant performance improvements for GsdAdaptiveIonescu. (#167). Thanks @tbonewmy !

Fixed

  • IMPORTANT: The TD Elgohary algorithm was using the wrong axis for the calculation of the angle. This was fixed and the results should now be correct. (#170)
  • The turning and stride length algorithm both had an option to be used with prior orientation estimation. However, this did not work due to a coordinate system mismatch. The Madgwick algorithm has been modified to allow returning data in the correct coordinate system. (#170)
  • In case no index-col-names are passed to the DatasetFromData class and the data index is just a string and not a tuple, a generic column (level_0) is correctly created. (#164)
  • The coordinate system was not documented correctly. y and z axis were flipped in the description.

Added

  • The Madgwick algorithm is now available in the mobgap package. The implementation is copied from gaitmap. Note, that its implementation differs slightly from the original Madgwick implementation and it supports the concept of body frame data, that only exists in mobgap. (#166, #170)
  • A bunch new helper tools to perform coordinate transforms. (#170)
  • Added various array and rotation helpers that were originally imported from gaitmap. (#166)
  • Added explicit evaluation challenges for the Gait Sequence Detection algorithms. (#169)

[0.4.0] - 2024-06-17

Added

  • Example on how to create custom datasets (#155, #145)
  • Pipelines to run the P1 and P2 Mobilise-D pipelines (#145)
  • A "naive" walking speed calculation method that just multiplies the cadence with the step length. (#148)
  • The Zjilstra Stride/Step length algorithm (#142)
  • A new GSD algorithm called GsdIonescu (#143)
  • A new GSD algorithm called GsdAdaptiveIonescu (#53)
  • The ElGohary Turing Detection algorithm (#131)
  • The iter_gs method now has a new argument id_col_name to specify the column of the gs_list that should be used to infer the id of the returned regions. Even without the argument, the method will try to infer the id column. This allows the use with other region lists besides gs/wb lists. (#135)
  • The GsIterator now has a way to iterate sub-regions using iter_subregions, the subregion context manager and the with_subregion method. This allows for nested iteration over regions and is the basis of our support for "refined GaitSequence". (#135)
  • A refine_gs method that returns a new gait sequence that starts from the first IC and ends at the last IC. This can be used with the new subregion iteration to iterate over the subregions of a gait sequence. (#135)
  • The MobiliseDAggregator can now take None as grouping parameter, which results in all WBs being aggregated together. (#141)
  • The Multi-Df groupby now has a way to pass parameters to the underlying .groupby call. (#141)
  • A method to generate a stride list from initial contacts (strides_list_from_ic_lr_list). (#141)
  • A method to interpolate per-sec values to regions (usually strides) (naive_sec_paras_to_regions). (#141)
  • All the loader functions for the matlab format now have the option to skip a test, if either data or reference data is not available (#125)
  • Matlab-loader dataclasses now have the option to use a pre-computed index instead of deriving the test list by loading the Matlab file itself. This should help reducing the initial index creation time for large datasets. (#125)
  • A loader for the Mobilise-D TVS dataset, which will be published end on June. (#125)
  • General methods to calculate and aggregate error metrics for final WB-level parameters. (#126)

Changed

  • Renamed algorithm folders to easier to understand names(gsd -> gait_sequences, icd -> initial_contacts, lrc -> laterality) (#156)
  • Many algorithms are now more robust to short data and correctly return empty results if the data is too short instead of erroring out. (#145)
  • The Gait Sequence iterator does not allow for dynamic attribute access anymore. Only the use of the results_ object is allowed. (#135)
  • Aggregations for Typed iterators/Gait Sequence iterators only take a single argument now. This is a list of return type tuples that contain ALL results (before the agg funcs only got the values for one of the result attributes) and the input, but also additional context information that can be extended by the iterator object. (#135)
  • Reference WB ids now start at 0 again instead of 1.
  • Reference parameters like turns and initial contacts that exist per WB are now numbered per WB.
  • The MobilseDAggregator now uses new more expressive names by default. (#141)
  • The expected Cadence output now has a new column name cadence_spm instead of cad_spm. (#141)
  • The result attribute for Cadence in all Cadence algorithms and the GSIteration is now called cadence_per_sec instead of cad_per_sec. (#141)
  • The Mobilise-D datasets metadata attribute is renamed to recording_metadata and is now a dictionary instead of a named tuple. It also contains more information about the recording. (#141)
  • All file/directory based versions of the Mobilise-D datasets now require a measurement_condition argument. (#141)
  • All datasets now have a participant_metadata attribute that contains information about the participant. (#141)
  • The Cadence method does now include "incomplete" seconds. This means the "partial" last second of a recording is now included in the output. This ensures that all strides are covered by the output. (#141)
  • The MobiliseDAggregator does not round the output anymore and further does not convert stride length values to "cm" anymore. If this is desired, the user has to do this manually. (#145)

Fixed

  • The check that we assume a gs to start and end with an IC was not correctly performed for the end IC and a warning was raised incorrectly. This is now fixed. (#135)
  • The GsIterator does not throw an error anymore, if the GS list was empty. (#135)
  • The reference parameters for turns loaded from Matlab files now have the correct units.
  • GSDIonescu was incorrectly handling the end sample of the GS. This could lead to GS that end outside the data. (#145)

Removed

  • When data is loaded, the error/warning that none of the sensors where available is removed (#125)

[0.3.0] - 2024-04-23

Added

  • All dataset class have a new property called data_ss referring to the single sensor that should be used in the algorithms. This was added to move all data related config (i.e. which sensor to use in a pipeline) to the dataset class, making it easier to implement dataset agnostic pipelines (#119)
  • A evaluation pipeline for GSD (#124)
  • ML based LR classification (#106)
  • A evaluation/optimization pipeline for LRC (#106)

Changed

  • The loaded reference data now has stricter dtypes (#119)
  • Renamed LRD (left-right-detection) to LRC (left-right-classification) (#106)
  • For GSD and IC evaluation metrics it is now possible to configure what happens in case of 0-division (#127)

Fixed

  • Loading the reference data from a trial without identified WBs will not raise an error anymore, but will correctly return an empty DataFrame (#119)
  • We use operating system independent pandas dtypes everywhere (#118)
  • Fixed issue with multi-groupby, that grouping by just a subset of the index cols resulted in cryptic error messages (#132)

Development

  • RTD previews are now build for PRs.

[0.2.0] - 2024-03-22

Added

  • Example Data fetching via Pooch (#113)

[0.1.0] - 2024-03-22

Initial Release