GitHub - aimclub/Fedot.Industrial: Python framework for automated time series classification, regression and forecasting

Code
CI/CD
Docs & Examples
Downloads
Support
Languages
Funding

Fedot.Ind is a automated machine learning framework designed to solve industrial problems related to time series forecasting, classification, and regression. It is based on the AutoML framework FEDOT and utilizes its functionality to build and tune pipelines.

Installation

Fedot.Ind is available on PyPI and can be installed via pip:

pip install fedot_ind

To install the latest version from the main branch:

git clone https://github.com/aimclub/Fedot.Industrial.git
cd FEDOT.Industrial
poetry install

How to Use

Fedot.Ind provides a high-level API that allows you to use its capabilities in a simple way. The API can be used for classification, regression, and time series forecasting problems, as well as for anomaly detection.

To use the API, follow these steps:

Import FedotIndustrial class

from fedot_ind.api.main import FedotIndustrial

2. Initialize the FedotIndustrial object and define the type of modeling task. It provides a fit/predict interface:

FedotIndustrial.fit() begins the feature extraction, optimization and returns the resulting composite pipeline;
FedotIndustrial.predict() predicts target values for the given input data using an already fitted pipeline;
FedotIndustrial.get_metrics() estimates the quality of predictions using selected metrics.

NumPy arrays or Pandas DataFrames can be used as sources of input data. In the case below, x_train / x_test, y_train / y_test are pandas.DataFrame() and numpy.ndarray respectively:

dataset_name = 'Epilepsy'
industrial = FedotIndustrial(problem='classification',
                             metric='f1',
                             timeout=5,
                             n_jobs=2,
                             logging_level=20)

train_data, test_data = DataLoader(dataset_name=dataset_name).load_data()

model = industrial.fit(train_data)

labels = industrial.predict(test_data)
probs = industrial.predict_proba(test_data)
metrics = industrial.get_metrics(target=test_data[1],
                                 rounding_order=3,
                                 metric_names=['f1', 'accuracy', 'precision', 'roc_auc'])

More information about the API is available in the documentation section:

Documentation and examples

The comprehensive documentation is available on wikipage.

Useful tutorials and examples can be found in the examples folder.

Topic	Example
Classification	Basic , Federated AutoML, Proba Calibration, Multimodal
Regression	Basic
Forecasting	Basic, Exogen, With strategy
Model ensemble	Kernel Ensemble

Benchmarking

Univariate time series classification

Benchmarking was performed on the collection of 112 out of 144 datasets from the UCR archive.

Algorithm	Top-1	Top-3	Top-5	Top-Half
Fedot_Industrial	17.0	23.0	26.0	38
HC2	16.0	55.0	77.0	88
FreshPRINCE	15.0	22.0	32.0	48
InceptionT	14.0	32.0	54.0	69
Hydra-MR	13.0	48.0	69.0	77
RDST	7.0	21.0	50.0	73
RSTSF	6.0	19.0	35.0	65
WEASEL_D	4.0	20.0	36.0	59
TS-CHIEF	3.0	11.0	21.0	30
HIVE-COTE v1.0	2.0	9.0	18.0	27
PF	2.0	9.0	27.0	40

Multivariate time series classification

Benchmarking was performed on the following datasets: BasicMotions, Cricket, LSST, FingerMovements, HandMovementDirection, NATOPS, PenDigits, RacketSports, Heartbeat, AtrialFibrillation, SelfRegulationSCP2

Algorithm	Mean Rank
HC2	5.038
ROCKET	6.481
Arsenal	7.615
Fedot_Industrial	7.712
DrCIF	7.712
CIF	8.519
MUSE	8.700
HC1	9.212
TDE	9.731
ResNet	10.346
mrseql	10.625

Time series regression

Benchmarking was performed on the following datasets: HouseholdPowerConsumption1, AppliancesEnergy, HouseholdPowerConsumption2, IEEEPPG, FloodModeling1, BeijingPM25Quality, BenzeneConcentration, FloodModeling3, BeijingPM10Quality, FloodModeling2, AustraliaRainfall

Algorithm	Mean Rank
FreshPRINCE	6.014
DrCIF	6.786
Fedot_Industrial	8.114
InceptionT	8.957
RotF	9.414
RIST	9.786
TSF	9.929
RandF	10.286
MultiROCKET	10.557
ResNet	11.171
SingleInception	11.571

Real world cases

Building energy consumption

Link to the dataset on Kaggle

Full notebook with solution is here

The challenge is to develop accurate counterfactual models that estimate energy consumption savings post-retrofit. Leveraging a dataset comprising three years of hourly meter readings from over a thousand buildings, the goal is to predict energy consumption (in kWh). Key predictors include air temperature, dew temperature, wind direction, and wind speed.

Results:

Algorithm	RMSE_average
FPCR	455.941
Grid-SVR	464.389
FPCR-Bs	465.844
5NN-DTW	469.378
CNN	484.637
Fedot.Industrial	486.398
RDST	527.927
RandF	527.343

Permanent magnet synchronous motor (PMSM) rotor temperature

Link to the dataset on Kaggle

Full notebook with solution is here

This dataset focuses on predicting the maximum recorded rotor temperature of a permanent magnet synchronous motor (PMSM) during 30-second intervals. The data, sampled at 2 Hz, includes sensor readings such as ambient temperature, coolant temperatures, d and q components of voltage, and current. These readings are aggregated into 6-dimensional time series of length 60, representing 30 seconds.

The challenge is to develop a predictive model using the provided predictors to accurately estimate the maximum rotor temperature, crucial for monitoring the motor's performance and ensuring optimal operating conditions.

Results:

Algorithm	RMSE_average
Fedot.Industrial	1.158612
FreshPRINCE	1.490442
RIST	1.501047
RotF	1.559385
DrCIF	1.594442
TSF	1.684828

R&D plans

– Expansion of anomaly detection model list.

– Development of new time series forecasting models.

– Implementation of explainability module (Issue)

Citation

Here we will provide a list of citations for the project as soon as the articles are published.

@article{REVIN2023110483,
title = {Automated machine learning approach for time series classification pipelines using evolutionary optimisation},
journal = {Knowledge-Based Systems},
pages = {110483},
year = {2023},
issn = {0950-7051},
doi = {https://doi.org/10.1016/j.knosys.2023.110483},
url = {https://www.sciencedirect.com/science/article/pii/S0950705123002332},
author = {Ilia Revin and Vadim A. Potemkin and Nikita R. Balabanov and Nikolay O. Nikitin
}

Supported by

The study is supported by the Research Center Strong Artificial Intelligence in Industry of ITMO University as part of the plan of the center's program: Development of AutoML framework for industrial tasks.

Name	Name	Last commit message	Last commit date
Latest commit v1docq Updates after Neiry colab (#165 ) Oct 17, 2024 25f5497 · Oct 17, 2024 History 321 Commits
.github	.github	feat: get rid of ts files, convert to arff (#164 )	Sep 24, 2024
benchmark	benchmark	Industrial Release 0.4.2 (#141 )	May 17, 2024
docs/img	docs/img	Updates after Neiry colab (#165 )	Oct 17, 2024
examples	examples	Updates after Neiry colab (#165 )	Oct 17, 2024
fedot_ind	fedot_ind	Updates after Neiry colab (#165 )	Oct 17, 2024
tests	tests	Updates after Neiry colab (#165 )	Oct 17, 2024
.codecov.yml	.codecov.yml	updated coverage upload script (#142 )	Jun 13, 2024
.gitignore	.gitignore	Fedot 3 (#91 )	Sep 29, 2023
.pep8speaks.yml	.pep8speaks.yml	Documentation update (#97 )	Oct 13, 2023
LICENSE.md	LICENSE.md	Create LICENSE.md	Aug 30, 2022
MANIFEST.in	MANIFEST.in	API refactoring (#67 )	Apr 19, 2023
README.rst	README.rst	Updates after Neiry colab (#165 )	Oct 17, 2024
README_en.rst	README_en.rst	Updates after Neiry colab (#165 )	Oct 17, 2024
pyproject.toml	pyproject.toml	Release Fedot.Industrial 0.5. Anomaly detection, Forecasting,Sampling…	Jul 12, 2024
requirements.txt	requirements.txt	Release Fedot.Industrial 0.5. Anomaly detection, Forecasting,Sampling…	Jul 12, 2024
setup.py	setup.py	Autopep8 action (#140 )	May 17, 2024
sweep.yaml	sweep.yaml	Configure Sweep (#147 )	May 28, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Installation

How to Use

Documentation and examples

Benchmarking

Univariate time series classification

Multivariate time series classification

Time series regression

Real world cases

Building energy consumption

Permanent magnet synchronous motor (PMSM) rotor temperature

R&D plans

Citation

Supported by

About

Releases 2

Contributors 17

Languages

License

aimclub/Fedot.Industrial

Folders and files

Latest commit

History

Repository files navigation

Installation

How to Use

Documentation and examples

Benchmarking

Univariate time series classification

Multivariate time series classification

Time series regression

Real world cases

Building energy consumption

Permanent magnet synchronous motor (PMSM) rotor temperature

R&D plans

Citation

Supported by

About

Topics

Resources

License

Stars

Watchers

Forks

Releases 2

Contributors 17

Languages