feat: Implement cache for predictions #1334

DRMPN · 2024-09-10T20:41:49Z

This is a 🙋 feature or enhancement.

Summary

⚠WIP⚠

Implements DataCache for composer.
- Adds cache for metrics.
- Adds cache for node's predictions.
Renames pipelines_cache to operations_cache.
Removes ancient .pyc files.

Context

Resolves #1291

…tional database

pep8speaks · 2024-09-10T20:42:06Z

Hello @DRMPN! Thanks for updating this PR. We checked the lines you've touched for PEP 8 issues, and found:

In the file fedot/core/caching/data_cache.py:

Line 2:1: F401 'typing.List' imported but unused
Line 2:1: F401 'typing.Union' imported but unused

In the file fedot/core/caching/data_cache_db.py:

Line 4:1: F401 'os.getpid' imported but unused
Line 5:1: F401 'typing.List' imported but unused
Line 5:1: F401 'typing.Tuple' imported but unused
Line 5:1: F401 'typing.TypeVar' imported but unused
Line 7:1: F401 'numpy as np' imported but unused

In the file fedot/core/optimisers/objective/data_objective_eval.py:

Line 134:9: F841 local variable 'predicted_train' is assigned to but never used

In the file fedot/core/pipelines/pipeline.py:

Line 97:121: E501 line too long (122 > 120 characters)

In the file test-cache.py:

Line 7:1: F401 'cProfile' imported but unused
Line 8:1: F401 'pstats' imported but unused
Line 9:1: F401 'pstats.SortKey' imported but unused

Comment last updated at 2024-12-17 08:51:30 UTC

github-actions · 2024-09-10T20:42:47Z

Code in this pull request still contains PEP8 errors, please write the /fix-pep8 command in the comments below to create commit with automatic fixes.

Comment last updated at Tue, 17 Dec 2024 11:52:09

kasyanovse · 2024-10-13T08:06:36Z

fedot/core/caching/data_cache_db.py

+            with closing(sqlite3.connect(self.db_path)) as conn:
+                with conn:


Почему бы не реализовать DataCacheDB как синглтон, подключаясь к БД один раз при инициализации (реинициализации в новых инстансах питона в многопотоке)?

DRMPN · 2024-11-27T20:02:27Z

Для pipeline и nodes уже есть кэширование, если правильно понимаю, поэтому сделал кэширование для метрик.

По логам можно сделать вывод, что композирование работает быстрее без этого кэша. Я смотрел на параметр s/gen - количество затрачиваемых секунд на одну генерацию.

Возможно для более точных результатов эксперимента нужно увеличить количество запусков хотя бы 10 для каждого варианта, увеличить timeout и запускать на сервере, а не на локальном компьютере.

С кэшом метрик:

2024-11-27 22:35:45,122 - Topological features operation requires extra dependencies for time series forecasting, which are not installed. It can infuence the performance. Please install it by 'pip install fedot[extra]'
2024-11-27 22:35:45,587 - ApiComposer - Initial pipeline was fitted in 0.3 sec.
2024-11-27 22:35:45,587 - ApiComposer - Taking into account n_folds=5, estimated fit time for initial assumption is 1.5 sec.
2024-11-27 22:35:45,591 - ApiComposer - AutoML configured. Parameters tuning: False. Time limit: 3 min. Set of candidate models: ['adareg', 'catboostreg', 'dtreg', 'fast_ica', 'isolation_forest_reg', 'knnreg', 'lasso', 'lgbmreg', 'linear', 'normalization', 'pca', 'poly_features', 'ransac_lin_reg', 'ransac_non_lin_reg', 'rfr', 'ridge', 'scaling', 'sgdr', 'svr', 'xgboostreg'].
2024-11-27 22:35:45,608 - ApiComposer - Pipeline composition started.
Generations:   0%|                                                                                               | 0/10000 [00:00<?, ?gen/s]2024-11-27 22:35:47,836 - MultiprocessingDispatcher - 2 individuals out of 2 in previous population were evaluated successfully.
2024-11-27 22:36:18,285 - MultiprocessingDispatcher - 21 individuals out of 21 in previous population were evaluated successfully.
2024-11-27 22:36:44,500 - MultiprocessingDispatcher - 14 individuals out of 14 in previous population were evaluated successfully.
2024-11-27 22:37:06,453 - MultiprocessingDispatcher - 18 individuals out of 18 in previous population were evaluated successfully.
Generations:   0%|                                                                                   | 1/10000 [01:20<224:33:35, 80.85s/gen]2024-11-27 22:37:49,802 - MultiprocessingDispatcher - 24 individuals out of 24 in previous population were evaluated successfully.
2024-11-27 22:37:58,044 - MultiprocessingDispatcher - 7 individuals out of 7 in previous population were evaluated successfully.
Generations:   0%|                                                                                   | 2/10000 [02:12<176:44:30, 63.64s/gen]2024-11-27 22:38:32,890 - MultiprocessingDispatcher - 31 individuals out of 31 in previous population were evaluated successfully.
Generations:   0%|                                                                                   | 3/10000 [03:04<162:15:07, 58.43s/gen]2024-11-27 22:38:50,374 - GroupedCondition - Optimisation stopped: Time limit is reached
Generations:   0%|                                                                                   | 3/10000 [03:04<171:01:37, 61.59s/gen]
2024-11-27 22:38:50,508 - ApiComposer - Model generation finished
2024-11-27 22:38:50,535 - FEDOT logger - Final pipeline was fitted
2024-11-27 22:38:50,536 - FEDOT logger - Final pipeline: {'depth': 2, 'length': 2, 'nodes': [linear, scaling]}
linear - {}
scaling - {}
{'rmse': 0.0}
                           days  hours  minutes  seconds  milliseconds
Data Definition (fit)         0      0        0        0            34
Data Preprocessing            0      0        0        5           551
Fitting (summary)             0      0        3        5           260
Composing                     0      0        3        5           102
Train Inference               0      0        0        0            25
Tuning (composing)            0      0        0        0             0
Tuning (after)                0      0        0        0             0
Data Definition (predict)     0      0        0        0             6
Predicting                    0      0        0        0             6

Без кэша метрик:

2024-11-27 22:40:13,935 - Topological features operation requires extra dependencies for time series forecasting, which are not installed. It can infuence the performance. Please install it by 'pip install fedot[extra]'
2024-11-27 22:40:14,493 - ApiComposer - Initial pipeline was fitted in 0.5 sec.
2024-11-27 22:40:14,493 - ApiComposer - Taking into account n_folds=5, estimated fit time for initial assumption is 2.3 sec.
2024-11-27 22:40:14,498 - ApiComposer - AutoML configured. Parameters tuning: False. Time limit: 3 min. Set of candidate models: ['adareg', 'catboostreg', 'dtreg', 'fast_ica', 'isolation_forest_reg', 'knnreg', 'lasso', 'lgbmreg', 'linear', 'normalization', 'pca', 'poly_features', 'ransac_lin_reg', 'ransac_non_lin_reg', 'rfr', 'ridge', 'scaling', 'sgdr', 'svr', 'xgboostreg'].
2024-11-27 22:40:14,515 - ApiComposer - Pipeline composition started.
Generations:   0%|                                                                                               | 0/10000 [00:00<?, ?gen/s]2024-11-27 22:40:16,628 - MultiprocessingDispatcher - 2 individuals out of 2 in previous population were evaluated successfully.
2024-11-27 22:40:45,262 - MultiprocessingDispatcher - 21 individuals out of 21 in previous population were evaluated successfully.
2024-11-27 22:41:09,459 - MultiprocessingDispatcher - 14 individuals out of 14 in previous population were evaluated successfully.
2024-11-27 22:41:29,641 - MultiprocessingDispatcher - 18 individuals out of 18 in previous population were evaluated successfully.
Generations:   0%|                                                                                   | 1/10000 [01:15<208:40:15, 75.13s/gen]2024-11-27 22:42:14,612 - MultiprocessingDispatcher - 27 individuals out of 27 in previous population were evaluated successfully.
2024-11-27 22:42:20,201 - MultiprocessingDispatcher - 3 individuals out of 3 in previous population were evaluated successfully.
2024-11-27 22:42:21,971 - MultiprocessingDispatcher - 3 individuals out of 3 in previous population were evaluated successfully.
Generations:   0%|                                                                                   | 2/10000 [02:07<171:24:25, 61.72s/gen]2024-11-27 22:42:59,166 - MultiprocessingDispatcher - 28 individuals out of 28 in previous population were evaluated successfully.
Generations:   0%|                                                                                   | 3/10000 [03:02<163:16:10, 58.79s/gen]2024-11-27 22:43:17,349 - GroupedCondition - Optimisation stopped: Time limit is reached
Generations:   0%|                                                                                   | 3/10000 [03:02<169:14:14, 60.94s/gen]
2024-11-27 22:43:17,455 - ApiComposer - Model generation finished
2024-11-27 22:43:17,480 - FEDOT logger - Final pipeline was fitted
2024-11-27 22:43:17,480 - FEDOT logger - Final pipeline: {'depth': 3, 'length': 3, 'nodes': [linear, scaling, resample]}
linear - {}
scaling - {}
resample - {'balance': 'expand_minority', 'replace': False, 'balance_ratio': 1}
{'rmse': 0.0}
                           days  hours  minutes  seconds  milliseconds
Data Definition (fit)         0      0        0        0            42
Data Preprocessing            0      0        0        5           322
Fitting (summary)             0      0        3        3           452
Composing                     0      0        3        3           324
Train Inference               0      0        0        0            22
Tuning (composing)            0      0        0        0             0
Tuning (after)                0      0        0        0             0
Data Definition (predict)     0      0        0        0             7
Predicting                    0      0        0        0             7

Еще раз с кэшом метрик:

2024-11-27 22:44:42,984 - Topological features operation requires extra dependencies for time series forecasting, which are not installed. It can infuence the performance. Please install it by 'pip install fedot[extra]'
2024-11-27 22:44:43,455 - ApiComposer - Initial pipeline was fitted in 0.4 sec.
2024-11-27 22:44:43,455 - ApiComposer - Taking into account n_folds=5, estimated fit time for initial assumption is 1.8 sec.
2024-11-27 22:44:43,460 - ApiComposer - AutoML configured. Parameters tuning: False. Time limit: 3 min. Set of candidate models: ['adareg', 'catboostreg', 'dtreg', 'fast_ica', 'isolation_forest_reg', 'knnreg', 'lasso', 'lgbmreg', 'linear', 'normalization', 'pca', 'poly_features', 'ransac_lin_reg', 'ransac_non_lin_reg', 'rfr', 'ridge', 'scaling', 'sgdr', 'svr', 'xgboostreg'].
2024-11-27 22:44:43,476 - ApiComposer - Pipeline composition started.
Generations:   0%|                                                                      | 0/10000 [00:00<?, ?gen/s]2024-11-27 22:44:45,754 - MultiprocessingDispatcher - 2 individuals out of 2 in previous population were evaluated successfully.
2024-11-27 22:45:16,021 - MultiprocessingDispatcher - 21 individuals out of 21 in previous population were evaluated successfully.
2024-11-27 22:45:40,506 - MultiprocessingDispatcher - 14 individuals out of 14 in previous population were evaluated successfully.
2024-11-27 22:46:01,911 - MultiprocessingDispatcher - 18 individuals out of 18 in previous population were evaluated successfully.
Generations:   0%|                                                          | 1/10000 [01:18<217:51:55, 78.44s/gen]2024-11-27 22:46:45,402 - MultiprocessingDispatcher - 24 individuals out of 24 in previous population were evaluated successfully.
2024-11-27 22:46:53,188 - MultiprocessingDispatcher - 7 individuals out of 7 in previous population were evaluated successfully.
Generations:   0%|                                                          | 2/10000 [02:09<173:28:16, 62.46s/gen]2024-11-27 22:47:39,401 - MultiprocessingDispatcher - 32 individuals out of 32 in previous population were evaluated successfully.
Generations:   0%|                                                          | 3/10000 [03:07<167:40:59, 60.38s/gen]2024-11-27 22:47:51,160 - GroupedCondition - Optimisation stopped: Time limit is reached
Generations:   0%|                                                          | 3/10000 [03:07<173:43:41, 62.56s/gen]
2024-11-27 22:47:51,275 - ApiComposer - Model generation finished
2024-11-27 22:47:51,740 - FEDOT logger - Final pipeline was fitted
2024-11-27 22:47:51,741 - FEDOT logger - Final pipeline: {'depth': 4, 'length': 4, 'nodes': [linear, ridge, scaling, isolation_forest_reg]}
linear - {}
ridge - {}
scaling - {}
isolation_forest_reg - {}
{'rmse': 0.0}
                           days  hours  minutes  seconds  milliseconds
Data Definition (fit)         0      0        0        0            37
Data Preprocessing            0      0        0        5           166
Fitting (summary)             0      0        3        8           651
Composing                     0      0        3        8            74
Train Inference               0      0        0        0           463
Tuning (composing)            0      0        0        0             0
Tuning (after)                0      0        0        0             0
Data Definition (predict)     0      0        0        0             5
Predicting                    0      0        0        0             7

nicl-nno · 2024-11-27T22:35:32Z

Для pipeline и nodes уже есть кэширование, если правильно понимаю

Ты же вроде его модифицировал на кэширование данных?

По логам можно сделать вывод, что композирование работает быстрее без этого кэша.

А фактически сколько попаданий в кэш происходит? И сколько занимает одно обращение к нему?

Если много - то кэш метрик можно держать в памяти, а не на диске. Да и данных тоже, если в этом проблема.

Также можно задать какую-то специфическую начальную популяцию, где пайплайны сильно пересекаются по структуре. Так эффект кэша будет более заметен

DRMPN · 2024-11-28T22:52:39Z

Сделал кэширование для промежуточных метрик, в текущем применении не заметил триггер.

Сделал кэширование node для fit и predict.
Количество сохранений и загрузок кэша совпадает с количеством операций для метрик.

Прогнал несколько раз, при маленьком timeout досчитывает, а при большом заканчивается всегда так:

2024-11-29 00:35:26,356 - PipelineObjectiveEvaluate - --- save evaluate metrics cache
2024-11-29 00:35:26,478 - PipelineNode - -- load fit node data_cache
2024-11-29 00:35:26,479 - PipelineObjectiveEvaluate - Pipeline is not fitted yet
2024-11-29 00:35:26,479 - MetricsObjective - Objective evaluation error for graph {'depth': 3, 'length': 3, 'nodes': [ridge, resample, scaling]} on metric rmse: Metric can not be evaluated because of: Pipeline is not fitted yet
2024-11-29 00:35:26,479 - PipelineObjectiveEvaluate - --- save evaluate metrics cache

30_min_out.txt
30_min_out_1.txt

Возможно для глубоких/широких пайплайнов (length >3) накапливается/появляется ошибка в кэше.
Мое предположение: либо можно опустить кэш для fit, либо придумать обработку посложнее.

DRMPN · 2024-11-28T23:02:42Z

Ты же вроде его модифицировал на кэширование данных?

Я operations_cache.py имел ввиду. Тогда в каких случаях он работает?

А фактически сколько попаданий в кэш происходит? И сколько занимает одно обращение к нему?

Для метрик в одном случае получилось 135 загрузок на 805 сохранений ~= 16.7%
Для node должно получиться также, ибо их количество совпадает.

Также можно задать какую-то специфическую начальную популяцию, где пайплайны сильно пересекаются по структуре.

Вроде видел, что можно создать список из нескольких пайплайнов и дать Федоту прогнать только их. Тогда он запустится без композирования и выберет самый оптимальный, да? Данный кэш именно с композированием сейчас работает.

nicl-nno · 2024-11-29T09:50:36Z

Я operations_cache.py имел ввиду. Тогда в каких случаях он работает?

Ты написал что кэш на уровне узлов и пайплайнов уже есть. Но в этом PR-е он модифицирован, поэтому это тоже может влиять не эффективность.

Возможно для глубоких/широких пайплайнов (length >3) накапливается/появляется ошибка в кэше.

Сохрани такую историю оптимизации и попробуй по ней конкретную ситуацию воспроизвести.

nicl-nno · 2024-11-29T09:51:17Z

Вроде видел, что можно создать список из нескольких пайплайнов и дать Федоту прогнать только их. Тогда он запустится без композирования и выберет самый оптимальный, да? Данный кэш именно с композированием сейчас работает

Я про то, что можно задать такие начальные условия для структурной оптимизации, которые максимизируют пользу от кэша (если это нужно для тестирования).

DRMPN added 6 commits September 1, 2024 00:15

refactor: rename pipeline cache to operation cache

4abcae2

chore: Add test-cache.py for benchmarking and debugging purposes

e2f76d8

chore: add TODOs to insert the data_cache functionality

fcab118

feat: Add DataCache class for storing and loading predictions

5248af3

feat: Add DataCacheDB class for caching predicted output using a rela…

ef90a68

…tional database

chore: add TODO to save the predictions

77e47ca

DRMPN added enhancement New feature or request in progress task in progress composer Related to GP-composition algorithm labels Sep 10, 2024

DRMPN self-assigned this Sep 10, 2024

DRMPN changed the title ~~Implement cache for predictions~~ WIP: Implement cache for predictions Sep 10, 2024

DRMPN changed the title ~~WIP: Implement cache for predictions~~ feat: Implement cache for predictions Sep 10, 2024

kasyanovse reviewed Oct 13, 2024

View reviewed changes

DRMPN added 14 commits November 20, 2024 00:49

feat: change the logic to save the entire OutputData instead

cfaef4f

feat: get/put pickled OutputData into SQL table

612b40e

chore: modify test script to use generated dataset

8b68240

chore: modify error message

e86ec6a

feat: pass data_cache parameter down to store a prediction in DB

064ab27

feat: test access to the stored data

6edef1f

chore: remove old .pyc files

5328fe9

Merge remote-tracking branch 'origin/master' into DRMPN-better-caching

a9624d1

chore: add comment to remove redundant param

20a401d

fix: take blob column instead of str

0ed1309

fix: generate better dataset

64734b3

feat: load predicted data from cache to calculate loss function

662c248

chore: decrease timeout for test script

8a5b424

feat: add cache for pipeline metrics

9fe700e

DRMPN added 2 commits November 28, 2024 21:18

feat: add intermediate metrics' cache

12b96ad

feat: add fit/predict cache for a single node

d75b8d9

DRMPN added 6 commits December 12, 2024 04:03

feat: add cache effectiveness metric

1f4980b

feat: save cache effectiveness to csv file

a4718a4

feat: extract metrics cache to dictionary

33e3234

fix: turn on prediction cache

fa08705

chore: turn off fit cache

bab641f

fix: check and grab metric's cache before fit

54f2232

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

feat: Implement cache for predictions #1334

feat: Implement cache for predictions #1334

DRMPN commented Sep 10, 2024 •

edited

Loading

pep8speaks commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

kasyanovse Oct 13, 2024 •

edited

Loading

DRMPN commented Nov 27, 2024

nicl-nno commented Nov 27, 2024

DRMPN commented Nov 28, 2024

DRMPN commented Nov 28, 2024 •

edited

Loading

nicl-nno commented Nov 29, 2024

nicl-nno commented Nov 29, 2024

		with closing(sqlite3.connect(self.db_path)) as conn:
		with conn:

feat: Implement cache for predictions #1334

Are you sure you want to change the base?

feat: Implement cache for predictions #1334

Conversation

DRMPN commented Sep 10, 2024 • edited Loading

Summary

Context

pep8speaks commented Sep 10, 2024 • edited Loading

Comment last updated at 2024-12-17 08:51:30 UTC

github-actions bot commented Sep 10, 2024 • edited Loading

Comment last updated at Tue, 17 Dec 2024 11:52:09

kasyanovse Oct 13, 2024 • edited Loading

Choose a reason for hiding this comment

DRMPN commented Nov 27, 2024

nicl-nno commented Nov 27, 2024

DRMPN commented Nov 28, 2024

DRMPN commented Nov 28, 2024 • edited Loading

nicl-nno commented Nov 29, 2024

nicl-nno commented Nov 29, 2024

DRMPN commented Sep 10, 2024 •

edited

Loading

pep8speaks commented Sep 10, 2024 •

edited

Loading

github-actions bot commented Sep 10, 2024 •

edited

Loading

kasyanovse Oct 13, 2024 •

edited

Loading

DRMPN commented Nov 28, 2024 •

edited

Loading