Continuous Adaptation for Machine Learning System to Data Changes (#TFCommunitySpotlight Awarded)

An MLOps system evolves as the world changes, and that evolution is usually driven by data or concept drift. This project shows how to combine two separate pipelines, one for batch prediction and the other for training, so that the system adapts to data changes. We worked with the TFX team to author a blog post detailing our approach. The blog post is available here: https://blog.tensorflow.org/2021/12/continuous-adaptation-for-machine.html.
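
As a rough illustration of how the two pipelines can be linked, the sketch below compares batch prediction results against ground-truth labels and reports whether retraining should start. The function names, the `(predicted, actual)` record layout, and the 0.9 accuracy threshold are assumptions made for this example; they are not taken from the project's code.

```python
from typing import Iterable, Tuple


def batch_accuracy(results: Iterable[Tuple[str, str]]) -> float:
    """Return the fraction of (predicted, actual) label pairs that match."""
    pairs = list(results)
    if not pairs:
        raise ValueError("no prediction results to evaluate")
    correct = sum(1 for predicted, actual in pairs if predicted == actual)
    return correct / len(pairs)


def needs_retraining(results: Iterable[Tuple[str, str]], threshold: float = 0.9) -> bool:
    """Flag possible drift when batch accuracy falls below the assumed threshold."""
    return batch_accuracy(results) < threshold
```

In this sketch, a `True` result from `needs_retraining` would be the signal for the batch prediction side to start the training pipeline.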

We assume the reader is familiar with basic MLOps concepts (pipelines, data drift, batch prediction, etc.), TensorFlow, TensorFlow Extended (TFX), and Vertex AI.

An MLOps system can also evolve when a much better algorithm (e.g. a state-of-the-art model) comes out. In that case, the system should adopt the better algorithm to understand the existing data better. We have demonstrated such workflows in the following projects:

Model Training as a CI/CD System Part1: Reflect changes in codebase to MLOps pipeline: Code on GitHub, Article on the GCP blog
Model Training as a CI/CD System Part2: Trigger, schedule, and run MLOps pipelines: Code on GitHub, Article on the GCP blog

Workflow

Run the initial training pipeline to train an image classifier and deploy it using TensorFlow, TFX, and Vertex AI (02_TFX_Training_Pipeline.ipynb).
Download and prepare images from Bing search to simulate the data drift (97_Prepare_Test_Images.ipynb).
Generate the batch prediction pipeline specification (JSON) (03_Batch_Prediction_Pipeline.ipynb).
Deploy a cloud function that checks whether enough sample data has accumulated and, if so, triggers the batch prediction pipeline (04_Cloud_Scheduler_Trigger.ipynb).
Schedule a periodic job to run the deployed cloud function (04_Cloud_Scheduler_Trigger.ipynb).
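
The gating logic in the last two steps can be sketched as follows. This is a minimal illustration and not the deployed cloud function: `list_samples` and `submit_pipeline` are hypothetical hooks standing in for storage listing and pipeline submission, and the default `min_samples` value is an assumption.

```python
from typing import Callable, Sequence


def maybe_trigger_batch_prediction(
    list_samples: Callable[[], Sequence[str]],
    submit_pipeline: Callable[[Sequence[str]], None],
    min_samples: int = 100,
) -> bool:
    """Submit the batch prediction pipeline only when enough samples exist.

    Both callables are hypothetical: a real deployment would wrap the
    storage client and the pipeline service behind them.
    """
    samples = list_samples()
    if len(samples) < min_samples:
        # Not enough data yet; the next scheduled run will check again.
        return False
    submit_pipeline(samples)
    return True
```

Passing the two hooks in as arguments keeps the check easy to exercise locally with fake functions before wiring it to real services.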

Custom components

We developed several custom components in TFX for this project. You can find them under the custom_components directory.
