Metaflow TensorFlow decorator

The tf.distribute.Strategy API lets TensorFlow developers distribute model training across multiple GPUs/TPUs and machines. This repository implements the Metaflow @tensorflow decorator, which sets up a multi-node Metaflow step to use this functionality.

Features

Installation

Install this experimental module:

pip install metaflow-tensorflow

Getting Started

This package adds a Metaflow extension to your existing Metaflow installation, so you can import and use the @tensorflow decorator.

from metaflow import FlowSpec, step, tensorflow, ...

The rest of this README.md file describes how you can use TensorFlow with Metaflow in the single-node case and in the multi-node case, which requires @tensorflow.

TensorFlow Distributed on Metaflow guide

The examples in this repository are based on the original TensorFlow Examples.

Examples and guides

Directory	TensorFlow script description
MirroredStrategy	Synchronous distributed training on multiple GPUs on one machine.
MultiWorkerMirroredStrategy	Synchronous distributed training across multiple workers, each with potentially multiple GPUs.
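
For context on the multi-worker case: MultiWorkerMirroredStrategy discovers its peers through the TF_CONFIG environment variable, which must be set before the strategy is created. The sketch below shows the shape of that variable for one worker. It is an illustration only, not this extension's code; the @tensorflow decorator presumably arranges something equivalent for you. The function names, host addresses, and port are hypothetical.

```python
import json
import os


def build_tf_config(worker_hosts, node_index, port=12345):
    """Return the TF_CONFIG JSON string for one worker in the cluster.

    worker_hosts, node_index, and port are hypothetical inputs; in a real
    multi-node step they would come from the scheduler, not from user code.
    """
    if not 0 <= node_index < len(worker_hosts):
        raise ValueError("node_index must identify one of worker_hosts")
    config = {
        # Every worker lists the same addresses in the same order.
        "cluster": {"worker": [f"{host}:{port}" for host in worker_hosts]},
        # Only the task index differs from worker to worker.
        "task": {"type": "worker", "index": node_index},
    }
    return json.dumps(config)


def export_tf_config(worker_hosts, node_index, port=12345):
    """Set TF_CONFIG in this process before any strategy is instantiated."""
    os.environ["TF_CONFIG"] = build_tf_config(worker_hosts, node_index, port)
```

Each worker would call export_tf_config with the same host list and its own index, and only then construct tf.distribute.MultiWorkerMirroredStrategy().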

Parameter Server

Not yet tested. Please reach out to the Outerbounds team if you need help.

Installing TensorFlow for GPU usage in Metaflow

From the TensorFlow documentation: Do not install TensorFlow with conda. It may not have the latest stable version. pip is recommended since TensorFlow is only officially released to PyPI.

We have found the easiest way to install TensorFlow for GPU is to use the pre-made Docker image tensorflow/tensorflow:latest-gpu.

Fault Tolerance

See the TensorFlow documentation on fault tolerance. The TL;DR is to use a flavor of tf.distribute.Strategy that implements mechanisms to handle worker failures gracefully.
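
The core idea behind that kind of recovery is to back up training progress after each epoch and resume from the backup when a worker restarts. The sketch below illustrates the idea in plain Python, with no TensorFlow involved; in Keras the tf.keras.callbacks.BackupAndRestore callback plays this role. The function name, the state file layout, and the toy "loss" are all hypothetical.

```python
import json
import os


def train_with_resume(backup_dir, total_epochs, fail_at=None):
    """Run a toy training loop that resumes from the last completed epoch.

    fail_at simulates a worker failure at the start of the given epoch.
    """
    os.makedirs(backup_dir, exist_ok=True)
    state_path = os.path.join(backup_dir, "state.json")

    # Resume from the backup if a previous attempt left one behind.
    state = {"epoch": 0, "loss": None}
    if os.path.exists(state_path):
        with open(state_path) as f:
            state = json.load(f)

    for epoch in range(state["epoch"], total_epochs):
        if fail_at is not None and epoch == fail_at:
            raise RuntimeError(f"simulated worker failure at epoch {epoch}")
        # Stand-in for one epoch of real training.
        state = {"epoch": epoch + 1, "loss": 1.0 / (epoch + 1)}
        # Write to a temporary file and rename it, so a crash mid-write
        # cannot leave a corrupt backup behind.
        tmp_path = state_path + ".tmp"
        with open(tmp_path, "w") as f:
            json.dump(state, f)
        os.replace(tmp_path, state_path)
    return state
```

Calling the function once with fail_at set, then again with the same backup_dir, continues from the epoch where the first attempt stopped instead of starting over.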

License

metaflow-tensorflow is distributed under the Apache License.

