This workshop aims to teach users about Feast, an open-source feature store.
We explain concepts & best practices by example, and also showcase how to address common use cases.
Feast is an operational system for managing and serving machine learning features to models in production. It can serve features from a low-latency online store (for real-time prediction) or from an offline store (for batch scoring); a short online-retrieval sketch follows the scope notes below.
- Feast does not orchestrate data pipelines (e.g. batch / stream transformation or materialization jobs), but provides a framework to integrate with adjacent tools like dbt, Airflow, and Spark.
- Feast also does not solve other commonly faced issues like data quality, experiment management, etc.
See more details at "What Feast is not" in the Feast documentation.
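As a quick taste of the serving paths above, here is a minimal online-retrieval sketch. The feature reference (`driver_hourly_stats:conv_rate`) and entity key are illustrative placeholders, not features defined in this workshop:

```python
from feast import FeatureStore

# Assumes a feature repo with a feature_store.yaml in the working directory.
store = FeatureStore(repo_path=".")

# Low-latency lookup from the online store, e.g. inside a prediction service.
online_features = store.get_online_features(
    features=["driver_hourly_stats:conv_rate"],  # illustrative feature reference
    entity_rows=[{"driver_id": 1001}],           # illustrative entity key
).to_dict()
```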
Feast solves several common challenges teams face:
- Lack of feature reuse across teams
- Complex point-in-time-correct data joins for generating training data (sketched after this list)
- Difficulty operationalizing features for online inference while minimizing training / serving skew
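To make the point-in-time join concrete, below is a hedged sketch of generating training data with Feast; the feature view and column names are again illustrative:

```python
import pandas as pd
from feast import FeatureStore

store = FeatureStore(repo_path=".")

# Each row requests feature values "as of" its event_timestamp; the
# point-in-time join keeps feature values from after that timestamp
# from leaking into the training set.
entity_df = pd.DataFrame(
    {
        "driver_id": [1001, 1002],  # illustrative entity keys
        "event_timestamp": pd.to_datetime(["2022-05-01 10:00", "2022-05-02 11:00"]),
    }
)

training_df = store.get_historical_features(
    entity_df=entity_df,
    features=["driver_hourly_stats:conv_rate", "driver_hourly_stats:acc_rate"],
).to_df()
```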
This workshop assumes you have the following installed:
- A local development environment that supports running Jupyter notebooks (e.g. VSCode with the Jupyter plugin)
- Python 3.8+
- pip
- Docker & Docker Compose (e.g. `brew install docker docker-compose`)
- Module 0 pre-requisites:
  - Terraform (docs)
  - Either an AWS or GCP setup:
    - AWS:
      - AWS CLI
      - An AWS account set up with credentials via `aws configure` (e.g. see the AWS credentials quickstart)
    - GCP:
      - GCP account
      - `gcloud` CLI
- Module 1 pre-requisites:
  - Java 11 (for Spark, e.g. `brew install java11`)
Since we'll be learning how to leverage Feast in CI/CD, you'll also need to fork this workshop repository.
Caveats
- M1 MacBook development is untested with this flow. See also How to run / develop for Feast on M1 Macs.
- Windows development has only been tested with WSL. You will need to follow this guide to have Docker play nicely.
See also: Feast quickstart, Feast x Great Expectations tutorial
These modules are meant to be done mostly in order, with later examples building on earlier concepts.
| Time (min) | Description | Module |
| --- | --- | --- |
| 30-45 | Setting up Feast projects & CI/CD + powering batch predictions | Module 0 |
| 15-20 | Streaming ingestion & online feature retrieval with Kafka, Spark, Airflow, Redis | Module 1 |
| 10-15 | Real-time feature engineering with on demand transformations | Module 2 |
| 30 | Orchestrated batch/stream transformations using dbt + Airflow with Feast | Module 3 (Snowflake) |
| 30 | (WIP) Orchestrated batch/stream transformations using dbt + Airflow with Feast | Module 3 (Databricks) |
| 30 | Book recommender system with dbt + Airflow + Feast | Feast x Book Recommendations (on Databricks) |
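Module 2 in the table above centers on Feast's on demand transformations, which compute feature values at request time. As a rough, self-contained sketch (the decorator's exact signature varies across Feast versions, and every name below is illustrative, not from this workshop):

```python
import pandas as pd
from feast import Field, RequestSource
from feast.on_demand_feature_view import on_demand_feature_view
from feast.types import Float64, Int64

# A request-time input supplied by the caller at inference time (illustrative).
vals_to_add = RequestSource(
    name="vals_to_add",
    schema=[Field(name="val_to_add", dtype=Int64)],
)

# The transformation runs at retrieval time, on both the online and
# historical retrieval paths, rather than being precomputed.
@on_demand_feature_view(
    sources=[vals_to_add],
    schema=[Field(name="val_squared", dtype=Float64)],
)
def val_squared(inputs: pd.DataFrame) -> pd.DataFrame:
    df = pd.DataFrame()
    df["val_squared"] = inputs["val_to_add"].astype("float64") ** 2
    return df
```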