This repository demonstrates how developers can create their own custom offline store
s for Feast.
Custom offline stores allow users to use any underlying data store as their offline feature store. Features can be retrieved from the offline store for model training, and can be materialized into the online feature store for use during model inference.
Feast uses an offline store as the source of truth for features. These features can be retrieved from the offline store for model training. Typically, scalable data warehouses are used for this purpose.
Feast also materializes features from offline stores to an online store for low-latency lookup at model inference time.
Feast comes with some offline stores built in, e.g, Parquet file, Redshift and Bigquery. However, users can develop their own offline stores by creating a class that implements the contract in the OfflineStore class.
- feast_custom_offline_store/: An example of a custom offline store,
CustomFileOfflineStore
, which implements OfflineStore. This example offline store overrides the File offline store that is provided by Feast. - feature_repo/: A simple feature repository that is used to test the custom offline store. The repository has been configured to use the custom offline store as part of it's
feature_store.yaml
- test_custom_offline_store.py: A test case that uses
CustomFileOfflineStore
through thefeature_repo/
Run the following commands to test the custom offline store (FileCustomOfflineStore)
pip install -r requirements.txt
pytest test_custom_offline_store.py
It is also possible to run Feast CLI commands, which in turn will call the offline store. It may be necessary to add the
PYTHONPATH
to the path where your offline store module is located.
$ PYTHONPATH=$PYTHONPATH:/$(pwd) feast -c feature_repo apply
Registered entity driver_id
Registered feature view driver_hourly_stats
Deploying infrastructure for driver_hourly_stats
$ PYTHONPATH=$PYTHONPATH:/$(pwd) feast -c feature_repo materialize-incremental 2021-08-19T22:29:28
Materializing 1 feature views to 2021-08-19 15:29:28-07:00 into the sqlite online store.
driver_hourly_stats from 2020-08-24 20:54:03-07:00 to 2021-08-19 15:29:28-07:00:
Pulling latest features from my offline store
100%|███████████████████████████████████████████████████████████████| 5/5 [00:00<00:00, 2122.19it/s]
A subset of the Feast test suite, called "universal tests", are designed to test the core behavior of offline and online stores. A custom offline store implementation can use the universal tests as follows.
First, this repository contains Feast as a submodule. To fetch and populate the directory, run
git submodule update --init --recursive
Next, install Feast following the instructions here
cd feast
pip install -e "sdk/python[ci]"
and confirm that the Feast unit tests run as expected:
make test
The Feast universal tests can be run with the command
make test-python-universal
If the command is run immediately, the tests should fail. The tests are parametrized based on the FULL_REPO_CONFIGS
variable defined in sdk/python/tests/integration/feature_repos/repo_configuration.py
. To overwrite these configurations, you can simply create your own file that contains a FULL_REPO_CONFIGS
, and point Feast to that file by setting the environment variable FULL_REPO_CONFIGS_MODULE
to point to that file. In this repo, the file that overwrites FULL_REPO_CONFIGS
is feast_custom_offline_store/feast_tests.py
, so you would run
export FULL_REPO_CONFIGS_MODULE='feast_custom_offline_store.feast_tests'
make test-python-universal
to test the custom file offline store against the Feast universal tests. You should notice that a couple of the tests actually fail; this indicates that there is a mistake in the implementation of this offline store!