Batch-predict

Repository for batch predict

Overview

Batch predict is useful when users have a large number of instances to get predictions for, or when they do not need the prediction results in real time.

This Apache Beam-based implementation accepts several input file formats: JSON (text), TFRecord, and compressed TFRecord files. It supports JSON and CSV output formats. Batch predict supports models trained with TensorFlow (in SavedModel format), XGBoost, and scikit-learn.
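
To make the input and output formats concrete, here is a minimal sketch using only the Python standard library. It assumes that JSON (text) input means one JSON object per line, and that CSV output carries a header row; neither detail is confirmed by this README. The function names `parse_json_instances` and `format_predictions` are hypothetical and are not part of batch predict.

```python
import csv
import io
import json


def parse_json_instances(lines):
    """Parse newline-delimited JSON text into a list of instances.

    Assumption: each non-blank line holds exactly one JSON object.
    """
    return [json.loads(line) for line in lines if line.strip()]


def format_predictions(predictions, output_format="json"):
    """Serialize a list of prediction dicts as JSON lines or as CSV text."""
    if output_format == "json":
        return "\n".join(json.dumps(p, sort_keys=True) for p in predictions)
    if output_format == "csv":
        if not predictions:
            return ""
        buffer = io.StringIO()
        # Assumption: every prediction has the same keys as the first one.
        fieldnames = sorted(predictions[0])
        writer = csv.DictWriter(buffer, fieldnames=fieldnames, lineterminator="\n")
        writer.writeheader()
        writer.writerows(predictions)
        return buffer.getvalue()
    raise ValueError(f"unsupported output format: {output_format}")


instances = parse_json_instances(['{"x": 1}', "", '{"x": 2}'])
json_out = format_predictions([{"label": 1, "score": 0.5}], "json")
csv_out = format_predictions([{"label": 1, "score": 0.5}], "csv")
```

TFRecord and compressed TFRecord inputs are left out of this sketch because reading them requires TensorFlow or Beam I/O rather than the standard library.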

Today, batch predict can run on a single node in a Kubernetes (K8s) cluster using the Beam local runner. Alternatively, it can run on Google's Dataflow service using the Dataflow runner. As other Beam runners on K8s mature, we expect it to be able to run on multiple nodes in a K8s cluster.
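
The choice between the two runners can be sketched as a small helper that builds Apache Beam pipeline arguments. This assumes the local runner corresponds to Beam's `DirectRunner`, and it uses only Beam's standard pipeline options (`--runner`, `--project`, `--region`, `--temp_location`). The helper name `build_pipeline_args` is hypothetical, and the flags specific to batch predict are not listed in this README, so they are omitted.

```python
def build_pipeline_args(runner="DirectRunner", project=None, region=None,
                        temp_location=None):
    """Return a list of Beam pipeline arguments for the chosen runner.

    DirectRunner executes on the local node; DataflowRunner submits the job
    to Google's Dataflow service and needs project, region, and a temp bucket.
    """
    args = [f"--runner={runner}"]
    if runner == "DataflowRunner":
        required = {
            "project": project,
            "region": region,
            "temp_location": temp_location,
        }
        missing = [name for name, value in required.items() if not value]
        if missing:
            raise ValueError(f"DataflowRunner requires: {', '.join(missing)}")
        args += [f"--{name}={value}" for name, value in required.items()]
    return args


local_args = build_pipeline_args()
dataflow_args = build_pipeline_args(
    runner="DataflowRunner",
    project="my-project",              # placeholder project id
    region="us-central1",
    temp_location="gs://my-bucket/tmp",  # placeholder bucket
)
```

A list built this way could be passed to `apache_beam.options.pipeline_options.PipelineOptions` when constructing a pipeline.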

Batch predict also supports running on GPUs (K80/P100) in K8s, provided the cluster is configured with GPUs and the proper NVIDIA drivers are installed.
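
As an illustration of what requesting a GPU in K8s can look like, the sketch below builds a pod manifest as a Python dictionary. It assumes the cluster exposes GPUs through the NVIDIA device plugin under the resource name `nvidia.com/gpu`. The pod name, the image name, and the helper `gpu_pod_manifest` are placeholders for illustration, not values published by this repository.

```python
import json


def gpu_pod_manifest(name, image, gpu_count=1):
    """Build a Kubernetes Pod manifest that requests `gpu_count` NVIDIA GPUs."""
    return {
        "apiVersion": "v1",
        "kind": "Pod",
        "metadata": {"name": name},
        "spec": {
            "restartPolicy": "Never",
            "containers": [
                {
                    "name": name,
                    "image": image,
                    # GPUs are requested through resource limits.
                    "resources": {"limits": {"nvidia.com/gpu": gpu_count}},
                }
            ],
        },
    }


manifest = gpu_pod_manifest(
    "batch-predict-gpu",                  # placeholder pod name
    "example.com/batch-predict:latest",   # placeholder image
)
manifest_json = json.dumps(manifest, indent=2)
```

The resulting JSON can be saved to a file and applied with `kubectl apply -f`, since Kubernetes accepts manifests in JSON as well as YAML.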

yixinshi/batch-predict
