JiaoZiFlow

A flexible, extensible, and customizable pipeline.

Overview

jiaoziflow is a versatile pipeline framework designed for the concurrent preprocessing of various data types, including tables, images, videos, and text. It enables users to define and customize the behavior of each node, seamlessly integrating with Jiaozifs to unlock the potential of versioned data. jiaoziflow is built for cloud-native deployment, offering flexible scaling to handle large data volumes.

Features

Multi-type Data Support: Process table data, images, videos, and text.
Concurrent Execution: Leverage parallel processing for high efficiency and scalability.
Customizable Nodes: Users can freely define and tailor the behavior of each pipeline node.
Jiaozifs Integration: Enhanced data versioning capabilities for more robust data management.
Cloud-Native: Designed for easy deployment and scaling in cloud environments.

Requirements

Rust: Requires Rust 1.80.1 or higher. Install Rust
MongoDB: Used to store runtime data. Install MongoDB
Protobuf: Utilizes Protocol Buffers for data exchange between nodes. Install Protobuf Compiler
Kubernetes: Relies on Kubernetes for deployment and scaling. Requires K8s 1.21 or higher. Install Kubernetes
StorageClass: Require a storage class named jz-flow-fs

Quick Start

1. Build

sudo apt-get install -y protobuf-compiler pkg-config libssl-dev
git clone https://github.com/GitDataAI/jiaoziflow.git
make build-jz

2. Run Daemon

# dont specify the database; it is created dynamically.
./dist/jz-flow daemon --mongo-url mongodb://<ip>:27017

3. Run a Example Flow

./dist/jz-flow job create --name simple --path ./script/example_dag.json  # Create a job and deploy all pods

4. Monitor the Job

./dist/jz-flow job detail <job id>                                        # Monitor the job's details

Name		Name	Last commit message	Last commit date
Latest commit History 98 Commits
.github/workflows		.github/workflows
crates		crates
docs		docs
nodes		nodes
script		script
src		src
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE		LICENSE
README.md		README.md
build.rs		build.rs
makefile		makefile
requirements.txt		requirements.txt
rust-toolchain		rust-toolchain
rustfmt.toml		rustfmt.toml
setup.py		setup.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

JiaoZiFlow

Overview

Features

Requirements

Quick Start

1. Build

2. Run Daemon

3. Run a Example Flow

4. Monitor the Job

About

Releases

Packages

Languages

License

GitDataAI/jzflow

Folders and files

Latest commit

History

Repository files navigation

JiaoZiFlow

Overview

Features

Requirements

Quick Start

1. Build

2. Run Daemon

3. Run a Example Flow

4. Monitor the Job

About

Resources

License

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages