JZFS

Go implementation of JZFS, a version control file system for dataset management in the era of AI.

JZFS is a data-centric version control file system that helps ensure responsible AI engineering by improving data versioning, provenance, and reproducibility.

Note:

The name JZFS pays tribute to the world's earliest paper money: Song Dynasty JiaoZi.
JZFS is intended as another implementation of IPFS (InterPlanetary File System): it will be compatible with the IPFS implementation requirements.
JZFS is a file system for data versioning at scale. Although it is built for machine learning, it has a wide range of use scenarios (see A Universe of Uses) and can be integrated into your existing data stack.

Data-centric AI is the practice of programmatically iterating and collaborating on the data used to build AI systems. Machine learning pioneer Andrew Ng argues that focusing on the quality of the data fueling AI systems will help unlock their full power.

Features

In production systems with machine learning components, updates and experiments are frequent. New versions of models (data products) may be released every day or every few minutes, and different users may see the results of different models as part of A/B experiments or canary releases.

Version Everything: Data scientists are often criticized for being less disciplined about versioning their experiments (data, pipelines, code, and models), especially when using computational notebooks.
Track Data Provenance: This applies to every processing step in an AI/ML pipeline, including data collection and acquisition, data merging, data cleaning, feature extraction, learning, and deployment.
Reproducibility: A final question in AI/ML, often relevant for debugging, audits, and science more broadly, is to what degree data, models, and decisions can be reproduced.
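
To make the versioning and reproducibility goals above concrete, here is a minimal sketch of content-based identification of a dataset snapshot. It is an illustration only: the function name contentID and the choice of SHA-256 are assumptions for this example, not a description of how JZFS computes versions internally.

```go
package main

import (
	"crypto/sha256"
	"encoding/hex"
	"fmt"
)

// contentID returns the hex-encoded SHA-256 digest of data.
// Identical bytes always yield the same ID, so the ID can act as a
// version identifier for a dataset snapshot.
// Hypothetical helper: not part of the JZFS API.
func contentID(data []byte) string {
	sum := sha256.Sum256(data)
	return hex.EncodeToString(sum[:])
}

func main() {
	v1 := []byte("id,label\n1,cat\n2,dog\n")
	v2 := []byte("id,label\n1,cat\n2,bird\n")

	fmt.Println("v1:", contentID(v1))
	fmt.Println("v2:", contentID(v2))
	// Hashing the same bytes again reproduces the same ID.
	fmt.Println("v1 reproducible:", contentID(v1) == contentID(v1))
	// Changing a single record produces a different ID.
	fmt.Println("v1 differs from v2:", contentID(v1) != contentID(v2))
}
```

Recording such an ID next to each pipeline step is one simple way to check later whether a run used exactly the same data.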

Getting Started

Requirement

To build JZFS, you need a working installation of Go 1.22.0 or higher.
JZFS uses PostgreSQL to store its runtime data. To install it, see the PostgreSQL installation guide.

Build And Running

clone and build

git clone https://github.com/GitDataAI/jzfs.git
cd jzfs
make build

After completing the steps above, you should see an executable file named "jzfs".

initialize and run

./jzfs init --db postgres://<username>:<password>@localhost:5432/jiaozifs?sslmode=disable
./jzfs daemon

run with docker

docker run -v <data>:/app -p 34913:34913 gitdatateam/jzfs:latest --db "postgres://<user>:<password>@192.168.1.16:5432/jiaozifs?sslmode=disable" --bs_path /app/data --listen http://0.0.0.0:34913 --config /app/config.toml

Cloud

Try without installing

Note: when you create a new repository in the JZFS Console, use the following storage config for the IPFS storage backend.

 {"type":"ipfs","ipfs":{"url":"/dns/kubo-service.ipfs.svc.cluster.local/tcp/5001"}}
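
If you generate this config programmatically, the JSON above can be produced with Go's encoding/json package. This is a sketch under stated assumptions: the type names StorageConfig and IPFSConfig are hypothetical and simply mirror the fields shown above. They are not taken from the JZFS source.

```go
package main

import (
	"encoding/json"
	"fmt"
)

// IPFSConfig and StorageConfig are hypothetical types that mirror the
// JSON shown above; they are not taken from the JZFS source code.
type IPFSConfig struct {
	URL string `json:"url"`
}

type StorageConfig struct {
	Type string      `json:"type"`
	IPFS *IPFSConfig `json:"ipfs,omitempty"`
}

// ipfsStorageConfig renders the storage config for an IPFS API multiaddr.
func ipfsStorageConfig(apiAddr string) (string, error) {
	cfg := StorageConfig{Type: "ipfs", IPFS: &IPFSConfig{URL: apiAddr}}
	b, err := json.Marshal(cfg)
	if err != nil {
		return "", err
	}
	return string(b), nil
}

func main() {
	s, err := ipfsStorageConfig("/dns/kubo-service.ipfs.svc.cluster.local/tcp/5001")
	if err != nil {
		panic(err)
	}
	fmt.Println(s)
}
```

Replace the multiaddr with the address of your own IPFS API endpoint.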

Contributors

License

Dual-licensed under the MIT and Apache 2.0 licenses.

