-
Notifications
You must be signed in to change notification settings - Fork 0
Commit
This commit does not belong to any branch on this repository, and may belong to a fork outside of the repository.
- Loading branch information
Showing
1 changed file
with
64 additions
and
0 deletions.
There are no files selected for viewing
This file contains bidirectional Unicode text that may be interpreted or compiled differently than what appears below. To review, open the file in an editor that reveals hidden Unicode characters.
Learn more about bidirectional Unicode characters
Original file line number | Diff line number | Diff line change |
---|---|---|
@@ -0,0 +1,64 @@ | ||
# Barbarian Data System | ||
|
||
Barbarian is the world's best cloud-first, cloud-agnostic in-memory big data system founded on Apache Hadoop for enterprise-ready parallel distributed data processing at scale. | ||
|
||
Read more at: | ||
[https://barbarians.io/](https://barbarians.io) | ||
|
||
Docs at: | ||
[http://docs.barbarians.io/](http://docs.barbarians.io) | ||
|
||
### About Barbarian | ||
|
||
The Barbarian Data System is an in-memory, parallel, distributed (MPP) data warehousing engine designed to be deployed to Kubernetes clusters, offering Apache Hive for powerful and flexible SQL based analytics. Barbarian includes an integrated in-memory filesystem and can run in three modes of operation. | ||
* As an in-memory, standalone data warehousing system | ||
* As a data warehousing system backed by an external storage system like Amazon S3 | ||
* In a hybrid mode, where primary storage is the external storage system, with common paths mounted to the in-memory filesystem | ||
|
||
Barbarian includes compelling features including Apache Hive LLAP and Tez, with transactional tables enabled by default. | ||
|
||
Barbarian's integrated Ignite in-memory distributed parallel filesystem is resilient to node failure with replication enabled by default. | ||
|
||
Barbarian has no single points of failure. | ||
|
||
Barbarian is offered with the [Apache v2.0](https://www.apache.org/licenses/LICENSE-2.0) software license. | ||
|
||
### Installing Barbarian | ||
|
||
Barbarian can be deployed to your Kubernetes cluster with just two commands: | ||
|
||
``` | ||
helm repo add barbarians http://charts.barbarians.io/barbarian | ||
helm install barbarians/barbarian | ||
``` | ||
|
||
## Base Container image | ||
|
||
This repo contains the configuration files and build scripts for the Barbarian Hadoop Distribution **base container image**. | ||
|
||
The latest release of the base container image is based on the following Apache Foundation software releases: | ||
- Apache Hadoop 3.1.2 | ||
|
||
## Releases | ||
|
||
| Release | Notes | | ||
| -- | -- | | ||
| 0.1 | Prelease 1 | | ||
| 0.2 | Barbarian Data System r2 | | ||
| -- | -- | | ||
|
||
## Building | ||
|
||
See .codefresh | ||
|
||
## Running | ||
|
||
This image is designed to be run as a part of the Barbarian Hadoop distribution - a Kubernetes based platform for data processing at scale, founded on free software developed by the [Apache Software Foundation](https://www.apache.org/). | ||
|
||
## Features | ||
|
||
The image includes support for the following features: | ||
- Hadoop YARN | ||
- YARN RM HA | ||
- YARN Services | ||
- Ignite IGFS integration |