Skip to content

Commit

Permalink
Create README.md
Browse files Browse the repository at this point in the history
  • Loading branch information
grobbie authored Oct 17, 2018
1 parent 71f8cec commit 8c4490f
Showing 1 changed file with 64 additions and 0 deletions.
64 changes: 64 additions & 0 deletions README.md
Original file line number Diff line number Diff line change
@@ -0,0 +1,64 @@
# Barbarian Data System

Barbarian is the world's best cloud-first, cloud-agnostic in-memory big data system founded on Apache Hadoop for enterprise-ready parallel distributed data processing at scale.

Read more at:
[https://barbarians.io/](https://barbarians.io)

Docs at:
[http://docs.barbarians.io/](http://docs.barbarians.io)

### About Barbarian

The Barbarian Data System is an in-memory, parallel, distributed (MPP) data warehousing engine designed to be deployed to Kubernetes clusters, offering Apache Hive for powerful and flexible SQL based analytics. Barbarian includes an integrated in-memory filesystem and can run in three modes of operation.
* As an in-memory, standalone data warehousing system
* As a data warehousing system backed by an external storage system like Amazon S3
* In a hybrid mode, where primary storage is the external storage system, with common paths mounted to the in-memory filesystem

Barbarian includes compelling features including Apache Hive LLAP and Tez, with transactional tables enabled by default.

Barbarian's integrated Ignite in-memory distributed parallel filesystem is resilient to node failure with replication enabled by default.

Barbarian has no single points of failure.

Barbarian is offered with the [Apache v2.0](https://www.apache.org/licenses/LICENSE-2.0) software license.

### Installing Barbarian

Barbarian can be deployed to your Kubernetes cluster with just two commands:

```
helm repo add barbarians http://charts.barbarians.io/barbarian
helm install barbarians/barbarian
```

## Base Container image

This repo contains the configuration files and build scripts for the Barbarian Hadoop Distribution **base container image**.

The latest release of the base container image is based on the following Apache Foundation software releases:
- Apache Hadoop 3.1.2

## Releases

| Release | Notes |
| -- | -- |
| 0.1 | Prelease 1 |
| 0.2 | Barbarian Data System r2 |
| -- | -- |

## Building

See .codefresh

## Running

This image is designed to be run as a part of the Barbarian Hadoop distribution - a Kubernetes based platform for data processing at scale, founded on free software developed by the [Apache Software Foundation](https://www.apache.org/).

## Features

The image includes support for the following features:
- Hadoop YARN
- YARN RM HA
- YARN Services
- Ignite IGFS integration

0 comments on commit 8c4490f

Please sign in to comment.