Kudu is an open source storage engine for structured data which supports low-latency random access together with efficient analytical access patterns. Kudu distributes data using horizontal partitioning and replicates each partition using Raft consensus, providing low mean-time-to-recovery and low tail latencies. Kudu is designed within the context of the Hadoop ecosystem and supports many modes of access via tools such as Apache Impala (incubating), Apache Spark, and MapReduce.
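
```shell
# Quick start: bring up the cluster with Docker Compose
docker-compose up -d
# Check tablet server status (container name assumes the default Compose project name "kududocker")
docker run --rm -it --link kududocker_kudu-tserver_1:kudu_tserver -e KUDU_TSERVER=kudu_tserver kunickiaj/kudu cli status

# Pull the prebuilt image, or build it from this repository
docker pull kunickiaj/kudu
docker build -t kunickiaj/kudu .

# Create data volume containers for the master and the tablet server
docker create --name kudu-master-data -v /var/lib/kudu/master kunickiaj/kudu
docker create --name kudu-tserver-data -v /var/lib/kudu/tserver kunickiaj/kudu

# Start the master, then a tablet server linked to it
docker run -d --name kudu-master -p 8051:8051 kunickiaj/kudu master
docker run -d --name kudu-tserver -p 8050:8050 --link kudu-master \
  -e KUDU_MASTER=kudu-master kunickiaj/kudu tserver

# Follow the logs of each component
docker logs -f kudu-master
docker logs -f kudu-tserver

# Query the tablet server status
docker run --rm -it --link kudu-tserver -e KUDU_TSERVER=kudu-tserver kunickiaj/kudu kudu tserver status kudu-tserver
```
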
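
The containers may need a few seconds after `docker run` before they respond. As a minimal sketch, assuming `curl` is installed and the master's web UI port 8051 is published as in the commands above, a small polling helper can wait for the UI to answer before you run status commands. `wait_for_url` is a hypothetical helper for illustration, not something shipped with the image.

```shell
# Hypothetical helper (not part of the image): poll a URL once per second
# until it answers with a successful HTTP status or the attempts run out.
wait_for_url() {
  url="$1"
  tries="${2:-30}"   # number of attempts; defaults to 30
  i=0
  while [ "$i" -lt "$tries" ]; do
    # -f: treat HTTP error statuses as failure, -s: no progress output
    if curl -fs -o /dev/null "$url"; then
      return 0
    fi
    i=$((i + 1))
    sleep 1
  done
  return 1
}
```

For example, `wait_for_url http://localhost:8051 60` waits up to roughly a minute for the master; replace `localhost` with the address of your Docker host if it differs.
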
Each component provides its own web UI. Open your browser at one of the URLs below, where `dockerhost` is the name or IP address of the host running the Docker daemon. On Linux, this is the IP of your Linux box. On OS X or Windows (via Docker Machine), you can find your Docker host's IP by running `docker-machine ip default`.
Component | URL |
---|---|
Master | http://dockerhost:8051 |
TabletServer | http://dockerhost:8050 |
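
To avoid typing the host by hand, the URLs in the table can be generated from the Docker host address. This is only a sketch: `kudu_ui_urls` is a hypothetical helper name, and the ports are the ones published by the `docker run` commands in this README.

```shell
# Hypothetical helper: print the web UI URL of each component for a given
# Docker host (defaults to localhost). Ports match the table above.
kudu_ui_urls() {
  host="${1:-localhost}"
  printf 'Master http://%s:8051\n' "$host"
  printf 'TabletServer http://%s:8050\n' "$host"
}
```

With Docker Machine, this could be called as `kudu_ui_urls "$(docker-machine ip default)"`; on Linux, calling `kudu_ui_urls` with no argument prints the `localhost` URLs.
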
- This Docker image (and README) was inspired by https://github.com/bigdatafoundation/docker-kudu
- https://github.com/cloudera/kudu-examples/wiki/Docker-based-tutorial