Ambrose

Twitter Ambrose is a platform for visualization and real-time monitoring of MapReduce data workflows. It presents a global view of all the map-reduce jobs derived from your workflow after planning and optimization. As jobs are submitted for execution on your Hadoop cluster, Ambrose updates its visualization to reflect the latest job status, polled from your process.

Ambrose provides the following in a web UI:

A chord diagram to visualize job dependencies and current state
A table view of all the associated jobs, along with their current state
A highlight view of the currently running jobs
An overall script progress bar

Ambrose is built using the following front-end technologies:

d3.js - For chord diagram visualization
Bootstrap - For layout and CSS support

Ambrose is designed to support any Hadoop workflow runtime, but current support is limited to Apache Pig.

Supported runtimes

Pig - See pig/README.md
Cascading - future work
Scalding - future work
Cascalog - future work
Hive - future work

Examples

Below is a screenshot of the Ambrose UI. Each arc segment on the circle represents a map-reduce job. Dependencies between jobs are represented by chords which connect job arc segments. Grey jobs have not yet run, bright green jobs are running and light green jobs are completed.

Note that each job arc is bisected: chords on one half of the arc connect to predecessor jobs, while chords on the other half connect to successor jobs. For example, in the diagram below, Jobs 10 and 13 have no predecessors, and Jobs 8 and 18 are the final jobs in the Pig workflow.
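To make the predecessor/successor reading concrete, here is a minimal sketch that finds jobs without predecessors and final jobs from a list of dependencies. The edge list and the function names root_jobs and final_jobs are hypothetical, chosen for illustration; they are not part of Ambrose and do not reproduce the workflow in the screenshot.

```shell
# Hypothetical dependency list, one "predecessor successor" pair per line.
edges='1 3
2 3
3 4
3 5'

# Jobs that never appear as a successor have no predecessors.
root_jobs() {
  printf '%s\n' "$1" |
    awk '{ pred[$1]; succ[$2] } END { for (j in pred) if (!(j in succ)) print j }' |
    sort -n
}

# Jobs that never appear as a predecessor are the final jobs of the workflow.
final_jobs() {
  printf '%s\n' "$1" |
    awk '{ pred[$1]; succ[$2] } END { for (j in succ) if (!(j in pred)) print j }' |
    sort -n
}
```

Calling root_jobs "$edges" lists the jobs whose arcs would have chords only on the successor half, and final_jobs "$edges" lists those with chords only on the predecessor half.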

Note that the chord diagram shown is our first pass at visualizing the workflow, and there's room for improvement. We'd like to support other visualizations as well, like a graph of the workflow DAG. If you develop an improved visualization, be sure to send us a pull request!

Quickstart

To get started with Ambrose, first clone the Ambrose GitHub repository:

git clone https://github.com/twitter/ambrose.git
cd ambrose

Next, try running the Ambrose demo on your local machine. The ambrose-demo script starts a local instance of the Ambrose app server with sample data. Start the demo Ambrose server with the following command, then browse to http://localhost:8080/web/index.html?localdata=small:

./bin/ambrose-demo

Finally, you can run Ambrose with an actual Pig script. To do so, you'll need to build the Ambrose distribution and untar it:

./bin/ambrose-package
VERSION=0.1.0-SNAPSHOT
tar zxvf ambrose-$VERSION.tar.gz

You can then run the following commands to execute path/to/my/script.pig with an Ambrose app server embedded in the Pig client:

cd ambrose-$VERSION
./bin/pig-ambrose -f path/to/my/script.pig

Now, browse to http://localhost:8080/web/ to see the progress of your script in the Ambrose UI. To override the default port, export AMBROSE_PORT before invoking pig-ambrose:

export AMBROSE_PORT=4567
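As a small sketch of how the override affects the address you browse to, the helper below prints the UI URL for the current environment. It assumes the default port is 8080, as in the URL above; ambrose_ui_url is a hypothetical name and not a script shipped with Ambrose.

```shell
# Hypothetical helper: print the Ambrose UI URL for the current shell.
# Uses AMBROSE_PORT when set, otherwise the assumed default of 8080.
ambrose_ui_url() {
  printf 'http://localhost:%s/web/\n' "${AMBROSE_PORT:-8080}"
}
```

With AMBROSE_PORT=4567 exported as above, ambrose_ui_url prints the URL on port 4567 rather than 8080.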

Maven repository

An initial release will be pushed to Maven shortly.

How to contribute

Bug fixes, features, and documentation improvements are welcome! Please fork the project and send us a pull request on GitHub. You can submit issues on GitHub as well.

Here are some high-level goals we'd love to see contributions for:

Improve the front-end client
Add other visualization options, like a DAG view
Create a new back-end for a different runtime environment
Create a standalone Ambrose server that's not embedded in the workflow client

Versioning

For transparency and insight into our release cycle, releases will be numbered with the following format:

<major>.<minor>.<patch>

And constructed with the following guidelines:

Breaking backwards compatibility bumps the major
New additions without breaking backwards compatibility bump the minor
Bug fixes and misc changes bump the patch

For more information on semantic versioning, please visit http://semver.org/.
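The guidelines above can be sketched as a small bash function. This is an illustration only: bump_version is a hypothetical helper, not part of the Ambrose build. It drops any suffix such as -SNAPSHOT and, following the semantic versioning convention, resets the lower components to zero when a higher one is bumped.

```shell
# Hypothetical helper: bump one component of a <major>.<minor>.<patch> version.
# Usage: bump_version VERSION major|minor|patch
bump_version() {
  local core major minor patch rest
  core=${1%%-*}        # drop a suffix such as -SNAPSHOT
  major=${core%%.*}    # text before the first dot
  rest=${core#*.}      # text after the first dot
  minor=${rest%%.*}
  patch=${rest#*.}
  case $2 in
    major) major=$((major + 1)); minor=0; patch=0 ;;  # breaking change
    minor) minor=$((minor + 1)); patch=0 ;;           # compatible addition
    patch) patch=$((patch + 1)) ;;                    # bug fix or misc change
    *) echo "unknown component: $2" >&2; return 1 ;;
  esac
  echo "$major.$minor.$patch"
}
```

For example, bump_version 0.1.0-SNAPSHOT minor would be the call for a backwards-compatible addition on top of the version used in the Quickstart.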

Authors

Bill Graham (@billgraham)
Andy Schlaikjer (@sagemintblue)

License

Licensed under the Apache License, Version 2.0: http://www.apache.org/licenses/LICENSE-2.0
