Skip to content
This repository has been archived by the owner on Aug 10, 2019. It is now read-only.

Latest commit

 

History

History
33 lines (20 loc) · 4.2 KB

getting-started-aws.md

File metadata and controls

33 lines (20 loc) · 4.2 KB

Getting Started on AWS

To run Fluent in cluster mode (i.e., on multiple nodes and with autoscaling enabled), you will need to have an AWS account. Fluent depends on Kubernetes to spin up clusters. This tutorial will walk you through setting up the required dependencies to run the Kubernetes CLI (kubectl) and kops (a tool to create & manage clusters on public cloud providers).

Prerequisites

We assume you are running inside an EC2 linux VM on AWS, where you have Python3 installed (preferably Python3.6 or later -- we haven't tested with previous versions).

Step 1: Installing kubectl, kops, & friends on your VM

  • Install the kubectl binary using the Kubernetes documentation, found here. Don't worry about setting up your kubectl configuration yet.
  • Install kops binary -- documentation found here
  • Install a variety of Python dependencies: pip3 install awscli boto3 kubernetes1

Step 2: Configuring kops

  • kops requires an IAM group and user with permissions to access EC2, Route53, etc. You can find the commands to create these permissions here. Make sure that you capture the Access Key ID and Secret Access Key for the kops IAM user and both set them as environmnent variables and pass them into aws configure, as described in the above link.
  • kops also requires an S3 bucket for state storage. More information about configuring this bucket can be found here
  • Finally, in order to access the cluster, you will need a domain name2 to point to. Currently, we have only tested our setup scripts with domain names registered in Route53. kops supports a variety of DNS settings, which you can find more information about here.

Step 3: Odds and ends

  • Our cluster creation scripts depend on two environment variables FLUENT_CLUSTER_NAME and FLUENT_STATE_STORE. Set the FLUENT_CLUSTER_NAME variable to the name of the Route53 domain that you're using (see Footnote 2 if you are not using a Route53 domain -- you will need to modify the cluster creation scripts). Set the KOPS_STATE_STORE variable to the S3 URL of S3 bucket you created in Step 2 (e.g., s3://fluent-kops-state-store).
  • As described in Footnote 1, make sure that your $PATH variable includes the path to the aws CLI tool. You can check if its on your path by running which aws -- if you see a valid path, then you're set.
  • As descried in Step 2, make sure you have run aws configure and set your region (by default, we use us-east-1) and the access key parameters for the kops user created in Step 2.

Step 4: Creating your first cluster

You're now ready to create your first cluster. To start off, we'll create a tiny cluster, with one memory tier node and one routing node. From the k8s/ directory, run ./create_cluster.py 1 0 1 1 1 0. This will take about 10-15 minutes to run. Once it's finished, you will see the URL of an AWS ELB, which you can pass into our Python client to interact with the KVS.

1 By default, the AWS CLI tool installs in ~/.local/bin on Ubuntu. You will have to add this directory to your $PATH.

2 You can also run in local mode, where you set the FLUENT_CLUSTER_NAME environment variable to {clustername}.k8s.local. This setting doesn't require a domain name -- however, this mode limits cluster size because it only runs in mesh networking mode (which only allows up to 64 nodes, from what we can tell), and requires modifying our existing cluster creation scripts. We don't have documentation written up for this as its not a use case we intend to support, but you can either open an issue or send us an email if you're interested in this.