To run Fluent in cluster mode (i.e., on multiple nodes and with autoscaling enabled), you will need to have an AWS account. Fluent depends on Kubernetes to spin up clusters. This tutorial will walk you through setting up the required dependencies to run the Kubernetes CLI (kubectl
) and kops (a tool to create & manage clusters on public cloud providers).
We assume you are running inside an EC2 linux VM on AWS, where you have Python3 installed (preferably Python3.6 or later -- we haven't tested with previous versions).
- Install the
kubectl
binary using the Kubernetes documentation, found here. Don't worry about setting up your kubectl configuration yet. - Install
kops
binary -- documentation found here - Install a variety of Python dependencies:
pip3 install awscli boto3 kubernetes
1
kops
requires an IAM group and user with permissions to access EC2, Route53, etc. You can find the commands to create these permissions here. Make sure that you capture the Access Key ID and Secret Access Key for the kops IAM user and both set them as environmnent variables and pass them intoaws configure
, as described in the above link.kops
also requires an S3 bucket for state storage. More information about configuring this bucket can be found here- Finally, in order to access the cluster, you will need a domain name2 to point to. Currently, we have only tested our setup scripts with domain names registered in Route53.
kops
supports a variety of DNS settings, which you can find more information about here.
- Our cluster creation scripts depend on two environment variables
FLUENT_CLUSTER_NAME
andFLUENT_STATE_STORE
. Set theFLUENT_CLUSTER_NAME
variable to the name of the Route53 domain that you're using (see Footnote 2 if you are not using a Route53 domain -- you will need to modify the cluster creation scripts). Set theKOPS_STATE_STORE
variable to the S3 URL of S3 bucket you created in Step 2 (e.g.,s3://fluent-kops-state-store
). - As described in Footnote 1, make sure that your
$PATH
variable includes the path to theaws
CLI tool. You can check if its on your path by runningwhich aws
-- if you see a valid path, then you're set. - As descried in Step 2, make sure you have run
aws configure
and set your region (by default, we useus-east-1
) and the access key parameters for the kops user created in Step 2.
You're now ready to create your first cluster. To start off, we'll create a tiny cluster, with one memory tier node and one routing node. From the k8s/
directory, run ./create_cluster.py 1 0 1 1 1 0
. This will take about 10-15 minutes to run. Once it's finished, you will see the URL of an AWS ELB, which you can pass into our Python client to interact with the KVS.
1 By default, the AWS CLI tool installs in ~/.local/bin
on Ubuntu. You will have to add this directory to your $PATH
.
2 You can also run in local mode, where you set the FLUENT_CLUSTER_NAME
environment variable to {clustername}.k8s.local
. This setting doesn't require a domain name -- however, this mode limits cluster size because it only runs in mesh networking mode (which only allows up to 64 nodes, from what we can tell), and requires modifying our existing cluster creation scripts. We don't have documentation written up for this as its not a use case we intend to support, but you can either open an issue or send us an email if you're interested in this.