Skip to content

Self-service management of complex Cloud Environments

License

Notifications You must be signed in to change notification settings

cloudknit-io/cloudknit

Repository files navigation

CloudKnit: An Open Source Solution for Managing Cloud Environments

CloudKnit is an open-source progressive delivery platform for managing cloud environments. It enables organizations to Define entire environments in a declarative way, Provision them, Detect and Reconcile Drift, and Teardown environments when no longer needed. It also comes with dashboards to help visualize environments and observe them.

CloudKnit is based on a concept called Environment as Code. Some people have started calling it Declarative Pipelines.

Note: We are not a big fan of using Pipeline and Declarative together as Pipeline to us means a sequence of steps which conflicts with what Declarative means.

Environment as Code (EaC) is an abstraction over cloud-native tools that provides a declarative way of defining entire environments. It has a Control Plane that manages the state of the environment, including resource dependencies, and drift detection and reconciliation.

Where CloudKnit connects with existing tools

Diagram 1: Where does CloudKnit fit in with existing tools

Table of Contents

Why we built CloudKnit

Existing automation tools like Terraform, Pulumi, and Helm allow us to automate the provisioning of cloud environments, but as those environments become more complex and teams look for advanced use cases, existing tools fall short. These tools are great at managing individual components within an environment (like networking or RDS), but engineering teams need an entire environment with various components like the one shown below [See diagram 2] to run their business applications.

This causes teams to do one of the following:

  • Hand-roll complex pipelines: Pipeline code is imperative & needs to manage the logic to provision the various components in the correct order, handle failures and tear down unused resources. We have seen teams write hundreds of lines of unmaintainable pipeline code. This causes a maintenance nightmare.
  • Build in-house solution on top of automation tools: Companies spend a lot of time and money managing in-house solutions instead of building business features.

CloudKnit makes it easy for Engineering teams to manage complex environments and provides out-of-the-box solution for use cases like ephemeral environments, environment blueprints, cloning environments, promoting changes across environments, and more.

Example Environment

Diagram 2: Example Environment

Other Challenges

There are other challenges that teams face as their environments become more complex.

  • Environment Replication is a pain
  • Not easy to Visualize/Understand Environments
  • Drift Detection for the entire environment is difficult
  • Not straightforward to Promote changes across environments

Demo

Please see CloudKnit demo below:

How does CloudKnit work?

CloudKnit

Diagram 3: CloudKnit

Environment management with CloudKnit is divided into 4 stages:

1. Define

This stage allows you to define an entire environment. We currently support easy to use YAML format for the environment definition.

See example below:

Environment Definition
apiVersion: stable.cloudknit.io/v1
kind: Environment
metadata:
  name: zmart-payment-prod-blue
  namespace: zmart-config
spec:
  teamName: payment
  envName: prod-blue
  teardown: false
  autoApprove: false
  components:

    - name: networking
      type: terraform
      autoApprove: true
      module:
        source: [email protected]:terraform-aws-modules/terraform-aws-vpc.git
      variablesFile:
        path: "prod-blue/vars/networking.tfvars"
      outputs:
        - name: vpc_id

    - name: platform-eks
      type: terraform
      dependsOn: [networking]
      module:
        source: [email protected]:terraform-aws-modules/terraform-aws-eks.git
      variables:
        - name: vpc_id
          valueFrom: networking.vpc_id
      variablesFile:
        path: "prod-blue/vars/platform-eks.tfvars"

    - name: website
      type: helm #Native support for helm charts coming soon
      dependsOn: [platform-eks]
      source:
        repo: [email protected]:helm/examples.git
        path: charts/hello-world
      variables:
        - name: environment
          value: prod-blue

2. Provision

CloudKnit Control Plane running in Kubernetes uses the Environment definition & runs various Components (Terraform, Helm Charts, etc.) in the right order. It also provides Visibility & Workflow while Provisioning the environment.

3. Detect Drift + Reconciliation

Like Kubernetes does drift detection for k8s apps & reconciles them to match the desired state in source control, CloudKnit does drift detection for the entire environment (infra + apps) & reconciles them.

Note: In case of Infrastructure as Code (IaC), CloudKnit provides an ability to see the plan & get manual approval before running the IaC to make sure it doesn't destroy any resources you don't want to, especially in Production environments.

4. Teardown

You might want to teardown environments when they are not used to save costs. CloudKnit provides a single line change using flag teardown in the Environment YAML. Once teardown flag is set to true and definition is pushed to Source Control, CloudKnit picks up the change and tears the environment down by destroying individual components in the correct order.

Environment Visibility & Workflow

CloudKnit also provides visibility into your environments and an optimal GitOps workflow with useful information on the UI like estimated costs/status etc. Check diagram 4 below for an example environment in CloudKnit UI.

Environment Visibility

Diagram 4: Environment Visibility

Conclusion

We hope that by open-sourcing CloudKnit early, we can form a close-knit open-source community around it to make managing cloud environments easy.

For a deeper dive into CloudKnit, see the architectural overview, our documentation, and the GitHub repo.

Terminologies

Components: A logical grouping of 1 or more Infrastructure Resources or Applications that get provisioned together. For example, Networking is an Infrastructure Component with various Infrastructure resources like Virtual Private Cloud(VPC), Subnets, Internet Gateways, Route Tables, etc.

Environment: A logical grouping of all the Components needed to run business applications. The grouping includes components like networking, eks, database, k8s apps, etc.