An open source tool to autoscale Memorystore Cluster instances
Home · Poller component · Scaler component · Forwarder component · Terraform configuration · Monitoring
The Cloud Memorystore Cluster Autoscaler is a companion tool that allows you to automatically increase or reduce the number of nodes/shards in one or more Memorystore Cluster instances, based on their utilization.
When you create a Memorystore Cluster instance, you choose the number of shards/nodes that provide compute resources for the instance.
The Autoscaler monitors your instances and automatically adds or removes capacity to ensure that memory utilization, CPU utilization, and other metrics remain within recommended limits.
If you would like to get started quickly with a test deployment of the Autoscaler, you can deploy to Cloud Run functions in a single project.
The diagram above shows the high-level components of the Autoscaler and the interaction flow:
- The Autoscaler consists of two main decoupled components, the Poller component and the Scaler component. These can be deployed to Cloud Run functions and configured so that the Autoscaler runs according to a user-defined schedule.
- At the specified time and frequency, the Poller component queries the Cloud Monitoring API to retrieve the utilization metrics for each Memorystore Cluster instance.
- For each instance, the Poller component pushes one message to the Scaler component. The payload contains the utilization metrics for that specific Memorystore Cluster instance and some of its corresponding configuration parameters.
- Using the chosen scaling method, the Scaler component compares the instance metrics against the recommended thresholds (plus or minus an allowed margin) and determines whether the instance should be scaled, and the number of shards/nodes it should be scaled to. If the configured cooldown period has passed, the Scaler component requests the cluster to scale out or in.
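The decision logic described above can be sketched as follows. This is a minimal illustration, not the tool's actual code: the function names, the way the margin is applied around each threshold, and the cooldown check are all assumptions made for the sketch.

```python
# Illustrative sketch of a threshold-with-margin scaling decision plus a
# cooldown check. Names and structure are assumptions, not the Autoscaler's
# real implementation.
from dataclasses import dataclass
import time


@dataclass
class Thresholds:
    high: float    # scale out when the metric exceeds this (percent)
    low: float     # scale in when the metric falls below this (percent)
    margin: float  # allowed margin around each threshold


def decide(metric: float, t: Thresholds) -> str:
    """Return 'OUT', 'IN', or 'NONE' for a single utilization metric."""
    if metric > t.high + t.margin:
        return "OUT"
    if metric < t.low - t.margin:
        return "IN"
    return "NONE"


def maybe_scale(metric: float, t: Thresholds,
                last_scale_ts: float, cooldown_s: float) -> str:
    """Suppress any scaling action until the cooldown period has passed."""
    direction = decide(metric, t)
    if direction != "NONE" and time.time() - last_scale_ts < cooldown_s:
        return "NONE"  # still cooling down from the previous scaling event
    return direction
```

For example, with thresholds of 80/30 and a margin of 5, a CPU reading of 95% yields `OUT`, 20% yields `IN`, and anything between 25% and 85% yields `NONE`.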
Throughout the flow, the Autoscaler writes a step by step summary of its recommendations and actions to Cloud Logging for tracking and auditing.
To deploy the Autoscaler, decide which of the following strategies best suits your technical and operational needs:
In both of the above cases, the Google Cloud Platform resources are deployed using Terraform. Please see the Terraform instructions for more information on the deployment options available.
You can find some recommendations for productionizing deployment of the Autoscaler in the Productionization section of the Terraform documentation.
The parameters for configuring the Autoscaler are identical regardless of the chosen deployment type, but the mechanism for configuration differs slightly:
In the case of the Cloud Run functions deployment, the parameters are defined in the JSON payload of the Pub/Sub message published by the Cloud Scheduler job.
In the case of the Kubernetes deployment, the parameters are defined using a Kubernetes ConfigMap that is loaded by the Cron job.
You can find the details about the parameters and their default values in the Poller component page.
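As an illustration only, such a JSON payload might resemble the fragment below. The field names and values here are assumptions made for the sketch; the Poller component page is the authoritative reference for the actual parameters and their defaults.

```json
[
  {
    "projectId": "my-project",
    "regionId": "us-central1",
    "clusterId": "my-memorystore-cluster",
    "units": "SHARDS",
    "minSize": 3,
    "maxSize": 10
  }
]
```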
There is also a browser-based configuration file editor and a command line configuration file validator.
Copyright 2024 Google LLC
Licensed under the Apache License, Version 2.0 (the "License");
you may not use this file except in compliance with the License.
You may obtain a copy of the License at
https://www.apache.org/licenses/LICENSE-2.0
Unless required by applicable law or agreed to in writing, software
distributed under the License is distributed on an "AS IS" BASIS,
WITHOUT WARRANTIES OR CONDITIONS OF ANY KIND, either express or implied.
See the License for the specific language governing permissions and
limitations under the License.
The Autoscaler project is based on open source contributions (see Contributing).
Please note that this is not an officially supported Google product.