Click the Launch Stack button for your desired region below to deploy the cloud resources on Amazon® Web Services (AWS®). This opens the AWS console in your web browser.
After you click the Launch Stack button above, the "Quick create stack" page opens in your browser where you can configure the parameters. It is easier to complete the steps if you position these instructions and the AWS CloudFormation console window side by side.
-
Specify a stack name. The name must be unique within your AWS account in the region you want to deploy the stack.
-
Specify and check the defaults for these resource parameters:
Parameter label | Description |
---|---|
Container image | Specify the Docker image to use to execute jobs. This parameter is prepopulated with the default MathWorks Docker image for MATLAB Parallel Server with AWS Batch. Alternatively, specify a container repository in the form repository-url/image:tag. |
Instance types | Specify the Amazon EC2 instance types allowed as a comma-delimited list. You can specify instance families (for example, 'c4' or 'p3') or specific sizes within a family (for example, 'c4.8xlarge'). Use 'optimal' to pick instance types on the fly that match the demands of your job queue. For more information on instance types, see https://aws.amazon.com/ec2/instance-types/. |
Maximum number of MATLAB workers | Specify the maximum number of MATLAB workers that the cluster can run concurrently. This number must be less than or equal to the number of workers that your MATLAB Parallel Server license supports. |
EC2 pricing model | Choose either On-Demand or Spot Pricing for the EC2 instances provisioned by AWS Batch. For more information on pricing, see https://aws.amazon.com/ec2/pricing. |
Number of GPUs per MATLAB worker | Specify the number of GPUs for each MATLAB worker to use. If you specify a value greater than 0, you must specify at least one GPU-based instance type supported by AWS Batch in the Instance Types field. For a list of supported GPU-based instance types, see https://docs.aws.amazon.com/batch/latest/userguide/gpu-jobs.html. |
Number of vCPUs per MATLAB worker | Specify the number of vCPUs for each MATLAB worker to use. Two vCPUs per worker is recommended. |
Memory per MATLAB worker | Specify the amount of memory in MiB that each MATLAB worker can use. A minimum of 2000 MiB is required. |
Subnets | Select IDs of existing subnets for the EC2 instances provisioned by AWS Batch. All subnets must exist in the VPC you have selected. It is recommended that you select a subnet for each availability zone in the VPC you have selected. |
Spot bid price | Specify the maximum percentage of On-Demand pricing you want to pay for Spot resources. This parameter is ignored if you select the 'On Demand' EC2 pricing model. |
VPC | Select the ID of an existing VPC in which to deploy this stack. |
Table 1: Stack input parameters
- The CloudFormation template in this repository uses nested stacks to reference templates from the MathWorks Infrastructure as Code Building Blocks repository. Select the checkboxes to accept that the CloudFormation template creates Identity and Access Management (IAM) roles in your account and autoexpands nested stacks when creating the stack. The template uses nested stacks to reference templates from the MathWorks Infrastructure as Code Building Blocks repository.
The IAM roles provide the following permissions:
- Allow AWS Batch to make calls to other AWS services in order to run jobs.
- Allow containers launched by AWS Batch to access Amazon S3™ and Amazon Elastic Container Service.
- Allow a custom lambda function to run that calculates the maximum number of vCPUs for the compute environment.
- Allow a custom lambda function to run that empties the S3 bucket when the stack is deleted.
-
Click the Create stack button. When you click Create stack, the cluster is created using AWS CloudFormation templates. It can take up to five minutes to create the cluster.
-
When the stack status has reached CREATE_COMPLETE, select Outputs. Figure 1 shows an example of the Outputs view.
Identify the values of the outputs with keys 'IndependentJobDefinition', 'JobQueue', 'NumWorkers', and 'S3Bucket' for use in Step 3. If you are an administrator, pass this information to your end users.
Figure 1: Stack outputs on completion
If you are an administrator and do not plan to submit jobs to the cluster, you can skip the following steps.
-
Install the AWS Command Line Interface tool on the client machine.
-
Configure your client machine with your AWS credentials using the AWS Command Line Interface tool. Alternatively, you can set up your credentials by setting the environment variables listed in Table 2 on the client machine. If you do not know your AWS credentials, contact your administrator.
Environment variable Description AWS_ACCESS_KEY_ID Specifies an AWS access key associated with an IAM user or role. AWS_SECRET_ACCESS_KEY Specifies the secret key associated with the access key. This is essentially the "password" for the access key. AWS_SESSION_TOKEN Specifies the session token value. Required if you are using temporary security credentials. AWS_DEFAULT_REGION Specifies the AWS Region to send the request to. The value of this environment variable is typically determined automatically but you may wish to set it manually. Table 2: Environment variables for specifying AWS credentials
You can set the environment variables in your current MATLAB® session using
setenv
as follows:setenv('AWS_ACCESS_KEY_ID', 'YOUR_AWS_ACCESS_KEY_ID'); setenv('AWS_SECRET_ACCESS_KEY', 'YOUR_AWS_SECRET_ACCESS_KEY'); setenv('AWS_SESSION_TOKEN', 'YOUR_AWS_SESSION_TOKEN'); setenv('AWS_DEFAULT_REGION', 'YOUR_AWS_DEFAULT_REGION');
-
Download or clone the Parallel Computing Toolbox plugin for MATLAB Parallel Server with AWS Batch repository on GitHub. You can also download this plugin via the Add-On Manager in MATLAB.
-
Once you download the plugin files, create a cluster profile by using either the Cluster Profile Manager or the MATLAB command line. To open the Cluster Profile Manager in MATLAB, on the Home tab, in the Environment section, select Parallel > Create and Manage Clusters. Within the Cluster Profile Manager, select Add Cluster Profile > Generic from the menu to create a new Generic cluster profile.
Alternatively, for the command line workflow, create a new Generic cluster object by running:
c = parallel.cluster.Generic;
-
To configure the newly created
Generic
profile, set the properties given in the table below:Property Value JobStorageLocation Path where job data is to be stored on your machine NumWorkers Number of workers used to configure the stack in Step 2 ClusterMatlabRoot '/usr/local/matlab' OperatingSystem 'unix' HasSharedFilesystem false PluginScriptsLocation Full path to the folder containing the Parallel Computing Toolbox plugin files downloaded earlier Table 3: Required properties to configure a cluster profile
In the Cluster Profile Manager, set each property value in the boxes provided. Alternatively, at the command line, set each property on the cluster object using dot notation:
c.JobStorageLocation = 'C:\MatlabJobs'; % etc.
Table 3 gives the minimum properties required for
Generic
to work correctly. For a full list of cluster properties, see the documentation forparallel.Cluster
. -
Once the minimum required properties are set for the
Generic
cluster, configure the following additional properties specific to the AWS Batch cluster stack created in Step 2:Stack output key Description IndependentJobDefinition AWS Batch job definition for independent jobs JobQueue Job queue of the AWS Batch cluster S3Bucket S3 Bucket for data transfer between client and worker MATLAB engines Table 4: AdditionalProperties field descriptions and corresponding stack output keys
Use the stack's outputs to obtain the above values. If you are an end user, contact your administrator for this information. Otherwise, obtain the information from the Outputs view of the stack in the CloudFormation console, as described in Step 2.
To configure these properties, in the Cluster Profile Manager, add new
AdditionalProperties
by clicking Add under the table ofAdditionalProperties
and populate the key-value pairs according to the values given in Table 4. Alternatively, for the command line workflow, use dot notation to add these additional properties:c.AdditionalProperties.IndependentJobDefinition = '<Job definition>';
-
To save your new profile, click Done in the Cluster Profile Manager UI. If creating the cluster using the command line, run:
saveAsProfile(c, "myAwsBatchCluster");
-
Validate your cluster by clicking the Validate button. The Cluster connection test (parcluster) and Job test (createJob) stages should pass successfully. The remaining validation stages will not pass as communicating jobs are not supported.
Your cluster is now set up for batch processing MATLAB jobs. The first time you submit a job to the cluster, MATLAB prompts you to log into your MathWorks® account.
You can remove the CloudFormation stack and all associated resources when you are done with them. Note that there is no undo. After you delete the cloud resources you cannot use the cluster again.
-
Select the stack in the CloudFormation Stacks screen. Select Delete.
-
Confirm the delete when prompted. It can take a few minutes for CloudFormation to delete your resources.
If your stack fails to create, check the Events section of the CloudFormation console to determine which resources caused the failure and why.
Copyright 2020-2024 The MathWorks, Inc.