From cf2bfee04bcdaa2b1ac83f9245b1af0cdd0b5ac5 Mon Sep 17 00:00:00 2001 From: Eric Coulter Date: Fri, 30 Oct 2020 15:28:14 -0400 Subject: [PATCH] Updated the README for cluster_create.sh Signed-off-by: Eric Coulter --- README.md | 20 ++++++++++++-------- 1 file changed, 12 insertions(+), 8 deletions(-) diff --git a/README.md b/README.md index 2945b71..742c399 100644 --- a/README.md +++ b/README.md @@ -44,28 +44,32 @@ To build your own Virtual cluster, starting on your localhost: (or ensure that they're available in the shared filesystem). * For other modifications, feel free to get in touch! -1. Run ```headnode_create.sh``` - it *will* require an ssh key to exist in +1. Run ```cluster_create.sh``` - it *will* require an ssh key to exist in ```${HOME}/.ssh/id_rsa.pub```. This will be the key used for your jetstream instance! If you prefer to use a different key, be sure to edit this script accordingly. The expected argument is only the headnode name, and will create an 'm1.small' instance for you. - ```./headnode_create.sh ``` + ```./cluster_create.sh ``` Watch for the ip address of your new instance at the end of the script! + It is worth double-checking the output of the script to see that the ansible + playbook for compute image creation ran successfully - no failed tasks means + that the image for your compute nodes should be available. + 1. The headnode_create script has copied everything in this directory - to your headnode. You should now be able to ssh in + to your headnode EXCEPT your local openrc file. You should now be able to ssh in as the centos user, with your default ssh key: ```ssh centos@``` -1. Now, in the copied directory, *on the headnode*, run the install.sh script - with sudo: +1. Your cluster is now up and running. You can submit jobs via sbatch or srun. If you + see issues with running jobs, there are useful logs in + ``` /var/log/slurm/slurm_elastic``` + and + ``` /var/log/slurm/slurmctld.log``` - ```sudo ./install.sh```. - This script handles all the steps necessary to install slurm, with - elastic nodes set. Useage note: Slurm will run the suspend/resume scripts in response to