tags
ggg, ggg2024, ggg201b

[toc]

GGG 201b WQ 2024 - Lab 1, Day 1 - Links and info

Lab for Fri, Jan 12th, 2024.

Syllabus link for labs

Schedule for lab 1

Resources for learning more stuff!

UNIX Shell specifically

For HW 0, you might find the Happy Belly Bioinformatics UNIX Crash Course helpful.

Background campus resources on computing and bioinformatics.

You might be interested in workshops 1 through 5 of Intro to Remote Computing. Warning, this was written mostly by Titus ;). There are video recordings available.

You might also be interested in the Grad Pathways Microcredentialing in Research Computing - (link). Hit me up by e-mail if you're interested!

DataLab runs a bunch of workshops that you might be interested in, as does the Genome Center Bioinformatics Core. The DataLab workshops are free; the GC ones are not. This course will prepare you well for, and/or complement, these workshops!

The book Bioinformatics Data Skills is an excellent reference that is worth buying if you are looking to invest ;).

Appendix: Advance preparation for HW 0 - links & info

Logging into farm:

Using the login information I sent you for farm, please follow the appropriate set of instructions for your platform:

Instructions for Mac OS, Linux, and WSL

Instructions for Windows and MobaXterm

Once you log in successfully...

You should be at a prompt that says datalab-XX@farm:~$ (with your own account number in place of XX).

Things to try:

  • log out and log back in a few times to make sure you've got it! You can log out by typing exit.
  • log in simultaneously a few times by using a new window or windows.

Request compute resources with srun

Copy & paste the following command at the command prompt:

srun -p high2 --time=3:00:00 --nodes=1 --cpus-per-task 1 --mem 5GB --pty /bin/bash

This asks for three hours of access to one computer and one CPU, reserving 5 GB of memory for your use. The -p high2 option asks for it with high priority, while --pty /bin/bash asks for an interactive terminal instead of running a specific program.

You should see output that looks like this:

srun: job 9312054 queued and waiting for resources
srun: job 9312054 has been allocated resources

but with different numbers ;).

And you should end up at a prompt that looks something like this:

datalab-02@cpu-3-64:~$

but again, with different numbers.
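If the prompt alone doesn't convince you, one way to check is the SLURM_JOB_ID environment variable, which Slurm sets inside a running job. Here is a minimal sketch; the function name where_am_i is made up for illustration and is not something installed on farm.

```shell
# Hypothetical helper: report whether this shell is running inside a Slurm job.
where_am_i() {
    if [ -n "${SLURM_JOB_ID:-}" ]; then
        # Slurm sets SLURM_JOB_ID for shells started by srun.
        echo "inside Slurm job ${SLURM_JOB_ID}"
    else
        echo "not inside a Slurm job (probably still on the login node)"
    fi
}

where_am_i
```

On your reserved node, the job number it reports should match the one srun printed when your resources were allocated.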

What you've done here is reserve a specific chunk of compute time for your sole private use on farm. After 3 hours, your reservation will be cancelled and whatever you're running will be stopped. You can also give up your reservation early by logging out.
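The size of that reservation comes entirely from the options on the srun line. To make the pieces easier to see, here is a rough sketch that builds the same command from named shell variables and prints it without running it. The variable names are illustrative assumptions; only the values come from the command above.

```shell
# Illustrative variable names (assumptions); values are from the srun command above.
partition="high2"        # -p: which partition (queue) to ask for
time_limit="3:00:00"     # --time: hours:minutes:seconds
nodes=1                  # --nodes: number of computers
cpus=1                   # --cpus-per-task: CPUs for your task
memory="5GB"             # --mem: memory to reserve

# Assemble the command as text so you can inspect it before running it.
srun_cmd="srun -p $partition --time=$time_limit --nodes=$nodes --cpus-per-task $cpus --mem $memory --pty /bin/bash"
echo "$srun_cmd"
```

Printing the command requests nothing. It is only a way to see which number controls which part of the reservation.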

Run RStudio Server on your reserved node

Now run:

module load rstudio-server

followed by:

module load R

followed by:

rstudio-launch 

The first command sets up your account to use the RStudio Server software.

The second command sets up your account to use a specific version of R.

The third command runs RStudio Server on farm.

You should see output that looks like this:

Run the following command in a new terminal on your computer:

ssh -L50700:cpu-3-64:50700 [email protected]

Then, on your computer, navigate your browser to:

URL: http://localhost:50700
Username: datalab-02
Password: attention-plausible-overripe-sliceable-vacant-imprint

NOTE: Using R at /share/apps/conda/environments/r-4.2.3/bin/R.
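The NOTE line reports which R installation the module picked up. If rstudio-launch ever complains that it can't find something, one thing to try (in a shell on your reserved node, after the module load commands) is asking the shell whether the commands are on your PATH. This is a small sketch using the shell's built-in command -v. The helper name check_cmd is hypothetical, and it assumes the modules work by adding R and rstudio-launch to PATH.

```shell
# Hypothetical helper: say whether a command can be found on PATH.
check_cmd() {
    if command -v "$1" >/dev/null 2>&1; then
        echo "$1: found"
    else
        echo "$1: not found (did you run 'module load' first?)"
    fi
}

check_cmd R
check_cmd rstudio-launch
```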

Connect to RStudio from your laptop

Find the command in the output above that starts with ssh -L. You'll need to run it on your laptop, so copy it to your clipboard.

If you're on Mac OS, Linux, or WSL: open a new terminal window and paste in the command. You may need to enter your datalab password from my e-mail again.

If you're using MobaXterm: open a shell window and paste in the command. You shouldn't need a password this time.

Leaving all of that running, open a browser and paste in the URL from the output above. It should start with http://localhost.... You'll need to enter your account name and the password printed in that same output (NOT the one in my e-mail).

If all goes well... you should see an RStudio window!
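For the curious: the argument to ssh -L has the form local_port:remote_host:remote_port. Connections to local_port on your laptop are forwarded through farm to remote_port on the compute node, which is why the browser URL points at localhost. Here is a minimal sketch that splits the example value from the sample output using plain shell parameter expansion. The variable names are illustrative, and your node and port will differ.

```shell
# Example forwarding spec copied from the sample output; yours will differ.
spec="50700:cpu-3-64:50700"

local_port="${spec%%:*}"     # text before the first colon
rest="${spec#*:}"            # everything after the first colon
node="${rest%%:*}"           # the compute node named in the spec
remote_port="${rest##*:}"    # the port being forwarded to on that node

echo "laptop port $local_port -> $node port $remote_port"
echo "browse to http://localhost:$local_port"
```

If the browser can't connect, a mismatch between the port in the URL and the first number in the ssh -L argument is one thing worth checking.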