
How to add resource manager options inside Cerise #75

Open
felipeZ opened this issue Feb 5, 2018 · 6 comments

felipeZ commented Feb 5, 2018

Let's imagine that I have the following Slurm script:

#! /bin/bash
#SBATCH -t 00:05:00
#SBATCH -N 1
#SBATCH -J test
#SBATCH -C TitanX
#SBATCH --gres=gpu:1

How can I add constraints like -C TitanX from Cerise?

LourensVeen (Member) commented

It's not currently supported, because Xenon doesn't do that yet: xenon-middleware/xenon#582

Then there's the question of where these things should be specified: in the Cerise configuration, or in the CWL file. The CWL file seems logical, but CWL 1.0 doesn't support that, see common-workflow-language/common-workflow-language#587

So we're a bit constrained by the technology we're working with. We'll need some kind of solution, but without Xenon support that's not so easy. Xenon does let you use a custom job script, but that means we'd have to do everything else by hand as well, and for every supported scheduler, which is exactly what we're trying to avoid by using Xenon...

felipeZ commented Feb 5, 2018

For running MD simulations with Gromacs we have cases where GPUs are only available in a specific queue; in other cases we need certain constraints, or a combination of queue names and constraints.
The user should have some control over the resource manager (e.g. Slurm, Torque): some testing can be done in a short queue, while production requires another queue. Right now the queue name is hardcoded in the cerise-config, forcing the user to rebuild the container every time a different queue is required.

LourensVeen (Member) commented

It seems like it would be best to let the user add a hint (which is a CWL feature) to the Workflow, where they can specify the kind of node (or however we call it, needs thought) to run on. These kinds are then defined in the specialisation, because they depend on the machine (not all machines have GPUs, short queues, or whatever). The same hint feature could then be used to specify the number of nodes to request and the runtime, for #44.
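As a rough sketch of what that could look like from the user's side, assuming a hypothetical hint class (nothing like cerise:ResourceHint exists yet, and the field names are made up):

cwlVersion: v1.0
class: Workflow
$namespaces:
  cerise: "http://example.org/cerise#"   # placeholder namespace for this sketch
hints:
  cerise:ResourceHint:        # hypothetical hint class, to be defined by Cerise
    nodeType: gpu             # node kinds defined in the specialisation
    numNodes: 1               # could also cover the request from #44
    timeLimit: 3600           # runtime in seconds, likewise for #44
inputs: []
outputs: []
steps: {}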

LourensVeen (Member) commented

Xenon now has support for setting job constraints, so it's waiting for the upgrade to Xenon 2 now.

LourensVeen (Member) commented

Actually, we already had the idea of specifying different steps for different use cases. For example, we'd have a gromacs_fast.cwl and a gromacs_efficient.cwl, one giving the result ASAP, the other using as few core hours as possible. And we could have gromacs_protein_protein.cwl for larger systems, or something. So the user wouldn't give a hint, they'd call a specific step, and the step would contain a hint.
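For example, gromacs_fast.cwl could carry the hint itself, so the user just picks the step (again, the hint class and its fields are hypothetical, the same sketch as above):

# gromacs_fast.cwl - tuned for minimal wall-clock time
# ($namespaces declaration omitted for brevity)
cwlVersion: v1.0
class: CommandLineTool
baseCommand: [gmx, mdrun]
hints:
  cerise:ResourceHint:        # hypothetical; resolved by Cerise per specialisation
    nodeType: gpu             # the specialisation maps this to e.g. -C TitanX --gres=gpu:1
    numNodes: 1
inputs: []
outputs: []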

Cerise will then, on reading the steps on startup, build a table of requirements per step. When a workflow is submitted, the requirements of all the steps used will be merged, and then the job will be submitted with them. Conflicting requirements should be avoided by the specialist as much as possible, but will result in a PermanentFailure if incompatible steps are used. That should give a good error message in the job log as well.
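A minimal sketch of that merge step in Python, assuming each step's requirements are a flat set of key-value pairs and that two different values for the same key constitute a conflict (none of this is actual Cerise code):

class RequirementConflict(Exception):
    """Raised when two steps ask for incompatible resources."""

def merge_requirements(step_requirements):
    """Merge the requirement dicts of all steps used in a workflow.

    step_requirements: dict mapping step name -> requirements dict.
    Returns a single merged requirements dict, or raises
    RequirementConflict, which the job runner would translate into a
    PermanentFailure with a message in the job log.
    """
    merged = {}
    for step, reqs in step_requirements.items():
        for key, value in reqs.items():
            if key in merged and merged[key] != value:
                raise RequirementConflict(
                    'Step "{}" requires {}={}, which conflicts with an '
                    'earlier step requiring {}={}'.format(
                        step, key, value, key, merged[key]))
            merged[key] = value
    return merged

# Example: gromacs_fast.cwl wants a GPU node, another step is indifferent.
merged = merge_requirements({
    'gromacs_fast': {'nodeType': 'gpu', 'numNodes': 1},
    'analysis': {'numNodes': 1},
})  # -> {'nodeType': 'gpu', 'numNodes': 1}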

LourensVeen added a commit that referenced this issue Nov 4, 2018

LourensVeen commented Nov 4, 2018

Okay, Xenon 2 didn't happen, and we're running out of time a bit, but Cerulean can do this, so I've added the simplest solution I could think of: an extra option next to queue-name in the API configuration where you can specify additional scheduler options. This may mean that we need two separate specialisations, one with and one without GPUs, but I think that's acceptable for now. @felipeZ will that work for you?
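For illustration, in the specialisation's API configuration that might look roughly like this (the surrounding structure is abbreviated and the key name scheduler-options is a guess; see the referenced commit for the actual name):

compute-resource:
  scheduler:
    queue-name: gpu_short                        # as before
    scheduler-options: "-C TitanX --gres=gpu:1"  # hypothetical key for extra options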
