
Specifying memory usage per task? #19

Open

lgarrison opened this issue May 10, 2021 · 3 comments

@lgarrison (Member)

disBatch has worked great for me for dealing with heterogeneous tasks that take different amounts of CPU time. But I'm now facing a set of jobs with heterogeneous memory usage, and I'm stuck requesting a very conservative number of jobs per node so that no node blows out its memory. I can estimate each job's memory usage, and I'm wondering if it would be possible to communicate this information to disBatch so that it would only dispatch a job to a node when that node has enough free memory for it. I guess the syntax might be something like:

#DISBATCH MEM 8GB
job1
job2
#DISBATCH MEM 20GB
job3
etc...

disBatch would also need to know about the available memory on each node, which I guess it could learn through Slurm. For non-Slurm backends, maybe it could be specified manually.
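As a rough illustration of the dispatch rule being proposed here (not disBatch's actual internals), a memory-aware placement loop might look like the sketch below. Node names, capacities, and per-task memory figures are made-up assumptions; the per-node totals would in practice come from Slurm (e.g. `sinfo -N -h -o "%N %m"`) and the per-task figures from the hypothetical `#DISBATCH MEM` directives.

```python
def pick_node(free_mem_mb, task_mem_mb):
    """Return a node with at least task_mem_mb free, or None."""
    for node, free in free_mem_mb.items():
        if free >= task_mem_mb:
            return node
    return None


def dispatch(nodes_mb, tasks):
    """Greedily place (name, mem_mb) tasks onto nodes.

    Tasks that fit nowhere go into a waiting list; a real scheduler
    would retry them as running tasks finish and release memory.
    """
    free = dict(nodes_mb)
    placed, waiting = {}, []
    for name, mem in tasks:
        node = pick_node(free, mem)
        if node is None:
            waiting.append(name)
        else:
            free[node] -= mem
            placed[name] = node
    return placed, waiting


if __name__ == "__main__":
    nodes = {"worker1": 32000, "worker2": 64000}
    tasks = [("job1", 8000), ("job2", 8000),
             ("job3", 20000), ("job4", 60000)]
    placed, waiting = dispatch(nodes, tasks)
    print(placed, waiting)
```

Even this toy version hints at the hard parts the comments below get into: tracking released memory over time, avoiding fragmentation, and not starving large tasks — i.e., reinventing a chunk of a resource manager.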

Do you think a feature like this makes sense for disBatch? I'd be happy to take a stab at this feature if you think so!

@njcarriero (Collaborator)

njcarriero commented Jun 23, 2021

Thanks for the suggestion.

This is something I have been thinking about for a while (as has the related issue of a variable number of cores per task).

I haven't come up with a solution that doesn't involve reinventing a non-trivial portion of a resource manager. If you think you have a good idea, go for it. FYI, the dynamicdb branch will soon(-ish) become release 2.0.

In a pinch, a user can clump small tasks together via shell ops, e.g. using a task like "job1 & job2 ; wait". But that defeats one design goal, which was to provide per-task record keeping.
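The clumping workaround might look like this in a task file (job names hypothetical); each line is one disBatch task, so the paired jobs share a slot but also share a single task record:

```
# Two small jobs share one task line; disBatch's per-task records
# cover the pair rather than each job individually.
small_job_a & small_job_b ; wait

# A memory-hungry job keeps a task line (and hence a slot) to itself.
big_job_c
```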

@lgarrison (Member, Author)

Thanks; I didn't have an implementation immediately in mind, and I agree that one doesn't want to reinvent a resource manager! In the last few weeks my need for this feature has lessened, but I may return to it in the future. The task-clumping idea may help too; thanks for the suggestion.

@xiuliren
I was looking for this feature as well. My pipeline consists of over ten tasks with different resource requirements. I am currently running them manually one by one!

3 participants