
Allow persisting changes in between turbo mode runs #210

Open
lukasmartinelli opened this issue Jun 7, 2018 · 2 comments


@lukasmartinelli

Is there a recommended pattern for containers that want to load a big dataset ahead of time and then operate in turbo mode, persisting that data across job invocations?

In other words: amortizing a long startup time over multiple job invocations.

This applies to applications that need to load a model, a graph, or a database in order to execute a job and want to keep it in memory or on disk in between runs.

Options off the top of my head:

  • Expose watchbot as a library so it becomes trivial to implement your own job invocation loop on top of the SQS polling, e.g. treat this as a very special ECS service (see the sketch after this list)
  • Support "persistent turbo" mode jobs as an option that does not clean out data directories (or kill a background process) in between job runs

/cc @rclark @jakepruitt

@daniel-j-h

Adding a use case here (cc @vsmart): loading large self-contained machine learning models once and using ecs-watchbot to scale out across CPU workers running them on a large number of images. The models are read-only and will always be the same. At the moment we simply use a large batch size per worker to amortize the model download on each worker, but this limits scale-out.

@lukasmartinelli (Author)

I second that use case.
I've hit it before with NLP models too, which take some minutes to download 👍
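
To make the "persistent turbo" option above concrete, here is a minimal sketch of what a worker could do if the data directory survived between runs: cache the downloaded model on disk so only the first job on a container pays the download cost. The bucket, key, and cache path below are made up for illustration.

```python
# Hypothetical worker-side caching, assuming the data directory is persisted
# between turbo mode runs (bucket, key, and cache path are made up).
import os
import boto3

CACHE_DIR = "/mnt/data/models"          # assumed to persist between job runs
MODEL_BUCKET = "my-model-bucket"        # hypothetical bucket
MODEL_KEY = "models/nlp-model-v1.bin"   # hypothetical S3 key

def ensure_model():
    """Download the model only if a previous run has not cached it already."""
    local_path = os.path.join(CACHE_DIR, os.path.basename(MODEL_KEY))
    if not os.path.exists(local_path):
        os.makedirs(CACHE_DIR, exist_ok=True)
        boto3.client("s3").download_file(MODEL_BUCKET, MODEL_KEY, local_path)
    return local_path

if __name__ == "__main__":
    model_path = ensure_model()  # minutes on the first run, instant afterwards
    # ... load model_path and run the actual job ...
```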
