ensure reproducibility and simplify the process of loading old jobs #287
Labels
enhancement
New feature or request
enterprise_practices
Governance housekeeping trying to keep projects usable and secure.
as the title suggests a user that wants to reproduce the output of a particular job currently follows the following steps:
exp
directory with state config files.docker pull
the image from the container registry (requires login and authentication via a custom process)data/
folder andexp/
folder respectively.This process is super rough and not even fully reproducible. If the job was launched with uncommitted changes to the
src
directory, AND the docker image tag was subsequently reused, that data is lost forever. Full reproducibility would come down to memory of what changes occured.This is obviously a problem and needs a solution. I propose a couple of different routes to solving this:
Solution A (my preferred): Improve docker images, build automated pipeline for gathering
input
mount and containerdocker pull
with guarantees that the image has not been overridden by future jobs (because the hashes still line up)Solution B: mount everything.
src
directory into docker image, we instead mount it just like we do with the experiments currently.src
were uncommittedSolution C: remove ability to push images with uncommitted
src
changessrc
directory at the time the experiment was launched, if we disallow this, then our current flow would work just fine.src
some of them not)The text was updated successfully, but these errors were encountered: