PET & Alchemical-model produce different results with same random seed #211

HannaTuerk · 2024-05-28T14:10:03Z

Hi,
@M-R-Schaefer and I trained metatensor-models for PET and Alchemical-models with the same random seed on different machines.

PET: it produced different models (performance similar but RMSE of the energy differs around 0.3 eV ). Rerunning a training from the same machine also produces different models.
My pet version (pulled 28.5.2024, with CUDA_DETERMINISTIC: True the training differences are at around 10**-7 for some energy traiininigs. (I did 2 trainings with same random seeds produce the same training (only 2 epochs to test)).

Alchemical-models: It produces the same result on the same machine, but on different machines with the same random seed the output is not reproducible (resulting models are different).

We also tried soap-bpnn and gap, for both the trainings were reproducible for different runs on the same and on different machines (+Moritz and my laptop).

frostedoyster · 2024-05-28T14:17:37Z

Thank you! We'll be working on it

Luthaf added Alchemical Model Alchemical model experimental architecture PET PET experimental architecture labels May 28, 2024

PicoCentauri added the Priority: Medium Important issues to address after high priority. label Jun 3, 2024

frostedoyster self-assigned this Jun 3, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

PET & Alchemical-model produce different results with same random seed #211

PET & Alchemical-model produce different results with same random seed #211

HannaTuerk commented May 28, 2024

frostedoyster commented May 28, 2024

PET & Alchemical-model produce different results with same random seed #211

PET & Alchemical-model produce different results with same random seed #211

Comments

HannaTuerk commented May 28, 2024

frostedoyster commented May 28, 2024