
Add more loss types and options #385

Merged: 5 commits into main from mae-loss on Nov 12, 2024
Conversation

@frostedoyster (Collaborator) commented Nov 10, 2024:

This PR implements more loss options. Now the user can choose the type of loss (mse, mae, huber) as well as the reduction used (sum or mean).
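
For illustration, a sketch of what the resulting ``loss`` section of ``options.yaml`` could look like, using only the keys discussed in this PR (``weights``, ``type``, ``reduction``); the weight values are placeholders rather than defaults:

loss:
    weights: {"energy": 1.0, "forces": 0.1}
    type: mse        # or mae, or a huber object with its own options
    reduction: sum   # or mean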

Contributor (creator of pull-request) checklist

  • Tests updated (for new features and bugfixes)?
  • Documentation updated (for new features)?
  • Issue referenced (for PRs that solve an issue)?

📚 Documentation preview 📚: https://metatrain--385.org.readthedocs.build/en/385/

@PicoCentauri (Contributor) left a comment:

Very useful addition. I hope our config doesn't get too complicated with all these nested options. I suggested one shorthand notation which might be useful.

(quoted from the documentation diff)

and 0.1 for the forces, you can set the following in the ``options.yaml`` file:
``loss_weights: {"energy": 1.0, "forces": 0.1}``.
- ``loss``: This section describes the loss function to be used, and it has three
  subsections. 1. ``weights``. This controls the weighting of different contributions
@PicoCentauri (Contributor):

Maybe give the default value here?

Comment on lines 31 to 32
weights: {}
type: mse
@PicoCentauri (Contributor):

I would switch the type and the weight dictionary.

@frostedoyster (Collaborator, Author):

Sure

Comment on lines 210 to 211
reduction=self.hypers["loss"]["reduction"],
type=self.hypers["loss"]["type"],
@PicoCentauri (Contributor):

Does it make sense to design the function in such a way that you can simply pass

loss_fn = TensorMapDictLoss(**self.hypers["loss"])

@frostedoyster (Collaborator, Author):

Good idea, I'll try to do it
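
For context, a minimal sketch of that pattern, assuming a hypothetical constructor whose keyword names mirror the keys of hypers["loss"]; the real TensorMapDictLoss signature may differ:

# Hypothetical stand-in for the real class, for illustration only.
class TensorMapDictLoss:
    def __init__(self, weights, type="mse", reduction="sum"):
        self.weights = weights
        self.type = type
        self.reduction = reduction

hypers = {"loss": {"weights": {}, "type": "mse", "reduction": "sum"}}
loss_fn = TensorMapDictLoss(**hypers["loss"])  # no need to pass each key by hand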

"oneOf": [
{
"type": "string",
"enum": ["mse", "mae"]
@PicoCentauri (Contributor):

isn't "huber" missing here?

@frostedoyster (Collaborator, Author):

"huber" is actually an object (not a string) and it's allowed, just a few lines below these

log_mae: False
loss:
@PicoCentauri (Contributor):

I think we should also just allow

loss: mse

and expand this to

loss:
    type: mse
    weights: {}
    reduction: sum

This should help usability and is in line with what we do for the datasets.

@frostedoyster (Collaborator, Author) commented Nov 11, 2024:

I think this is confusing for the user, because then the hypers change dynamically. Since we give the user the full template of the hypers to rely on, I would keep the format exactly as in the default hypers template (at least for now)

Member:

I agree with @PicoCentauri here, having a smaller version of the same input is nice, especially if people are not just copy-pasting our templates around. We are already doing this for datasets:

training_set: "dataset.xyz"

Actually means

training_set:
    systems:
        read_from: dataset.xyz
        reader: ase
        length_unit: null
#        ...

@frostedoyster (Collaborator, Author) commented Nov 11, 2024:

Copy-pasting the templates is the only way for users to modify the hypers... they can't make them up. This means that this feature, which is hard to implement, test and maintain, won't be used much

@frostedoyster (Collaborator, Author):

I'm saying that the overhead for this feature just isn't worth it, considering that the projected gain is from

loss:
  type: mae

to

loss: mae

Member:

It's not a lot more, but it is more consistent with other parameters, and nicer to use when writing the input by hand.

I haven't checked the code complexity, but this also looks like a fairly easy thing to implement, so the implementation overhead should be very manageable.

@frostedoyster (Collaborator, Author):

For now, I think it opens too many possibilities and, from what I can see, it's not easy to test and maintain. Like in programming languages, having many ways of achieving the same thing is not always an advantage. I will open the issue though

Member:

I don't see why this opens too many possibilities, could you elaborate? Testing and maintenance should also be trivial: if the loss is given as a string, you transform it to loss: {"type": <value>} and then continue as is. Am I missing something here?
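
A minimal sketch of that normalization, assuming the loss hypers arrive as a plain Python dict (or a bare string) and using the defaults shown earlier in this thread:

def normalize_loss_hypers(loss):
    # Shorthand: a bare string such as "mse" stands for the whole section.
    if isinstance(loss, str):
        loss = {"type": loss}
    # Fill in missing keys so downstream code can rely on the full structure.
    loss.setdefault("type", "mse")
    loss.setdefault("weights", {})
    loss.setdefault("reduction", "sum")
    return loss

assert normalize_loss_hypers("mae") == {"type": "mae", "weights": {}, "reduction": "sum"}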

Member:

I could understand if we were trying to be very explicit everywhere, but in general metatrain's options.yaml already includes a lot of syntactic sugar, from the default hypers to the dataset handling, and I really don't see how the loss would be any different.

@frostedoyster (Collaborator, Author):

I would merge by the end of today if there are no objections, because we need this feature for some work and the outstanding point is more of a general design issue. I will make sure to open an issue about it

@frostedoyster merged commit 36de384 into main on Nov 12, 2024
12 checks passed
@frostedoyster deleted the mae-loss branch on November 12, 2024 at 17:39