
docs: example writing multiple datasets to orso file #92

Merged (3 commits into main, Oct 22, 2024)

Conversation

jokasimr
Contributor

Fixes #87

This solution is not well integrated into the sciline pipeline.
I'd appreciate suggestions on how to do this better.

@jokasimr jokasimr marked this pull request as ready for review October 10, 2024 12:47
Comment on lines 280 to 285
for fname, quantities in files.items():
wf = workflow.copy()
wf[Filename[SampleRun]] = fname
for name, value in quantities.items():
wf[name] = value
reflectivity_curves.append(wf.compute(NormalizedIofQ))
Member
Can be done using something like:

reflectivity_curves = sciline.compute_mapped(wf.map(files), NormalizedIofQ)

provided that files is set up in a way compatible with map. I suggest using a pandas.DataFrame with column names as keys for Sciline.
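The suggested table-mapping pattern can be sketched with a plain-Python stand-in (all names here — compute_target, the parameter keys, the filenames — are invented for illustration; the real code would use sciline's wf.map(...) and sciline.compute_mapped):

```python
# Plain-Python stand-in for mapping a workflow over a parameter table.
# Each row of the table sets the parameters for one run; the base
# parameters play the role of the unmapped workflow.

def compute_target(params):
    # Stand-in for computing NormalizedIofQ from a parameterized workflow.
    return f"I(Q) from {params['filename']}"

base_params = {'sample_rotation': 0.5}  # hypothetical default parameters

# Parameter table: column names are the keys Sciline would map over.
table = [
    {'filename': 'run1.nxs', 'sample_rotation': 0.5},
    {'filename': 'run2.nxs', 'sample_rotation': 1.0},
]

reflectivity_curves = [compute_target({**base_params, **row}) for row in table]
```

The point of the rectangular table is that every row sets the same columns, which is what lets the mapping machinery treat the runs uniformly.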

Contributor Author (@jokasimr, Oct 11, 2024)

Sure, I can use that here. But it seems slightly different from the current implementation: there, the runs that are mapped over don't all have to set the same parameters. One run can set SampleRotation and another can set some other parameter. But if the parameters are defined in a table, then all "runs" (rows of that table) need to set the same parameters.

Member

Do they have different parameters?

Contributor Author

Yes, that's plausible. For example, in the reduction of data from the experiments we did at PSI a month ago, there were some special settings in the low-angle measurement; the high-angle measurement just used the default setting.

We could of course specify a value for that setting for both runs, spelling out the default value for the high-angle run, but that is more cumbersome and error-prone.
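The per-run override pattern described here can be sketched in plain Python (the parameter names are hypothetical): each run supplies only what differs, and dict merging fills in the defaults.

```python
# Hypothetical default parameters; each run overrides only what it needs.
defaults = {'sample_rotation': 0.5, 'chopper_phase': 0.0}

low_angle = {'chopper_phase': 7.5}   # special setting for the low-angle run
high_angle = {}                      # relies entirely on the defaults

# Merge: overrides win over defaults, missing keys fall back to defaults.
runs = [{**defaults, **overrides} for overrides in (low_angle, high_angle)]
```

This avoids repeating the default value for the high-angle run, which is exactly what a rectangular parameter table would force every row to do.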

Comment on lines 295 to 349
datasets = []
for (fname, quantities), curve, scale_factor in zip(
files.items(), reflectivity_curves, scale_factors, strict=True
):
wf = workflow.copy()
wf[Filename[SampleRun]] = fname
for name, value in quantities.items():
wf[name] = value
wf[NormalizedIofQ] = scale_factor * curve
dataset = wf.compute(orso.OrsoIofQDataset)
Member

datasets = sciline.compute_mapped(wf.map(dataframe), orso.OrsoIofQDataset)

If you set up dataframe with a NormalizedIofQ column (I am not 100% sure we can map non-source nodes. Edit: no, we cannot; I think we should add __delitem__, as also discussed in scipp/cyclebane#14).
Not sure about find_corrections, but I presume the corrections are the same for all files, i.e., you can just run that once on the base workflow?

Contributor Author

but I presume the corrections are the same for all files, i.e., you can just run that once on the base workflow?

They are probably the same, but I don't see the benefit of making that assumption here.

Member

You are literally using the same workflow, how could it be different?

Contributor Author (@jokasimr, Oct 11, 2024)

Which corrections are applied could depend on the parameters of the workflow. Right now it doesn't, but I don't think that is an implausible situation. Besides, there's no cost to rerunning it; it takes 2 ms.

Member

Seems quite implausible; I thought this was for handling measurements from the same "run" at multiple angles?

But if you are convinced that this is what you need, then I don't have any suggestions for improvement.

Contributor Author

Seems quite implausible; I thought this was for handling measurements from the same "run" at multiple angles?

That's the main use case, yes. It's wrong to say that I'm convinced this is what we need; I don't know, and therefore opted to keep it flexible, since the cost of doing so is negligible.

Member

@SimonHeybrock left a comment

Missing tests.

Comment on lines +155 to +245
[normalized_ioq, orso_dataset], params={Filename[SampleRun]: 'default'}
)
datasets = orso_datasets_from_measurements(
workflow,
[{}, {Filename[SampleRun]: 'special'}],
Member

You are not really testing what you claimed above was needed: different sets of parameters for different runs. Yes, you pass {} for the first, but since it relies on a default value, that is not saying much. For what you are testing here, you could have gone with the simpler version I suggested, simply by passing the filename for each. As it is now, what orso_datasets_from_measurements will do for the first run is a mystery to the reader.

Contributor Author

There seems to be a misunderstanding here. Maybe we should talk about this in person? If you have a simpler and better solution in mind, I'll be happy to use it.

You are not really testing what you claimed above was needed: Different sets of parameter for different runs.

We want to override different parameters in different runs. For example, in one run we might want to override the SampleRotation parameter, but in another run we want to use the default (meaning it is read from the file).
The intent of this test is to assert that:

  1. if no value was provided for a parameter (in this case Filename), the default value is used;
  2. if a value was provided, that value is used.
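These two assertions can be sketched with a stdlib stand-in for orso_datasets_from_measurements (the real function takes a sciline workflow; this stand-in only models the parameter-override behavior being tested, and all names are hypothetical):

```python
# Stand-in: merge each run's overrides into the default parameters.
def orso_datasets_from_measurements(default_params, runs):
    return [{**default_params, **run} for run in runs]

datasets = orso_datasets_from_measurements(
    {'filename': 'default'},
    [{}, {'filename': 'special'}],  # first run: default used; second: override
)
```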

scale_to_overlap=False,
)
assert len(datasets) == 2
assert tuple(d.info.name for d in datasets) == ('default.orso', 'special.orso')
Member

Above you said orso_datasets_from_measurements should be able to perform/record possibly distinct corrections for each run. Here you are testing the same correction, but it retrieves a different value (the filename). Is that the same thing?

Contributor Author

This doesn't test any aspect of the "correction finder" mechanism, so I'm not sure what you mean.
It only tests whether the provided parameter value was used, or the default parameter value.

@jokasimr jokasimr force-pushed the orso-multi-dataset branch 2 times, most recently from a8cb24d to 1860988 on October 22, 2024 08:07
@jokasimr jokasimr merged commit a6f14eb into main Oct 22, 2024
4 checks passed
@jokasimr jokasimr deleted the orso-multi-dataset branch October 22, 2024 09:44
Successfully merging this pull request may close these issues.

Functionality to write ORSO dataset from combination of several runs