Add input and output validation #104

pluflou · 2024-12-18T06:32:11Z

Extends @t-bz's work in Add validation #102
Makes default_value required for input_variables in LUMEBaseModel
Adds precision attribute to TorchModel class
Removes value_range_tolerance attribute from Variable class (possibly a temporary change, pending further discussion)
Extends input validation to check the input dictionary, type casting, and the values of torch tensors within the dictionary before any other method is called
Re-introduces is_constant attribute to Variable class
Adds strictness config setting for input/output validation
Updates example notebooks
Updates tests
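
As a rough illustration of the strictness setting for validation, the sketch below shows one way a range check could react differently depending on a configured level. The function name `validate_value` and the level names "none", "warn", and "error" are assumptions made for this example, not taken from the code in this PR.

```python
import warnings


def validate_value(name, value, value_range, strictness="error"):
    """Check that value lies within value_range (hypothetical helper).

    strictness is an assumed setting with three levels:
    "none" skips the check, "warn" emits a warning, "error" raises.
    """
    if strictness not in ("none", "warn", "error"):
        raise ValueError(f"Unknown strictness level: {strictness!r}")
    if strictness == "none":
        return
    low, high = value_range
    if low <= value <= high:
        return
    message = (
        f"Value {value} of variable '{name}' is outside "
        f"the range [{low}, {high}]."
    )
    if strictness == "warn":
        warnings.warn(message)
    else:
        raise ValueError(message)


# In-range values pass silently at every level.
validate_value("x", 0.5, (0.0, 1.0))
# Out-of-range values are ignored when validation is switched off.
validate_value("x", 5.0, (0.0, 1.0), strictness="none")
```

A single setting like this lets the same check serve both exploratory use (warnings) and production use (hard errors) without duplicating the validation logic.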

To do:

Finalize documentation update
Add more unit and integration tests to cover any new code

roussel-ryan

Mostly looks good to me. I have one question on top of the comment I left: is there any testing for evaluating models on non-scalar tensors for the input variables?
For example, in `torch_model.ipynb`, does the following work:

input_dict = torch_model.random_input(n_samples=5)
torch_model.evaluate(input_dict)

roussel-ryan · 2024-12-18T15:55:18Z

lume_model/base.py

@@ -292,6 +277,14 @@ def unique_variable_names(cls, value):
        verify_unique_variable_names(value)
        return value

+    @field_validator("input_variables")
+    def verify_input_default_value(cls, value):


This is what differentiates input variables from output variables, correct? If so, I want to go back on what we discussed earlier and maybe have different input/output class types, so that validation errors happen during variable definition instead of model validation.

Yes, that's correct. We either make the attribute optional and validate it this way, or make it required and set the output variable default to NaN in the base class, but the latter was a little messy. We can revisit having separate child classes.
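
To make the trade-off concrete, here is a minimal sketch of the separate-classes idea, in which a missing or out-of-range default fails as soon as the variable is defined. It uses standard-library dataclasses as a stand-in for the pydantic models in lume-model, and the class names `InputVariable` and `OutputVariable` along with their fields are assumptions for illustration only.

```python
import math
from dataclasses import dataclass
from typing import Optional, Tuple


@dataclass
class Variable:
    """Hypothetical base class holding what inputs and outputs share."""
    name: str


@dataclass
class InputVariable(Variable):
    """Input variable: default_value has no default, so it is required."""
    default_value: float
    value_range: Tuple[float, float] = (-math.inf, math.inf)

    def __post_init__(self):
        # Runs at construction, so a bad default fails at definition time.
        low, high = self.value_range
        if not low <= self.default_value <= high:
            raise ValueError(
                f"Default value {self.default_value} of '{self.name}' "
                f"is outside the range [{low}, {high}]."
            )


@dataclass
class OutputVariable(Variable):
    """Output variable: a default value is optional."""
    default_value: Optional[float] = None


x = InputVariable(name="x", default_value=1.0, value_range=(0.0, 2.0))
y = OutputVariable(name="y")
```

With this layout, `InputVariable(name="x")` raises a `TypeError` for the missing argument, so no model-level validator is needed to enforce the requirement.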

pluflou · 2024-12-18T17:11:40Z

For example, in `torch_model.ipynb`, does the following work:
input_dict = torch_model.random_input(n_samples=5)
torch_model.evaluate(input_dict)

Yes, the above code will work. Tensor inputs may be non-scalar, but non-scalar inputs of other types (e.g. a list of scalars) are not supported.

roussel-ryan

LGTM except for one comment

roussel-ryan · 2024-12-19T18:53:20Z

lume_model/models/torch_model.py

-    def evaluate(
+    def _set_precision(self, value: torch.dtype):
+        """Sets the precision of the model."""
+        torch.set_default_dtype(value)


This is probably an overreach, i.e. it extends beyond this class/object. Do we need to set the default dtype for torch here?

You're right, that is not needed since we already set the precision on the model and transformers. I removed it.

roussel-ryan · 2024-12-20T17:27:27Z

LGTM, you can go ahead and merge

t-bz and others added 26 commits December 5, 2024 20:13

Refactor variable definitions to support validation

dae5794

Update variable tests

24655a2

Fix utils tests

768a440

Fix base tests

3e80ceb

Fix torch_model tests

370362d

Fix torch_module tests

708b9e5

Add validation to models

b817d44

Add validation tests

47ea12a

Clean up conftest

c9dd02c

Remove SerializeAsAny annotation since variable serialization is explicitly addressed in model_dump()

6fbd75a

make default value required

297e2da

add single and double precision support in variable class

50bd4d3

add support for numpy floats

892de33

add validation for input_dict and support for precision setting in model metadata

6990180

make input dict validation strict

0f786d0

catch bools in torch tensor inputs

2c0eaab

drop np.float32 until we have a use-case

eb8425d

add dynamic checking for default vals, and strict flag for range checking

dc08f80

make type casting more consistent during input validation

629b77d

make default required for inputs only and validate in base class

bd322f7

fix range validation tests

9552dd9

remove range check within tolerance for now

d2c2da6

add is_constant flag and default range, fix unit tests

fa508eb

update example nbs

b6aaddf

update example notebooks

cbb67e1

add nicer onnx graphs

e409459

pluflou requested a review from roussel-ryan December 18, 2024 06:32

roussel-ryan reviewed Dec 18, 2024

simplify validation config

e6c7b6a

pluflou added 2 commits December 18, 2024 13:16

Merge branch 'main' of https://github.com/slaclab/lume-model into validation

45dfde8

fix tests after adjusting config validation format

cba3073

roussel-ryan reviewed Dec 19, 2024

pluflou added 2 commits December 19, 2024 13:24

remove setting torch default dtype

2de7dd2

add some tests

6c83403

pluflou added 3 commits December 20, 2024 09:50

adjust docstrings

b10521a

update README

6bb2e07

reset precision to double to fix tests

f34926f

pluflou merged commit dfde898 into slaclab:main Dec 20, 2024
4 checks passed

pluflou deleted the validation branch December 20, 2024 18:50
