Chunk interpolation to select calibration data #2634

ctoennis · 2024-10-31T11:27:37Z

I will need a method to select calibration data for the star tracker. I made some slides to describe how it is supposed to work:
https://docs.google.com/presentation/d/1oxIcYSQvGnU7IQYy3fGdcv0qXiLpvaXR9YtmnDesj4Y/edit?usp=sharing

src/ctapipe/monitoring/interpolation.py

ctao-dpps-sonarqube · 2024-10-31T12:20:01Z

Analysis Details

1 Issue

0 Bugs
0 Vulnerabilities
1 Code Smell

Coverage and Duplications

98.00% Coverage (94.30% Estimated after merge)
0.00% Duplicated Code (0.70% Estimated after merge)

Project ID: cta-observatory_ctapipe_AY52EYhuvuGcMFidNyUs

ctoennis · 2024-11-10T12:08:53Z

@maxnoe @kosack Can you have another look if there is something else to be changed?

mexanick · 2024-11-18T15:52:28Z

@kosack @maxnoe this PR is needed to complete the pointing calibration (for the variance calibration application), can we advance it?

src/ctapipe/monitoring/interpolation.py

ctoennis · 2024-11-22T13:38:49Z

I am a bit stuck here with one of the tests. test_hdf5 is failing in pytest because the data from the file is not loaded correctly. When I inspect the x and y values of the interpolators inside the test, I get wrong values. However, if I do the same outside of pytest, it works. I used this code to test it myself:

import astropy.units as u
import numpy as np
import tables
from astropy.table import Table
from astropy.time import Time

from functools import partial
from ctapipe.core import Component, traits

from ctapipe.monitoring.interpolation import PointingInterpolator

from ctapipe.io import write_table

t0 = Time("2022-01-01T00:00:00")

table = Table(
    {"time": t0 + np.arange(0.0, 10.1, 2.0) * u.s, "azimuth": np.radians(np.linspace(0.0, 10.0, 6)) * u.rad, "altitude": np.radians(>)

path = "pointing.h5"

write_table(table, path, "/dl0/monitoring/telescope/pointing/tel_001")

with tables.open_file(path) as h5file:
    interpolator = PointingInterpolator(h5file)
    t = t0 + 1 * u.s
    alt, az = interpolator(tel_id=1, time=t)
    print(interpolator._interpolators[1]["alt"].y,interpolator._interpolators[1]["alt"].x)
    print(alt,az)

Does anyone have an idea what is wrong here?

src/ctapipe/monitoring/tests/test_interpolator.py

maxnoe · 2024-12-04T10:17:22Z

src/ctapipe/monitoring/interpolation.py

        super().__init__(**kwargs)

+        self._interpolators = {}


As discussed before, the base class should be rather pure. An interface shouldn't prescribe private data layout.

maxnoe · 2024-12-04T10:30:09Z

src/ctapipe/monitoring/interpolation.py

+    def __init__(self, h5file: None | tables.File = None, **kwargs: Any) -> None:
+        super().__init__(h5file=h5file, **kwargs)
+
+        self._interpolators = {}


Why is this now here and not in the LinearInterpolator?

maxnoe · 2024-12-04T10:44:32Z

This looks good now. A remaining question would be if you want to add specific ChunkInterpolators (like PointingInterpolator for LinearInterpolator)?

I.e. CalibrationInterpolator, FlatFieldInterpolator, PedestalInterpolator etc.

maxnoe · 2024-12-04T10:44:46Z

Can also be done in a follow-up PR of course.

mexanick · 2024-12-04T10:47:58Z

> This looks good now. A remaining question would be if you want to add specific ChunkInterpolators (like PointingInterpolator for LinearInterpolator)?
>
> I.e. CalibrationInterpolator, FlatFieldInterpolator, PedestalInterpolator etc.

I'd consider having a factory then. I think just "CalibrationInterpolator" won't make much sense, but specific ones like "FFInterpolator" may. I'd address it in another PR.

maxnoe · 2024-12-04T10:49:21Z

Yes, factory makes sense for this
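
The factory idea could look roughly like the following. This is only a minimal sketch: the registry, the `register` decorator, `make_interpolator`, the `*Sketch` classes and the column names `relative_gain` and `pedestal` are all hypothetical and not part of ctapipe.

```python
from typing import Callable

# Hypothetical registry mapping a short name to an interpolator class.
_REGISTRY: dict[str, type] = {}


def register(name: str) -> Callable[[type], type]:
    """Class decorator that records an interpolator class under ``name``."""

    def decorator(cls: type) -> type:
        _REGISTRY[name] = cls
        return cls

    return decorator


class ChunkInterpolatorSketch:
    """Stand-in base class; not the real ctapipe ChunkInterpolator."""

    required_columns = frozenset({"start_time", "end_time"})


@register("flatfield")
class FlatFieldInterpolatorSketch(ChunkInterpolatorSketch):
    # Assumed column name, for illustration only.
    required_columns = ChunkInterpolatorSketch.required_columns | {"relative_gain"}


@register("pedestal")
class PedestalInterpolatorSketch(ChunkInterpolatorSketch):
    # Assumed column name, for illustration only.
    required_columns = ChunkInterpolatorSketch.required_columns | {"pedestal"}


def make_interpolator(name: str, **kwargs) -> ChunkInterpolatorSketch:
    """Build the interpolator class registered under ``name``."""
    try:
        cls = _REGISTRY[name]
    except KeyError:
        known = sorted(_REGISTRY)
        raise ValueError(f"Unknown interpolator '{name}', known: {known}") from None
    return cls(**kwargs)
```

With this layout, `make_interpolator("flatfield")` returns a `FlatFieldInterpolatorSketch`, and an unregistered name raises a `ValueError` listing the known names.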

maxnoe · 2024-12-04T11:14:02Z

src/ctapipe/monitoring/interpolation.py

+        super().__init__(**kwargs)
+        self._interpolators = {}
+        self.required_columns = ["start_time", "end_time"]
+        self.expected_units = {}


Why are these instance variables? And why are they empty?

This is not in line with how the other class works.

It seems this class variable is unused even.

Here required_columns shall become a class attribute, but the interpolators and expected units shall remain instance variables, to allow creation of multiple instances. We have to see in a future PR whether we want further specialization (e.g. a factory of VarNameInterpolators), which may lead us to change this (they would basically become singletons).

The expected units are unused because the quantity is dimensionless; we may want to actually enforce this through u.dimensionless_unscaled.

We had implemented it so that, for columns that are supposed to have no unit, we set the expected unit to None and check that the actual unit is equivalent. I did it like that.

mexanick · 2024-12-04T13:12:04Z

src/ctapipe/monitoring/interpolation.py

+    def __init__(self, h5file: None | tables.File = None, **kwargs: Any) -> None:
+        super().__init__(**kwargs)
+        self._interpolators = {}
+        self.required_columns = ["start_time", "end_time"]


The required columns for start and stop time shall be class attributes and shall be frozen; they are mandatory. You can copy them to an instance and extend them with the value column(s) when you invoke __call__.

As the functions that use required_columns are in the parent class and always look up that name, it makes more sense to have it as a modifiable instance variable.

No. The original design idea was to have the required_columns frozen per "final" class. I.e. MonitoringInterpolator requires altitude and azimuth columns.

This is due to the configuration system in ctapipe, which works on class name basis, not instances.

So, the most sensible way to keep the required units and columns as class variables is to have pedestal and flatfield interpolators that inherit from the ChunkInterpolator. The ChunkInterpolator now has no required columns or units, but rather the subclasses have those variables. If we use the FlatFieldInterpolator we know we will always look for a column with relative gain factors with no unit, similarly we know what data a PedestalInterpolator will need.

I made those classes, and if we want to use the Chunk interpolation later for some other data we can add another subclass.
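
A minimal sketch of that layout, assuming tables are plain dicts of columns: the requirements are declared once per class and only the loaded data lives on the instance. The `*Sketch` class names, the `add_table` method and the column name `relative_gain` are made up for illustration and are not the code in this PR.

```python
class ChunkInterpolatorSketch:
    """Stand-in base class: declares no value columns of its own."""

    # Frozen per class, so class-name-based configuration sees fixed requirements.
    required_columns: frozenset = frozenset({"start_time", "end_time"})
    expected_units: dict = {}

    def __init__(self):
        # Per-instance state, so several instances can hold different data.
        self._tables = {}

    def add_table(self, tel_id: int, table: dict) -> None:
        """Store ``table`` for ``tel_id`` after checking the required columns."""
        missing = self.required_columns - set(table)
        if missing:
            raise ValueError(f"Table is missing required columns: {sorted(missing)}")
        self._tables[tel_id] = table


class FlatFieldInterpolatorSketch(ChunkInterpolatorSketch):
    # Assumed column name; None marks a column that must carry no unit.
    required_columns = ChunkInterpolatorSketch.required_columns | {"relative_gain"}
    expected_units = {"relative_gain": None}
```

Two instances of `FlatFieldInterpolatorSketch` share the same requirements but keep separate tables, and a table without `relative_gain` is rejected by `add_table`.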

ctoennis · 2024-12-09T10:54:30Z

src/ctapipe/monitoring/interpolation.py

+        self.required_columns.update(columns)
+        self.required_columns = set(self.required_columns)
+        for col in columns:
+            self.expected_units[col] = None


Here I set the unit of the new columns to None, which is then enforced in the next line by _check_tables. This way we ensure the values have no unit.
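
The idea of "an expected unit of None means no unit" can be sketched as follows. This is a simplified stand-in, not the `_check_tables` from this PR: `ColumnSketch` and `check_units` are hypothetical, units are plain strings, and the comparison is exact equality, whereas the real code works on astropy tables and checks unit equivalence.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class ColumnSketch:
    """Stand-in for a table column; the real code works on astropy tables."""

    values: list
    unit: Optional[str] = None


def check_units(table: dict, expected_units: dict) -> None:
    """Raise ValueError if a column's unit differs from the expected one.

    An expected unit of None means the column must carry no unit at all.
    """
    for name, expected in expected_units.items():
        actual = table[name].unit
        if expected is None and actual is not None:
            raise ValueError(f"Column '{name}' must have no unit, got '{actual}'")
        if expected is not None and actual != expected:
            raise ValueError(
                f"Column '{name}' has unit '{actual}', expected '{expected}'"
            )
```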

maxnoe · 2024-12-09T15:17:59Z

src/ctapipe/monitoring/interpolation.py

+
+        for column in self.columns:
+            self.values[tel_id][column] = input_table[column]
+            self.start_time[tel_id][column] = input_table["start_time"].to_value("mjd")


Why store start_time and end_time per column?
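
For illustration, a minimal sketch of the alternative: the chunk boundaries are stored once per telescope and only the values are stored per column. The class `ChunkLookupSketch` and its methods are hypothetical, times are plain MJD floats, and the selection rule (latest-starting chunk that covers the time, NaN otherwise) is an assumption about the intended behaviour, not the code in this PR.

```python
import math
from bisect import bisect_right


class ChunkLookupSketch:
    """Pick the value of the chunk whose [start, end] interval contains a time."""

    def __init__(self):
        # Times are stored once per telescope; only the values are per column.
        self.start_time = {}
        self.end_time = {}
        self.values = {}

    def add_table(self, tel_id, start_mjd, end_mjd, columns):
        """Store chunks sorted by start time; ``columns`` maps name -> values."""
        order = sorted(range(len(start_mjd)), key=start_mjd.__getitem__)
        self.start_time[tel_id] = [start_mjd[i] for i in order]
        self.end_time[tel_id] = [end_mjd[i] for i in order]
        self.values[tel_id] = {
            name: [vals[i] for i in order] for name, vals in columns.items()
        }

    def __call__(self, tel_id, mjd, column):
        """Return the value of the latest-starting chunk covering ``mjd``, else NaN."""
        starts = self.start_time[tel_id]
        ends = self.end_time[tel_id]
        # Walk back from the last chunk that starts at or before mjd.
        for i in range(bisect_right(starts, mjd) - 1, -1, -1):
            if mjd <= ends[i]:
                return self.values[tel_id][column][i]
        return math.nan
```

A lookup then needs only the telescope, the time and the column name, e.g. `lookup(1, 1.0, "pedestal")` after `lookup.add_table(1, [0.0, 2.0], [2.0, 4.0], {"pedestal": [10.0, 20.0]})`.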

maxnoe · 2024-12-09T15:18:02Z

src/ctapipe/monitoring/interpolation.py

+                raise ValueError(
+                    f"Column '{column}' not found in interpolators for tel_id {tel_id}"
+                )
+            result[column] = self._interpolators[tel_id][column](mjd)


Why use self._interpolators, why keep that around at all?

Why not just call self._interpolate_chunk(tel_id, column, mjd)?

read_table checks if _interpolators has already been set up for the given tel_id. _check_interpolators in MonitoringInterpolator also does that check and adds data from hdf5file if the interpolator is not set. I can move column to be an argument of _interpolate_chunk though.

src/ctapipe/monitoring/interpolation.py

src/ctapipe/monitoring/__init__.py

Christoph Toennis and others added 8 commits October 29, 2024 11:32

Adding the chunk function

3d879b9

Reverting to the previous implementation

c2ce42c

Changing some variables

c8f7ef0

Chaning to using scipy functions

bef9677

fixing docustring

cd4864f

Updating docustrings further

fb3a9a4

simplifying chunk interpolation

0f0b948

Refactor ChunkInterpolator and its tests

50f2791

ctoennis requested review from maxnoe and kosack October 31, 2024 11:27

ctoennis added the new functionality label Oct 31, 2024

ctoennis self-assigned this Oct 31, 2024

maxnoe reviewed Oct 31, 2024

add back base Interpolator to __all__

90115d6

Christoph Toennis added 3 commits October 31, 2024 12:55

adding changelog

f772b19

Merge branch 'ChunkFunction' of https://github.com/cta-observatory/ct…

e201356

…apipe into ChunkFunction

renaming changelog

907b6cd

ctoennis requested a review from maxnoe October 31, 2024 13:30

ctoennis mentioned this pull request Nov 1, 2024

Handling and applying calibration parameters #2635

Open

mexanick added the calibration label Nov 4, 2024

maxnoe reviewed Nov 18, 2024

Changing inheritance scheme

dc3b24c

maxnoe reviewed Nov 22, 2024

maxnoe reviewed Dec 4, 2024

removing provate data definition from parent class

a915d8d

maxnoe reviewed Dec 4, 2024

moving a variable

3423bd3

mexanick previously approved these changes Dec 4, 2024

mexanick requested a review from maxnoe December 4, 2024 11:05

maxnoe reviewed Dec 4, 2024

putting required units on ChunkInterpolator

496af8f

ctoennis dismissed mexanick’s stale review via 496af8f December 4, 2024 12:55

mexanick reviewed Dec 4, 2024

Christoph Toennis added 2 commits December 4, 2024 14:41

implementing reviewer comment

3b93770

making required_columns an instance variable

da86bf6

ctoennis commented Dec 9, 2024

making subclasses to ChunkInterpolator

50b9e84

maxnoe reviewed Dec 9, 2024

simplifying start_time and end_time

c11b6f2

mexanick reviewed Dec 10, 2024

adding data groups

b02b526

mexanick previously approved these changes Dec 11, 2024

mexanick reviewed Dec 11, 2024

mexanick requested a review from maxnoe December 12, 2024 09:59

Christoph Toennis added 2 commits December 12, 2024 11:17

adding child classes, making chunk function take arrays

16aa176

making the nan switch test check if the switch is done element-wise

e0c0004

ctoennis dismissed mexanick’s stale review via e0c0004 December 12, 2024 12:42

adding imports to __init__

1644388
