#239 Source detection and SNR computation code #301
base: main
Conversation
Hi @toshiyukimizuki,
Hi Toshiyuki,
This is a great start. I have a few comments, hopefully not too much of a bother to implement. Please let me know if you have any questions!
Cheers,
Max
from pyklip.kpp.stat.statPerPix_utils import get_image_stat_map_perPixMasking
from corgidrp.data import Dataset


def find_source(input_dataset: Dataset, psf=None, fwhm=3.5, nsigma_threshold=5.0):
We haven't so far been using type hints in the function calls, but I suppose we could. Should the type instead be corgidrp.data.Dataset?
Actually I think it should be fine for this to accept an Image as input rather than a dataset, since L4_to_TDA.py doesn't have to be scriptable with the walker. The input type should be corgidrp.data.Image (and the variable name should reflect that too).
    Args:
        input_dataset (Dataset): Input dataset containing images.
        psf (np.ndarray, optional): 2D PSF array. If None, a Gaussian PSF is generated.
        fwhm (float, optional): Full-width at half-maximum of the PSF.
specify the units here, I think it's "pixels", right?
@@ -0,0 +1,89 @@
import numpy as np
I think we should create a file called l4_to_TDA.py and put this there since it's an L4 to TDA processing step.
sn_source, xy_source = [], []


# Iteratively find sources above the SNR threshold
while np.nanmax(image_snmap) >= nsigma_threshold:
Could we modularize all these steps a bit more (e.g. the SNR detection, then the PSF scaling/subtraction)? I could imagine a case where we might want to use these steps with a bit more manual control. For example, the PSF's morphology will change close to the inner and outer edges of the field of view, so being able to adjust the input PSF could be helpful. I still think that we want it packaged nicely into a function like this, but having things be more modular would also allow us to carry out some of these steps manually.
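One way the split could look, as a rough sketch only (the function names, the identity SNR map, and the center-pixel PSF scaling are all illustrative, not the actual corgidrp implementation):

```python
import numpy as np

def detect_brightest(snmap, nsigma_threshold):
    """Return (snr, (y, x)) for the brightest SNR-map pixel, or None if below threshold."""
    idx = np.unravel_index(np.nanargmax(snmap), snmap.shape)
    snr = snmap[idx]
    return (snr, idx) if snr >= nsigma_threshold else None

def subtract_psf(image, psf, yx):
    """Subtract a center-pixel-scaled PSF at yx (assumes the PSF stamp fits in the image)."""
    y, x = yx
    hy, hx = psf.shape[0] // 2, psf.shape[1] // 2
    out = image.copy()
    scale = out[y, x] / psf[hy, hx]
    out[y - hy:y + hy + 1, x - hx:x + hx + 1] -= scale * psf
    return out

def find_sources_iterative(image, snmap_func, psf, nsigma_threshold=5.0):
    """Driver that alternates SNR-map computation, peak detection, and PSF subtraction.

    Keeping the three steps separate lets a caller swap in a different PSF
    (e.g. near the field-of-view edges) or run a single step manually.
    """
    sources = []
    while True:
        hit = detect_brightest(snmap_func(image), nsigma_threshold)
        if hit is None:
            break
        sources.append(hit)
        image = subtract_psf(image, psf, hit[1])
    return sources
```

The packaged `find_source` could then just call `find_sources_iterative` with the default PSF and SNR-map function, preserving the current one-call interface.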


# Store detected sources in FITS header
for i in range(len(sn_source)):
    input_dataset[0].pri_hdr[f'snyx{i:03d}'] = f'{sn_source[i]:5.1f},{xy_source[i][0]:4d},{xy_source[i][1]:4d}'
Let's add it to the extension header ext_hdr instead. The primary header will mostly be reserved for things set by the telescope.
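A rough sketch of the suggested change (a plain dict stands in here for the `image.ext_hdr` FITS header object, and the SNR/position values are made up; the keyword and value format are copied from the PR):

```python
sn_source = [7.3, 5.6]              # illustrative detected SNRs
xy_source = [(101, 87), (45, 120)]  # illustrative (x, y) positions

ext_hdr = {}  # stand-in for input_image.ext_hdr
for i in range(len(sn_source)):
    # Same 'snyxNNN' keyword and 'SNR,x,y' value format as the original loop,
    # just written to the extension header instead of the primary header.
    ext_hdr[f'snyx{i:03d}'] = (
        f'{sn_source[i]:5.1f},{xy_source[i][0]:4d},{xy_source[i][1]:4d}'
    )
```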
@@ -0,0 +1,173 @@
import os, glob, copy
Please take a look at the structure of the other test functions: we're using a pytest infrastructure, which requires functions that start with test_* and make specific assertion tests.
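A minimal sketch of that structure, with a module-level `test_*` function and explicit assertions (the noise simulation, injected source, and brightest-pixel "detection" are placeholders, not the real find_source call):

```python
import numpy as np

def test_injected_source_is_recovered():
    """pytest discovers this via the test_* prefix; it must assert, not print."""
    rng = np.random.default_rng(0)
    image = rng.normal(0.0, 1.0, (64, 64))  # simple Gaussian noise field
    image[32, 40] += 50.0                   # inject a very bright source

    # In the real test this would call corgidrp's find_source(); here the
    # brightest pixel stands in as a placeholder detection.
    y, x = np.unravel_index(np.nanargmax(image), image.shape)
    assert (y, x) == (32, 40)
```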
nondetection = np.vstack((nondetection, np.delete(np.column_stack((sn_rand, y_rand, x_rand)), idx1, axis=0)))
misdetection = np.vstack((misdetection, np.delete(snyx, idx2, axis=0)))


# Print summary of detection results
At the end here, rather than printing or plotting things, we should make assertion statements to show that we've recovered the input x, y positions and the expected SNR.
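For example, the printed summary could become assertions along these lines (the arrays and tolerances below are illustrative; in the test they would come from the injection and detection steps):

```python
import numpy as np

# Each row is (snr, y, x); stand-ins for the test's injected/recovered lists.
injected = np.array([[8.0, 30.0, 40.0], [6.5, 70.0, 55.0]])
recovered = np.array([[7.8, 30.0, 41.0], [6.9, 69.0, 55.0]])

# Positions should match to within ~1 pixel and SNRs to within ~20%.
assert np.all(np.abs(recovered[:, 1:] - injected[:, 1:]) <= 1.0)
assert np.all(np.abs(recovered[:, 0] / injected[:, 0] - 1.0) < 0.2)
```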
nsigma_threshold = 5.0


# Load the dataset from GPI data file
file_gpi = mockfilepath+'NoCompNoDisk-GPIcube.fits'
I actually quite like what you've done here, but in order to keep the size of the repository small we're trying to minimize the number of data files that we add. Rather than adding more data to the repository, we could just simulate a noise field here. For this level of test I think it's OK if the noise is just Gaussian, or if you wanted a radially changing Gaussian noise field that could be fine too. For the end-to-end tests we might have higher-fidelity speckle residual fields.
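A radially varying Gaussian noise field along those lines takes only a few lines of numpy (the linear radial profile and its parameters are illustrative):

```python
import numpy as np

def simulate_noise_field(shape=(128, 128), sigma0=1.0, slope=0.02, seed=1):
    """Gaussian noise whose sigma grows linearly with radius from the image
    center; a lightweight stand-in for a speckle-residual field."""
    rng = np.random.default_rng(seed)
    yy, xx = np.indices(shape)
    cy, cx = (shape[0] - 1) / 2.0, (shape[1] - 1) / 2.0
    sigma = sigma0 * (1.0 + slope * np.hypot(yy - cy, xx - cx))
    return rng.normal(0.0, 1.0, shape) * sigma, sigma
```

Returning the sigma map alongside the field also gives the test a known ground-truth noise level to compute expected SNRs against.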
# Categorize results into detections, non-detections, and misdetections
detection = np.vstack((detection, np.hstack((np.column_stack((sn_rand, y_rand, x_rand))[idx1], snyx[idx2]))))
nondetection = np.vstack((nondetection, np.delete(np.column_stack((sn_rand, y_rand, x_rand)), idx1, axis=0)))
misdetection = np.vstack((misdetection, np.delete(snyx, idx2, axis=0)))
I like that you're injecting sources below the SNR threshold to make sure we don't detect them - that's a good test of the threshold.
inputflux_rand *= 1.5  # somehow needed ?
##### ##### #####
x_rand = radius_rand * np.cos(np.radians(pa_rand)) + dataset_center[1]
y_rand = radius_rand * np.sin(np.radians(pa_rand)) + dataset_center[0]
I don't think we want to randomly inject point sources with random SNRs; we need to define them specifically so that we can show that we can recover them.
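Concretely, the injection could use a fixed list of target SNRs and positions instead of random draws; a minimal sketch (delta-function injections into unit-sigma noise, with illustrative values):

```python
import numpy as np

rng = np.random.default_rng(0)
noise_sigma = 1.0
image = rng.normal(0.0, noise_sigma, (128, 128))  # unit-sigma noise field
noise_only = image.copy()                         # kept for verification

# Fixed injection list: (target SNR, y, x). One source sits below the
# 5-sigma threshold to confirm the detector does NOT report it.
targets = [(10.0, 40, 40), (7.0, 80, 60), (3.0, 60, 100)]
for snr, y, x in targets:
    image[y, x] += snr * noise_sigma  # peak amplitude fixed by target SNR
```

With the targets fixed, the test can assert exact recovery of the two bright sources and the non-detection of the 3-sigma one, rather than characterizing random draws statistically.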
Describe your changes
Submission of code to create snmap and do point source detection, as well as code to test it
corgidrp/find_source.py
tests/test_find_source.py
tests/test_data/NoCompNoDisk-GPIcube.fits
Reference any relevant issues (don't forget the #)
#239
Checklist before requesting a review