Implement covariant kernels #40
Merged
Conversation
Initial test and example appear to be working well
Should enable us to use one type for both covariance cases
The `RealSpaceKernel` implementation is working and showing good results. It is only slightly slower than the `PixelSpaceKernel`. Could get rid of the pixel-space implementation altogether, but going to keep it for now. Next I will refactor `bin_cmd_smooth` and `partial_cmd_smooth` to add the new real-space kernel.
For `Float32/64` we can use `VectorizationBase.verf` to get a ~2x speedup in `addstar!` which should be worth it as `VectorizationBase` is a dependency of `LoopVectorization` already. To support this specialization, I was able to break the main `erf` calls out into `gaussian_int_general` so that the kernel `evaluate` methods for both `GaussianPSFAsymmetric` and `GaussianPSFCovariant` now call into this single generic function. We are still keeping a fallback for general `Real` inputs to `SpecialFunctions.erf` which supports more datatypes. Also added some docstrings and cleaned up some code syntax throughout.
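A minimal sketch of the specialization pattern described above, with stand-in names: a generic entry point with a fast method for hardware floats and a fallback for other `Real` types. `approx_erf` uses the Abramowitz & Stegun 7.1.26 rational approximation purely so this example is self-contained; the package instead dispatches to `VectorizationBase.verf` (fast path) and `SpecialFunctions.erf` (generic fallback), and `gaussian_int` here is only an illustration of `gaussian_int_general`, not its actual signature.

```julia
# Stand-in erf: Abramowitz & Stegun 7.1.26, |error| < 1.5e-7.
# The package would call VectorizationBase.verf here instead.
function approx_erf(x::Float64)
    s = sign(x)
    x = abs(x)
    t = 1 / (1 + 0.3275911 * x)
    poly = t * (0.254829592 + t * (-0.284496736 +
           t * (1.421413741 + t * (-1.453152027 + t * 1.061405429))))
    return s * (1 - poly * exp(-x^2))
end

# Generic entry point: integral of a standard normal PDF from a to b.
# Specialized fast path for hardware floats; a second method for
# general Reals would fall back to SpecialFunctions.erf.
gaussian_int(a::T, b::T) where {T<:Union{Float32,Float64}} =
    (approx_erf(Float64(b) / sqrt(2)) - approx_erf(Float64(a) / sqrt(2))) / 2
```

With this pattern, both kernel `evaluate` methods can call the one generic function and still hit the fast path whenever the element type allows it.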
Additional 2x speed improvement due to better SIMD. This requires ensuring that the loop indices are all inbounds *prior* to starting the loop, whereas the old implementation lazily looped over all defined indices with an explicit `checkbounds()` inside the loop. `LoopVectorization.jl` and `VectorizationBase.jl` are at risk of being deprecated in Julia 1.11, but using them together in this application gives a 4x speedup for the hottest loop in the template creation process, so I think this is worth doing for now.
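A hypothetical sketch of the loop restructuring described above: intersect the kernel stamp's index range with the image axes *before* the loop, so the body needs no bounds checks and the compiler can vectorize it. `addstar_sketch!` and the flat kernel value are illustrative, not the package implementation.

```julia
function addstar_sketch!(img::Matrix{Float64}, x0::Int, y0::Int, half::Int)
    # Clamp the stamp extent to the image axes up front, so every index
    # touched inside the loop is guaranteed inbounds.
    xs = max(1, x0 - half):min(size(img, 1), x0 + half)
    ys = max(1, y0 - half):min(size(img, 2), y0 + half)
    (isempty(xs) || isempty(ys)) && return img  # stamp entirely off-image
    @inbounds for j in ys, i in xs
        img[i, j] += 1.0  # real code would evaluate the kernel here
    end
    return img
end
```

Because the ranges are proven non-empty and inbounds before the loop starts, the same structure also admits `@turbo` from LoopVectorization.jl in place of `@inbounds`.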
add third filter for additional tests
Tried to fold `bin_cmd_smooth` into `partial_cmd_smooth` for better integration, but it allocates massively, probably because the compiler cannot determine the type of some variable. Julia specializes at function boundaries, so these allocations go away when I call out to `bin_cmd_smooth` with the same arguments. Having trouble figuring out what is causing the allocations at the moment, so I might just resort to refactoring `bin_cmd_smooth` after all.
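A minimal illustration (with hypothetical names) of the function-barrier effect described above: when a variable's type cannot be inferred in the outer function, the hot loop allocates; passing the variable across a function boundary lets Julia specialize the inner function on its concrete runtime type, which removes the allocations.

```julia
# Inner function: compiled separately for each concrete type of `x`,
# so the loop inside `sum` is type-stable.
kernel_sum(data, x) = sum(d * x for d in data)

function outer(data)
    x = rand(Bool) ? 1.0 : 1   # type-unstable: x isa Union{Float64, Int}
    return kernel_sum(data, x) # function barrier: dispatch fixes the type
end
```

This is why calling out to `bin_cmd_smooth` avoids the allocations that appear when its body is inlined into `partial_cmd_smooth`.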
Initial results look pretty good on real data. Should probably make `cov_mult` mandatory in `bin_cmd_smooth` rather than optional -- address later. I think I might have a bug in `model_cmd`, as results on synthetic data processed with that function look worse; check later. The kernel implementation is a bit inefficient because the kernel is tilted in matrix space, so many pixels are evaluated that are far from the bulk of the kernel. Could fix this with some sort of sparsity-pattern mask.
At lower Hess diagram resolutions like `size(edges) = (75, 75)` there were significant artifacts in the template model with the new covariant kernel. Doubling the y-resolution to `length=150` gives a much better model. Not obvious why this is the case, as the integration over the pixels should make the solution resolution agnostic. Might have something to do with not resolving the skewed kernel shape if the grid is too coarse? Worth more investigation.
4x performance improvement with `@turbo` loop optimizations. Same approach as the previous commit where I added `@turbo` for `PixelSpaceKernel`: make sure loop iterators aren't empty, all loop indices are inbounds, etc.
Previously if you passed a direct `edges` argument to `calculate_edges` it would return it with no checks. Downstream functions that call this method now assume the output will be a `Tuple{<:AbstractRange, <:AbstractRange}`, so we have to check the input type and error if it is incorrect. This commit also cleans the docs for `bin_cmd`.
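A hedged sketch of the new validation (the helper name here is assumed, not the package's): verify that a user-supplied `edges` is a `Tuple` of two `AbstractRange`s, since downstream callers now rely on that type, and error otherwise.

```julia
function validate_edges(edges)
    # Downstream code assumes Tuple{<:AbstractRange, <:AbstractRange},
    # so reject anything else instead of passing it through unchecked.
    edges isa Tuple{<:AbstractRange, <:AbstractRange} ||
        throw(ArgumentError("edges must be a Tuple of two AbstractRanges; " *
                            "got $(typeof(edges))"))
    return edges
end
```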
Rather than the hard-coded 0.01 deltamag in the call to `mini_spacing`, use the calculated bin steps to set the required deltamag.
No need to pass all the other kws when we've already calculated edges
Forgot this was on by default, turn off to show raw data
When pixels are very large compared to the kernel size, the old offsets could become 0 (e.g., `div(1, 2) == 0`), which would cause failures.
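An illustration of the failure mode and one possible fix (names here are hypothetical): integer division can round a small kernel extent down to zero pixels, producing an empty stamp, while clamping the half-width to at least one pixel avoids it.

```julia
half_width_old(npix::Int) = div(npix, 2)          # div(1, 2) == 0 -> empty stamp
half_width_new(npix::Int) = max(div(npix, 2), 1)  # always covers >= 1 pixel
```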
better to use the rcparam
The old size was mostly adequate, but the x-axis width was a bit low, such that I could see the truncation of the kernel in the plots I'm making for the paper. Since the methods are well optimized, we can afford to increase the size of the kernel; making a single template still takes ~3 ms, so performance remains quite good.
The previous implementation failed to include the altered `deltax` in the integration across the bin/pixel; this was a shortcut that would work well at high grid resolutions (e.g., Hess diagram size=(75, 200)) but had major issues in lower-resolution grids. Re-derive the pixel integrals (see `notes/kernels.nb`) and implement a new method for `gaussian_psf_covariant`. The integral is not entirely analytic, but one axis can be integrated analytically. After analytically integrating along the x-axis, I use 3-point Gauss-Legendre integration along the y-axis to finish the pixel integration. The new residuals look much better and performance is comparable to the `GaussianPSFAsymmetric` implementation.
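A sketch of the 3-point Gauss-Legendre rule used to finish the pixel integral along y after the analytic x integration. This is a generic quadrature helper; the kernel-specific integrand from the package is not shown. The rule is exact for polynomials up to degree 5, which is why three points suffice per pixel.

```julia
# 3-point Gauss-Legendre quadrature of f over [a, b].
function gl3(f, a, b)
    m = (a + b) / 2   # midpoint of the pixel's y-extent
    h = (b - a) / 2   # half-width
    x = sqrt(3 / 5)   # nodes at m and m +/- h*sqrt(3/5); weights 8/9 and 5/9
    return h * (5 / 9 * (f(m - h * x) + f(m + h * x)) + 8 / 9 * f(m))
end
```

In the pixel integration, `f` would be the analytic x-integral of the covariant kernel evaluated at a fixed y.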
add `kernels` docpage and fix examples to work in docs build
This was an old experimental code for implementing the covariant Gaussian kernels that have now been added into the main package.
Codecov Report

@@           Coverage Diff           @@
##             main      #40   +/-   ##
==========================================
+ Coverage   79.69%   88.06%   +8.37%
==========================================
  Files          19       18       -1
  Lines        1374     1416      +42
==========================================
+ Hits         1095     1247     +152
+ Misses        279      169     -110
Fixes #30.
This is a major PR that separates the kernels we support for describing the 2-D probability distributions of observed magnitudes given intrinsic magnitudes into `RealSpaceKernel`s and `PixelSpaceKernel`s, depending on whether their fields (centroids, widths, etc.) are defined in magnitude (real) space or in the pixel space of the Hess diagram matrix.

This was added to enable the definition of the covariant kernel `GaussianPSFCovariant`, which is easier to describe in real-space coordinates than in pixel-space coordinates. This kernel is valid for Hess diagrams where the y-axis magnitude also appears in the x-axis color. The addition of this kernel greatly improves modelling accuracy for these types of Hess diagrams. A future refactor may be able to convert this into a `PixelSpaceKernel`, but the extra code necessitated by the implementation of `addstar!` for subtypes of `RealSpaceKernel` adds very little overhead (~200 ns per call), so this is not a priority.

A new example, `examples/templates/kernels_example.jl`, has been added showing the performance of this functionality. The output of this script is spliced into a new documentation page under the `fitting/internals` heading.

New tests, including of `partial_cmd_smooth`, have been added. Calling `partial_cmd_smooth` will necessarily call most of the other functions in the main "template creation" process. Many of these methods presently lack their own unit tests, but this is low priority for now.

There was also an issue with the old `data/isochrone.txt` which has now been fixed.

I added VectorizationBase.jl to the deps to get `AbstractSIMD` and `verf` for this PR, but now I don't think it is truly necessary. However, it is a dependency of LoopVectorization.jl, which I do need, so we would have to install it anyway. For now I'm fine keeping it but might remove it in the future.