Fix overflows up to at least tco1279 #162

lukasm91 · 2024-10-16T09:37:23Z

This is one more change that Olivier ported from my branch to his branch, and now I am taking it to develop.

The upper end of the sizes this allows is largely theoretical, because we can't run simulations at that scale. But even in real use cases we might see overflows without this change, and they can be very difficult to spot. With one rank, we can now grow to very large domains. With multiple ranks, we eventually run into overflows of the MPI buffers, but those are properly diagnosed.

This will conflict with #161, but the conflict should be trivial to resolve.

src/trans/gpu/algor/ext_acc.F90

lukasm91 · 2024-12-05T10:34:35Z

@samhatfield This is the second-to-last PR from the original GPU branch that is still missing, and I propose it for integration. It should be much more straightforward than the previous one. I have been using it for a while now without issues, and it is very useful because the overflows are very hard to find, and it is nice to be able to experiment on a single node/GPU.

samhatfield · 2024-12-05T11:03:32Z

Noted, thanks Lukas. I'll try to find time to look at this in the next week.

src/trans/gpu/external/setup_trans.F90

src/trans/gpu/internal/fsc_mod.F90

src/trans/gpu/algor/buffered_allocator_mod.F90

samhatfield

Looks good, just some minor comments.

A common pattern in this PR is something like

2_JPIB*D%NLENGT0B*KF_LEG*C_SIZEOF(PFBUF(1))

which involves a mixture of integer types. I don't usually do this in Fortran so I'm not sure what the rules are for casting in these cases. Can someone reassure me that it's all fine? Here we have JPIB * JPIM * C_SIZE_T.
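For reference, a minimal sketch of the rule in question. In standard Fortran, an intrinsic operation on two integers of different kinds yields the kind with the greater decimal exponent range, and a chain of `*` is parsed left to right, so a leading `2_JPIB` promotes every later product. The kinds below are stand-ins assumed to mirror the usual `JPIM`/`JPIB` definitions (`SELECTED_INT_KIND(9)` and `SELECTED_INT_KIND(12)`), and `nlen`, `kf_leg` and `pfbuf` are hypothetical values rather than anything taken from this PR.

```fortran
program mixed_kind_demo
  use, intrinsic :: iso_c_binding, only: c_double, c_sizeof
  implicit none

  ! Assumed stand-ins for the JPIM and JPIB kind parameters
  integer, parameter :: jpim = selected_int_kind(9)   ! typically 32-bit
  integer, parameter :: jpib = selected_int_kind(12)  ! typically 64-bit

  integer(kind=jpim)  :: nlen, kf_leg   ! hypothetical sizes
  real(kind=c_double) :: pfbuf(1)       ! hypothetical buffer element
  integer(kind=jpib)  :: nbytes

  nlen     = 200000_jpim
  kf_leg   = 2000_jpim
  pfbuf(1) = 0.0_c_double

  ! Parsed as ((2_jpib * nlen) * kf_leg) * c_sizeof(...).
  ! The first product is JPIB * JPIM, so it already has kind JPIB.
  if (kind(2_jpib * nlen) /= jpib) error stop "first product not promoted"

  ! c_sizeof returns integer(c_size_t); the final product has at least
  ! the range of JPIB, whichever of the two kinds the compiler picks.
  if (range(2_jpib * nlen * kf_leg * c_sizeof(pfbuf(1))) < range(1_jpib)) &
    error stop "full product has too small a range"

  nbytes = 2_jpib * nlen * kf_leg * c_sizeof(pfbuf(1))

  ! The result no longer fits into a JPIM integer.
  if (nbytes <= int(huge(1_jpim), jpib)) error stop "expected a large value"

  print '(a,i0)', "bytes = ", nbytes
end program mixed_kind_demo
```

The order matters: writing the same product as `nlen * kf_leg * 2_jpib` would evaluate `nlen * kf_leg` in the `jpim` kind first, which can overflow before any promotion happens. That is why the wide literal comes first in the pattern above.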

src/trans/gpu/internal/ftdir_mod.F90

src/trans/gpu/internal/ftinv_mod.F90

lukasm91 changed the title ~~Fix overflows up to tco1279~~ Fix overflows up to at least tco1279 Oct 16, 2024

lukasm91 mentioned this pull request Oct 16, 2024

Remove non-standard SIZEOF from GPU subtree #161

Merged

fix overflows

8622da1

lukasm91 force-pushed the fix-overflow branch from dd1e3bb to 8622da1 October 17, 2024 06:26

lukasm91 changed the base branch from main to develop October 17, 2024 06:30

Merge branch 'fix-overflow' into HEAD

27a35b6

wdeconinck reviewed Oct 29, 2024

src/trans/gpu/algor/ext_acc.F90

lukasm91 added 3 commits October 30, 2024 02:13

fix typo

8495f60

Merge commit '27a35b67' into fix-overflow

4b655f8

Merge remote-tracking branch 'public/develop' into fix-overflow

5a1d597