Enable patchwise training and prediction #135

davidwilby · 2024-11-07T12:51:51Z

Hey @tom-andersson - at long last, here is the patchwise training and prediction feature that @nilsleh and @MartinSJRogers have been working on.

This PR adds patching capabilities to DeepSensor during training and inference.

Training

Optional args patching_strategy, patch_size, stride and num_samples_per_date are added to TaskLoader.__call__.

There are two available patching strategies: random_window and sliding_window. The random_window option randomly selects points in the x1 and x2 extent as the centroids of the patches; the number of patches is set by the num_samples_per_date argument. The sliding_window option starts in the top left of the dataset and steps from left to right and top to bottom over the data using the user-defined patch_size and stride.

TaskLoader.__call__ now contains additional conditional logic depending upon the patching strategy selected. If no patching strategy is selected, task_generator() runs exactly as before. If random_window (or sliding_window) is selected, the bounding boxes for the patches are generated by the sample_random_window() (or sample_sliding_window()) method. The bounding boxes are appended to the list bboxes and passed to task_generator().

Within task_generator(), after the sampling strategies are applied, the data is spatially sliced to each bbox in bboxes with the self.spatial_slice_variable() function.

When using a patching strategy, TaskLoader produces a list of tasks per date, rather than an individual task per date. A small change has been made to Task's summarise_str method to avoid an error when printing patched Tasks and to output more meaningful information.
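The change in return type described above (one task per date without patching, a list of tasks per date with patching) can be sketched as follows. The names make_tasks and spatial_slice, and the use of plain dicts and (x1, x2, value) tuples in place of DeepSensor's Task and xarray/pandas data, are hypothetical stand-ins for illustration only.

```python
from typing import Dict, List, Optional, Tuple, Union

BBox = Tuple[float, float, float, float]   # (x1_min, x1_max, x2_min, x2_max)
Point = Tuple[float, float, float]         # (x1, x2, value)
ToyTask = Dict[str, object]


def spatial_slice(points: List[Point], bbox: BBox) -> List[Point]:
    """Keep only the points that fall inside the bounding box (inclusive)."""
    x1_min, x1_max, x2_min, x2_max = bbox
    return [
        p for p in points
        if x1_min <= p[0] <= x1_max and x2_min <= p[1] <= x2_max
    ]


def make_tasks(
    date: str, points: List[Point], bboxes: Optional[List[BBox]] = None
) -> Union[ToyTask, List[ToyTask]]:
    """Return one task when no patching is requested, else one task per bbox."""
    if bboxes is None:
        return {"time": date, "bbox": None, "Y": list(points)}
    return [
        {"time": date, "bbox": bbox, "Y": spatial_slice(points, bbox)}
        for bbox in bboxes
    ]
```

Callers that iterate over dates therefore need to handle a nested list when a patching strategy is active, which is why downstream code such as the summary printing had to be adjusted.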

Inference

To run patchwise predictions, a new method has been created in model.py called predict_patch(). This method iterates over the patched tasks and applies the pre-existing predict() method to each one. The predict() method has not been changed. Within each iteration, prior to running predict(), the bounding box of the patch is unnormalised so that the X_t of the patch can be passed to the predict() function. The patchwise predictions are stored in the list preds for subsequent stitching.

It is only possible to use the sliding_window patching function during inference, and the stride and patch size are defined when the user generates the test tasks within the task_loader() call. The data_processor must also be passed to the predict_patch() method to enable unnormalisation of the coordinates of the bboxes in model.py.
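As a rough sketch of the loop described above, assuming a simple linear mapping between raw and normalised coordinates: the helper names (unnormalise_bbox, predict_patchwise), the dict-based tasks, and the predict_fn callable are hypothetical and stand in for the data_processor and model.predict() used in the PR. The single-date guard mirrors the NotImplementedError mentioned under Limitations.

```python
from typing import Callable, Dict, List, Tuple

BBox = Tuple[float, float, float, float]   # (x1_min, x1_max, x2_min, x2_max)
Range = Tuple[float, float]                # raw values mapped to normalised 0 and 1


def unnormalise_bbox(bbox: BBox, x1_raw: Range, x2_raw: Range) -> BBox:
    """Map a normalised bbox back to raw coordinates (assumes a linear map)."""

    def to_raw(value: float, raw_range: Range) -> float:
        lo, hi = raw_range
        return lo + value * (hi - lo)

    x1_min, x1_max, x2_min, x2_max = bbox
    return (
        to_raw(x1_min, x1_raw), to_raw(x1_max, x1_raw),
        to_raw(x2_min, x2_raw), to_raw(x2_max, x2_raw),
    )


def predict_patchwise(
    patch_tasks: List[Dict[str, object]],
    predict_fn: Callable[[Dict[str, object], BBox], object],
    x1_raw: Range,
    x2_raw: Range,
) -> List[object]:
    """Run predict_fn once per patch and collect the results for stitching."""
    dates = {task["time"] for task in patch_tasks}
    if len(dates) > 1:
        raise NotImplementedError("patchwise prediction supports a single date")
    preds = []
    for task in patch_tasks:
        raw_bbox = unnormalise_bbox(task["bbox"], x1_raw, x2_raw)
        preds.append(predict_fn(task, raw_bbox))  # existing predict logic, unchanged
    return preds
```

Keeping the per-patch call as an opaque predict_fn reflects the point made above that predict() itself is not modified; only the surrounding pre- and post-processing is new.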

Once the list of patchwise predictions is generated, stitch_clipped_predictions() is used to form a prediction at the original X_t extent. Currently, functionality is provided to subset or clip each patchwise prediction so there is no overlap between adjacent patches, and then merge the patches using xr.combine_by_coords(). The modular nature of the code means there is scope for additional stitching strategies to be added after this PR, for example applying a weighting function to overlapping predictions. To ensure the patches are clipped by the correct amount, get_patch_overlap() calculates the overlap between adjacent patches. stitch_clipped_predictions() also contains code to handle patches at the edge or bottom of the dataset, where the overlap may be different.
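The clip-then-merge idea can be shown in one dimension with plain lists. This is only a sketch under stated assumptions: patches are given as (start_index, values) pairs sorted by start, the overlap is split between the two neighbours, and the helper names (get_overlap, stitch_clipped) are hypothetical rather than the PR's get_patch_overlap() and stitch_clipped_predictions().

```python
from typing import List, Sequence, Tuple

Patch = Tuple[int, Sequence[float]]  # (start index on the full grid, predicted values)


def get_overlap(a_start: int, a_len: int, b_start: int) -> int:
    """Number of grid cells shared by patch a and the following patch b."""
    return max(0, a_start + a_len - b_start)


def stitch_clipped(patches: List[Patch]) -> List[float]:
    """Clip overlapping cells from adjacent patches, then concatenate."""
    stitched: List[float] = []
    for i, (start, values) in enumerate(patches):
        lo, hi = 0, len(values)
        if i > 0:
            prev_start, prev_values = patches[i - 1]
            overlap = get_overlap(prev_start, len(prev_values), start)
            lo = overlap - overlap // 2   # this patch drops the leading share
        if i < len(patches) - 1:
            next_start, _ = patches[i + 1]
            overlap = get_overlap(start, len(values), next_start)
            hi = len(values) - overlap // 2  # and the trailing share
        stitched.extend(values[lo:hi])
    return stitched
```

Because the overlap is computed per pair of neighbours, a final patch that was shifted back to fit the grid (and so overlaps more than the others) is clipped by its own, larger amount.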

The output from predict_patch() is the same kind of DeepSensor object as that produced by model.predict(), so DeepSensor's plotting functionality can subsequently be used in the same way.

Documentation and Testing

New notebook(s) are added illustrating the usage of both patchwise training and prediction.

New tests are added to verify the new behaviour.

Limitations

Patchwise prediction does not currently support predicting at more than one timestamp; calling predict_patch with more than one date raises a NotImplementedError.
predict_patch is a new, distinct function because of all the pre-processing it needs to do. The patchwise behaviour may be better served as an option in predict; let me know what you think.
Patched tasks don't exactly follow the proportions given by patch_size: e.g. for a 'square' patch with patch_size=(0.5, 0.5), the resulting dimensions won't be exactly square. This is accounted for when stitching patches, but is slightly inelegant at the moment, so we may want to come back and find a more refined solution in the future.
In test_model.test_patchwise_prediction I've temporarily commented out the asserts checking for the correct prediction shape. These fail with the test datasets for now, but with real datasets the shapes are correct; see the patchwise_training_and_prediction.ipynb notebook.

tom-andersson

Thanks @davidwilby, looks much improved and nearly ready to LGTM. Two high-level comments:

Can you fix the failing unit tests?
For readability and maintenance, could you move the additions to TaskLoader and DeepSensor model into subclasses to improve encapsulation of the patching functionality? I'm realising this PR makes the standard API more complicated, which may be offputting for users who don't require patching, and also makes the code for those classes trickier to parse. How about a PatchTaskLoader and DeepSensorPatchwiseModel? With no other changes to the class hierarchy I think we'd need ConvNP to subclass DeepSensorPatchwiseModel instead of the standard DeepSensorModel, though there may be a more elegant solution where the user decides whether to set up a model that predicts with patches through the ConvNP interface. WDYT?
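To make the subclassing suggestion concrete, here is a toy sketch of how patching could live in a subclass while the base loader keeps its simple interface. BaseTaskLoader is a stand-in written for this example, not DeepSensor's TaskLoader, and the PatchTaskLoader constructor taking a fixed list of bboxes is an assumption.

```python
from typing import Dict, List, Sequence, Tuple

BBox = Tuple[float, float, float, float]


class BaseTaskLoader:
    """Stand-in for the standard loader: one task per date, no patching args."""

    def __call__(self, date: str) -> Dict[str, object]:
        return {"time": date, "bbox": None}


class PatchTaskLoader(BaseTaskLoader):
    """Hypothetical subclass that owns all patching-specific state and logic."""

    def __init__(self, bboxes: Sequence[BBox]):
        self.bboxes = list(bboxes)

    def __call__(self, date: str) -> List[Dict[str, object]]:
        tasks = []
        for bbox in self.bboxes:
            task = super().__call__(date)  # reuse the unpatched task construction
            task["bbox"] = bbox
            tasks.append(task)
        return tasks
```

With this split, users who never need patching see only the base class signature, and the list-of-tasks return type is confined to the subclass.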

tom-andersson · 2024-12-29T19:03:19Z

deepsensor/model/model.py

+        )
+
+        ## Cast prediction into DeepSensor.Prediction object.
+        # TODO make this into seperate method.


pred.assign isn't really what you want for stitching predictions together because that method just assigns data in bulk to the xarray or pandas objects.

TBH I've stared at this for a while and I'm not sure what you mean by 'copying one of the patch predictions and extending it' or where I should be looking, partly because predict_patchwise is difficult to follow with all the nested functions. In particular, I don't understand what's being done with prediction below and why we can't just return stitched_prediction.

As I was typing, I realised that stitch_clipped_predictions is returning a dict, not a Prediction, and that the code below has a bunch of redundant lines and is simply overwriting the entries of a copy of the first patch Prediction with the xarray objects in stitched_predictions. I think you can simplify and resolve this if you resolve my comment above about making stitch_clipped_predictions return a Prediction directly.

nilsleh and others added 30 commits October 10, 2023 19:01

stach changes

131c434

draft

3342b96

draft

b7cf3fa

merge main

70f3783

wrong merge

379e3b2

incorporate some of the feedback

85cd34b

run black

be8fffd

merge main

3415377

merge main

39dd15b

layout code

876970e

change __call__

d1cb338

revert

218f791

type annotation

37fe771

patch_size sampling test

fb20ccc

patchwise test trainer

5bda80b

gridded window patching

c276844

adding sliding window patching function

fde7e02

loader with bboxes

195a923

loader with boxes

824df24

Altering kwargs to enable for-loop and change sliding function

e6e1ae8

Merge branch 'patchwise_train' into msjr/patching

e75d022

move logic to call

bae0855

Merge branch 'main' into patchwise_train

a090d34

Merge branch 'patchwise_train' into msjr/patching

797f48e

Merge pull request #1 from nilsleh/msjr/patching

5291ec3

Sliding window patching

typo

7b09119

notebook with patchwise train

282c2be

refining stride to avoid error

dfa386d

inference patching

8d46653

predict_patches

acbad8b

Martin Rogers and others added 10 commits December 3, 2024 11:02

remove the +1 to prevent Nan lines forming

e68c01a

add some comments

8b9a8ac

Merge pull request #16 from davidwilby/refactor_sample_sliding

d620e88

Refactor `sample_sliding_window`

Update deepsensor/model/model.py

3c1c1c8

Co-authored-by: David Wilby <[email protected]>

linting

e601109

tweak comments

a86ce31

re-enable size checking in test

c7a994e

rename some variables for slightly improved readability; add typehints

e5b580b

Merge pull request #18 from davidwilby/replace_combineByCoords_with_m…

91b83ce

…erge Replace combine_by_coords with np.where() to stich patched predictions

Merge pull request #17 from davidwilby/predict_args

158b6dc

Tidy up patchwise prediction arguments

davidwilby requested a review from tom-andersson December 20, 2024 14:32

tom-andersson requested changes Dec 29, 2024

davidwilby and others added 18 commits January 8, 2025 11:43

reduce large comment block to easier to follow inline comments

747f7dd

remove unused hypothesis dependency

4f5eead

remove todo

322766f

move coord direction calcuation to where it is needed

b4e9ff5

clean up markup

572d7ec

Reduce repitiion and place code to determine coordinate extent in one…

da2f68f

… method

Create DeepSensor object straight after stitching

c2f0ffe

Slightly amend some mark up text

9a7e743

Editted text for get_coordinate_extent_method

e857355

Edit where time is defined in stitched prediction object

358b884

Reduce for loops and keep predictions as deepsensor.prediction objects

58e9076

Update deepsensor/model/model.py

9943e99

Co-authored-by: David Wilby <[email protected]>

Merge pull request #19 from davidwilby/simplify_stitching

53ee50f

Simplify stitching process

Update deepsensor/model/model.py

6cf0a28

Co-authored-by: David Wilby <[email protected]>

Merge pull request #20 from davidwilby/simplify_stitching_retain_pred…

1f0fb32

…iction_objects Simplify stitching by retaining prediction objects

lint

be883dc

use python 3.8 compatible typehint

b0459e8

correct type hint

9765787
