
Add marine infrastructure dataset and model config #49

Open · wants to merge 58 commits into base: master
Conversation

favyen2 (Collaborator) commented Oct 18, 2024

This adds dataset/model configs for solar farm and wind turbine models, along with corresponding inference pipelines.

It also:

  • Moves convert_satlas_webmercator_to_rslearn to one_off_projects, since it is no longer needed now that the datasets have been converted from the old format and WebMercator projection to the new rslearn format and UTM projection.
  • Adds the rslp.common.worker system, which launches workers that read tasks from a queue (a Google Pub/Sub topic) and execute them. In the future, all task systems (such as the Landsat and Sentinel-2 job launchers) should be converted to this system so that they share the same way of executing workers; each individual project then only needs to implement writing its tasks to the topic.
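The worker loop described above can be sketched roughly as follows. This is not the actual rslp.common.worker implementation: the in-memory queue.Queue stands in for a Google Pub/Sub subscription, and the task format and handlers mapping are hypothetical.

```python
import json
import queue


def run_worker(task_queue, handlers, results):
    """Pull JSON-encoded tasks from a queue and dispatch them by project name.

    In the real system the queue would be a Pub/Sub subscription, and a worker
    would ack each message only after the handler succeeds; here an in-memory
    queue.Queue stands in so the loop is easy to test.
    """
    while True:
        try:
            message = task_queue.get_nowait()
        except queue.Empty:
            # No more tasks; a real worker would keep polling the subscription.
            return
        task = json.loads(message)
        handler = handlers[task["project"]]
        results.append(handler(task["args"]))
        task_queue.task_done()
```

With this shape, a project like the Landsat or Sentinel-2 launcher would only need to publish JSON task messages to the topic; the worker side stays shared.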

favyen2 (Collaborator, Author) commented Oct 23, 2024

Changing back to draft; I'm going to add the work on deploying the prediction pipeline on Beaker here.

@favyen2 favyen2 marked this pull request as draft October 23, 2024 21:40
@favyen2 favyen2 requested a review from Hgherzog January 9, 2025 20:01
rslp/common/worker.py (Outdated)
```diff
@@ -104,7 +100,9 @@ def launch_job(
             config_path,
             "--autoresume=true",
         ],
-        constraints=Constraints(cluster=["ai2/jupiter-cirrascale-2"]),
+        constraints=Constraints(
+            cluster=["ai2/jupiter-cirrascale-2", "ai2/augusta-google-1"]
+        ),
```
Collaborator

Suggestion: make the cluster names constants, or better, a class containing all the cluster names.

Collaborator Author

I will add refactoring of this to #100 since that PR consolidates some of the Beaker stuff.

rslp/satlas/bkt.py
rslp/common/worker.py
rslp/satlas/data_sources.py (Outdated)
```python
    return cur_groups


class MonthlySentinel2(DataSource):
```
Collaborator

Would this code make more sense in rslearn? I would imagine there are other use cases for monthly Sentinel-2 data.

Collaborator

Same comment for the rest of the data sources here. Add integration tests as well.

Collaborator Author

Added tests. I think it makes sense for now to keep it in rslearn_projects; we will see how many applications use the same thing. If we do add it to rslearn, we should refactor it so we don't need this wrapper class; ideally it would be something in the QueryConfig.
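As a rough illustration of the kind of grouping a monthly wrapper around a data source performs, a helper could bucket candidate items by calendar month before selecting the best item from each bucket. The function name and item representation below are invented for illustration, not the code under review.

```python
from collections import defaultdict
from datetime import datetime


def group_items_by_month(items):
    """Bucket (item_name, sensing_time) pairs by calendar month.

    Hypothetical helper: a monthly wrapper could group candidate scenes like
    this and then pick, e.g., the least cloudy item from each bucket.
    """
    groups = defaultdict(list)
    for name, ts in items:
        groups[(ts.year, ts.month)].append(name)
    return dict(groups)
```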

```python
        return fname, json.load(f)


def apply_nms(
```
Collaborator

Add tests for the postprocessing functions.
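For context, point-based postprocessing like this is often implemented as greedy distance-based non-maximum suppression. The sketch below is an illustrative version of that technique, not the apply_nms under review; the function name and point representation are assumptions.

```python
import math


def nms_points(points, distance_threshold):
    """Greedy distance-based NMS over (x, y, score) point detections.

    Keep the highest-scoring point, drop any remaining point closer than
    distance_threshold to an already-kept point, and repeat.
    """
    kept = []
    for x, y, score in sorted(points, key=lambda p: -p[2]):
        # Keep this detection only if it is far enough from every kept one.
        if all(math.hypot(x - kx, y - ky) >= distance_threshold for kx, ky, _ in kept):
            kept.append((x, y, score))
    return kept
```

A test for such a function only needs a handful of hand-placed points with known distances and scores, which is what makes these postprocessing steps cheap to cover.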

```python
    applying the model globally but can be disabled for small regions to avoid
    the time to create the index.
    """
    dataset_config_fname = DATASET_CONFIG_FNAME.format(application=application.value)
```
Collaborator

Suggestion: split this into a couple of functions for readability.

```python
    # Smoothing is handled by a Go script.
    subprocess.check_call(
        [
            "rslp/satlas/scripts/smooth_point_labels_viterbi",
```
Collaborator

Why can't the smooth point labels be written in Python?

Collaborator Author

It may be feasible, I'm not sure; this is borrowed from https://github.com/allenai/remote-sensing-data-hub/blob/main/go_scripts/smooth_point_labels_viterbi.go and there is not much reason to rewrite it at this time, I think.
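For reference, the core of such a smoother is small enough to sketch in Python. This is a hypothetical two-state (absent/present) Viterbi pass, not a port of the Go script; the transition probability and emission model here are invented for illustration.

```python
import math


def viterbi_smooth(confidences, switch_prob=0.1):
    """Two-state Viterbi smoothing of per-timestep detection confidences.

    State 1 means "object present", state 0 means "absent". A score s emits
    with probability s under state 1 and (1 - s) under state 0; staying in the
    same state has probability (1 - switch_prob), switching has switch_prob.
    """
    eps = 1e-9
    stay, switch = math.log(1 - switch_prob), math.log(switch_prob)
    # Log-probability of each state at the first timestep.
    scores = [
        math.log(max(1 - confidences[0], eps)),
        math.log(max(confidences[0], eps)),
    ]
    backptrs = []
    for s in confidences[1:]:
        emit = [math.log(max(1 - s, eps)), math.log(max(s, eps))]
        new_scores, ptrs = [], []
        for state in (0, 1):
            cands = [scores[prev] + (stay if prev == state else switch) for prev in (0, 1)]
            best = max((0, 1), key=lambda prev: cands[prev])
            new_scores.append(cands[best] + emit[state])
            ptrs.append(best)
        scores = new_scores
        backptrs.append(ptrs)
    # Backtrack from the best final state.
    state = max((0, 1), key=lambda st: scores[st])
    path = [state]
    for ptrs in reversed(backptrs):
        state = ptrs[state]
        path.append(state)
    return path[::-1]
```

The effect is that a single low-confidence timestep surrounded by high-confidence ones is smoothed over rather than treated as the object disappearing and reappearing.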

```python
    # Apply the pipeline. It will ingest data and apply the model.
    # We disable rtree index so that it doesn't need an hour to create it.
    predict_pipeline(
        application=Application.MARINE_INFRA,
```
Collaborator

Would it make sense to have tests for the other applications as well?

Collaborator Author

Currently it is only MARINE_INFRA / WIND_TURBINE, and these are similar enough that I don't think we need to run both (the code is the same, just a different model; this test is just making sure the code runs correctly).

I think once we have SOLAR_FARM it could use its own test since there will be code specific to the segmentation.

3 participants