
[WIP] Add Panoptic Quality (PQ) #408

Draft · wants to merge 43 commits into main

Conversation

@NielsRogge (Contributor) commented Jan 30, 2023

This PR adds the panoptic quality (PQ) metric, based on the original implementation.
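For reference, panoptic quality (as defined in the original paper) matches predicted and ground truth segments of the same class when their IoU is strictly greater than 0.5, and is then computed as

$$\mathrm{PQ} = \frac{\sum_{(p, g) \in TP} \mathrm{IoU}(p, g)}{|TP| + \tfrac{1}{2}|FP| + \tfrac{1}{2}|FN|}$$

i.e. the average IoU of matched segment pairs, penalized by unmatched predictions (FP) and unmatched ground truth segments (FN).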

Unlike most metrics, which only require two inputs to the add_batch method (the predictions and the references), the panoptic quality metric requires two additional inputs: the predicted segments_info and the ground truth segments_info, which carry extra information about the predicted and ground truth segmentation maps, respectively.
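Roughly, usage would look like the sketch below; the loading path and the exact keyword names for the two extra inputs are placeholders, not the final API:

```python
import evaluate

# Hypothetical loading path; the final name/location of the metric is still to be decided.
metric = evaluate.load("panoptic_quality")

# Placeholders: each segmentation map is an (h, w) array of segment ids, and each
# segments_info is a list of dicts describing the segments in the corresponding map.
metric.add_batch(
    predictions=predicted_segmentation_maps,
    references=ground_truth_segmentation_maps,
    predicted_segments_info=predicted_segments_info,
    reference_segments_info=ground_truth_segments_info,
)
results = metric.compute()
```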

Refer to this notebook for evaluating Mask2Former on the COCO panoptic validation set using this metric.

To do:

  • Decide on the API: which keys should be in the ground truth and predicted segments_info? Ideally both should contain the same keys.
  • Support multiprocessing, which currently doesn't work.

@NielsRogge requested a review from alaradirik on January 30, 2023 14:47
@HuggingFaceDocBuilderDev

The docs for this PR live here. All of your documentation changes will be reflected on that endpoint.

@NielsRogge marked this pull request as draft on January 30, 2023 14:53
@alaradirik

@NielsRogge thank you for working on this! I think it'll really make it easier to use the segmentation models as well.

I'm guessing researchers will evaluate their models on public datasets such as COCO and ADE20K, and other users will use their custom datasets and will want to evaluate using a minimal setup. My proposal is as follows:

  • Ground truth annotations include id, category_id, iscrowd, and optionally area (set to None by default) keys. If bbox is included in the dictionary, we simply ignore it.
  • The iscrowd key is very specific to the COCO dataset; perhaps we can look for a was_fused key and map the iscrowd key to it if the was_fused key is not included in the ground truth annotations.
  • If there is no area key in the ground truth or prediction annotations, we perform Connected Components Analysis on each (h, w) panoptic segmentation map to compute the area of each instance (fused or unfused). cv2.connectedComponentsWithStats returns the area of each instance; see the sketch after this list.
  • Predicted segments include the id, category_id, was_fused and optionally the area key.
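A rough sketch of what that area fallback could look like (the helper name and the per-segment loop are my own illustration; only cv2.connectedComponentsWithStats is from the proposal above):

```python
import cv2
import numpy as np

def compute_segment_areas(panoptic_map: np.ndarray) -> dict:
    """Pixel area of every segment id in an (h, w) panoptic segmentation map."""
    areas = {}
    for segment_id in np.unique(panoptic_map):
        mask = (panoptic_map == segment_id).astype(np.uint8)
        # connectedComponentsWithStats returns (num_labels, labels, stats, centroids);
        # stats[i, cv2.CC_STAT_AREA] is the pixel area of component i (row 0 is the background).
        _, _, stats, _ = cv2.connectedComponentsWithStats(mask)
        areas[int(segment_id)] = int(stats[1:, cv2.CC_STAT_AREA].sum())
    return areas
```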

I'm in favor of changing label_id to category_id in our post-processing methods and keeping was_fused as it is, since iscrowd is not a descriptive attribute name and users would have a harder time preparing their custom datasets for evaluation.

cc @amyeroberts

@NielsRogge (Contributor Author) commented Feb 1, 2023

A few remarks:

> Ground truth annotations include id, category_id, iscrowd, and optionally area (set to None by default) keys. If bbox is included in the dictionary, we simply ignore it.

I'm not sure it's possible to define optional features for the inputs of add_batch, cc @lvwerra. Here, we'd like to have an optional key in the reference_annotations.

> If there is no area key in the ground truth or prediction annotations, we perform Connected Components Analysis on each (h, w) panoptic segmentation map to compute the area of each instance (fused or unfused). cv2.connectedComponentsWithStats returns the area of each instance.

I would definitely avoid adding cv2 as a dependency, as this library is pretty painful to install.
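For what it's worth, the same per-segment areas can be computed with plain numpy, so the fallback wouldn't strictly need cv2 (a minimal sketch, not part of this PR):

```python
import numpy as np

def compute_segment_areas(panoptic_map: np.ndarray) -> dict:
    """Pixel area of every segment id in an (h, w) panoptic segmentation map, without cv2."""
    segment_ids, counts = np.unique(panoptic_map, return_counts=True)
    return {int(segment_id): int(count) for segment_id, count in zip(segment_ids, counts)}
```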

> Predicted segments include the id, category_id, was_fused and optionally the area key.

The was_fused key isn't used by the metric either, for the moment.

Ideally the same keys should be present in the ground truth and predicted segments_info (it's a bit weird to have different keys in both).

@lvwerra (Member) commented Feb 1, 2023

> I'm not sure it's possible to define optional features for the inputs of add_batch, cc @lvwerra. Here, we'd like to have an optional key in the reference_annotations.

Yes, you can! The features can be a list of different formats and evaluate figures out which one matches the provided data (check out bleu for example). You will just need to be consistent across all batches.
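A minimal sketch of how that could look for this metric, following the same list-of-Features pattern as bleu (the concrete feature types below are assumptions, not the agreed-upon API):

```python
import datasets

# Two accepted formats for the reference segments_info: one with the optional "area"
# key and one without. evaluate picks whichever matches the data passed in, as long
# as the format stays consistent across batches. predicted_segments_info would be
# defined analogously.
segments_info_with_area = datasets.Sequence(
    {
        "id": datasets.Value("int64"),
        "category_id": datasets.Value("int64"),
        "area": datasets.Value("int64"),
    }
)
segments_info_without_area = datasets.Sequence(
    {
        "id": datasets.Value("int64"),
        "category_id": datasets.Value("int64"),
    }
)

features = [
    datasets.Features(
        {
            "predictions": datasets.Image(),
            "references": datasets.Image(),
            "reference_segments_info": segments_info_with_area,
        }
    ),
    datasets.Features(
        {
            "predictions": datasets.Image(),
            "references": datasets.Image(),
            "reference_segments_info": segments_info_without_area,
        }
    ),
]
```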

@lvwerra (Member) commented Mar 14, 2023

Let me know if you need another review on this @NielsRogge.

@NielsRogge (Contributor Author)

The metric is actually in a ready state; the final API just needs to be decided (i.e. which keys need to be in the predicted vs. ground truth annotations). cc @alaradirik

@alaradirik commented Mar 15, 2023

> The metric is actually in a ready state; the final API just needs to be decided (i.e. which keys need to be in the predicted vs. ground truth annotations). cc @alaradirik

Sorry about that, I thought I replied to your remarks!

I'm in favor of excluding keys that are not used for the metric computation - iscrowd/was_fused, area, bbox. Users can still include these in the ground truth annotations for the sake of convenience and we can drop them during the actual computation.

So the ground truth annotations would have the id, category_id, and optionally the iscrowd, area and bbox keys, and the predictions would have the id and category_id keys. We can exclude the optional / unused keys from the documentation and simply add a remark that additional metadata will be ignored.
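Concretely, the annotations would then look something like this (illustrative values only):

```python
ground_truth_segments_info = [
    # iscrowd, area and bbox may be present but would be dropped before computation
    {"id": 1, "category_id": 17, "iscrowd": 0, "area": 53408},
    {"id": 2, "category_id": 25},
]
predicted_segments_info = [
    {"id": 1, "category_id": 17},
    {"id": 2, "category_id": 25},
]
```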

What do you think? @NielsRogge @lvwerra @amyeroberts
