In what format are the coordinates of bounding boxes returned? #222

Sopralapanca · 2025-02-25T16:10:27Z

I am using layout-parser to detect bounding boxes related to tables in a pdf file.
When the coordinates are accessed in this way:

model = lp.Detectron2LayoutModel('lp://PubLayNet/faster_rcnn_R_50_FPN_3x/config', 
                                 extra_config=["MODEL.ROI_HEADS.SCORE_THRESH_TEST", 0.8],
                                 label_map={0: "Text", 1: "Title", 2: "List", 3:"Table", 4:"Figure"})

# Detect tables
layout = model.detect(image_rgb)

for el in layout:
  coords = el.coordinates
  print(coords)

In what format are they returned? Could they be restitute in the format x1, y1, x2, y2 where (x1,y1) identifies the upper-left corner and (x2,y2) the lower-right corner and the origin of the coordinates is (0,0) in the lower-left corner?

The text was updated successfully, but these errors were encountered:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

In what format are the coordinates of bounding boxes returned? #222

In what format are the coordinates of bounding boxes returned? #222

Sopralapanca commented Feb 25, 2025

In what format are the coordinates of bounding boxes returned? #222

In what format are the coordinates of bounding boxes returned? #222

Comments

Sopralapanca commented Feb 25, 2025