You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
I am using layout-parser to detect bounding boxes related to tables in a pdf file.
When the coordinates are accessed in this way:
model = lp.Detectron2LayoutModel('lp://PubLayNet/faster_rcnn_R_50_FPN_3x/config',
extra_config=["MODEL.ROI_HEADS.SCORE_THRESH_TEST", 0.8],
label_map={0: "Text", 1: "Title", 2: "List", 3:"Table", 4:"Figure"})
# Detect tables
layout = model.detect(image_rgb)
for el in layout:
coords = el.coordinates
print(coords)
In what format are they returned? Could they be restitute in the format x1, y1, x2, y2 where (x1,y1) identifies the upper-left corner and (x2,y2) the lower-right corner and the origin of the coordinates is (0,0) in the lower-left corner?
The text was updated successfully, but these errors were encountered:
I am using layout-parser to detect bounding boxes related to tables in a pdf file.
When the coordinates are accessed in this way:
In what format are they returned? Could they be restitute in the format x1, y1, x2, y2 where (x1,y1) identifies the upper-left corner and (x2,y2) the lower-right corner and the origin of the coordinates is (0,0) in the lower-left corner?
The text was updated successfully, but these errors were encountered: