instead of one text block multiple text blocks #173

naarkhoo · 2023-03-11T13:45:37Z

I am using layoutparser '0.3.4' through

! pip install layoutparser torchvision && pip install "detectron2@git+https://github.com/facebookresearch/[email protected]#egg=detectron2" in colab

my model is

model = lp.models.Detectron2LayoutModel('lp://PubLayNet/mask_rcnn_X_101_32x8d_FPN_3x/config',
                                 extra_config=["MODEL.ROI_HEADS.SCORE_THRESH_TEST", 0.5],
                                 label_map={0: "Text", 1: "Title", 2: "List", 3:"Table", 4:"Figure"})

What I see is that block detection is too sensitive - meaning instead returning one block of text, the result is instead four blocks of text. The input is an article from pubmed.

What is the best practice in such case ?

labling additional data and fine tuning the model ?
post analysis using the coordinates ? (too hacky)
is there any other model that is less sensitive

The text was updated successfully, but these errors were encountered:

naarkhoo · 2023-03-11T13:51:41Z

seems it returns both the big text_block along with every line as a block

naarkhoo added the bug label Mar 11, 2023

naarkhoo closed this as completed Mar 11, 2023

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

instead of one text block multiple text blocks #173

instead of one text block multiple text blocks #173

naarkhoo commented Mar 11, 2023

naarkhoo commented Mar 11, 2023

instead of one text block multiple text blocks #173

instead of one text block multiple text blocks #173

Comments

naarkhoo commented Mar 11, 2023

naarkhoo commented Mar 11, 2023