instead of one text block multiple text blocks #173

Closed
naarkhoo opened this issue Mar 11, 2023 · 1 comment
Labels
bug Something isn't working

Comments

@naarkhoo

I am using layoutparser 0.3.4 in Colab, installed with:

! pip install layoutparser torchvision && pip install "detectron2@git+https://github.com/facebookresearch/[email protected]#egg=detectron2"

My model is:

import layoutparser as lp

# Mask R-CNN trained on PubLayNet; detections scoring below 0.5 are discarded
model = lp.models.Detectron2LayoutModel('lp://PubLayNet/mask_rcnn_X_101_32x8d_FPN_3x/config',
                                 extra_config=["MODEL.ROI_HEADS.SCORE_THRESH_TEST", 0.5],
                                 label_map={0: "Text", 1: "Title", 2: "List", 3: "Table", 4: "Figure"})

What I see is that block detection is too sensitive: instead of returning one block of text, the model returns four blocks of text. The input is an article from PubMed.

What is the best practice in such a case?

  1. Labeling additional data and fine-tuning the model?
  2. Post-processing using the coordinates? (too hacky)
  3. Is there another model that is less sensitive?

image

naarkhoo added the bug label on Mar 11, 2023
@naarkhoo
Author

It seems the model returns both the large text block and every line inside it as a separate block.
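
If the extra detections are line-level boxes nested inside a larger text block, one possible workaround is the coordinate-based post-processing from option 2: keep the larger box and drop any box that lies almost entirely inside a box already kept. The sketch below is not part of layoutparser. It works on plain `(x1, y1, x2, y2)` tuples, and the helper names and the 0.9 containment threshold are assumptions chosen for illustration. With layoutparser, the tuples could presumably be taken from each detected block's `coordinates`.

```python
from typing import List, Tuple

Box = Tuple[float, float, float, float]  # (x1, y1, x2, y2), hypothetical box format


def area(box: Box) -> float:
    """Area of a box; degenerate or inverted boxes count as zero."""
    return max(0.0, box[2] - box[0]) * max(0.0, box[3] - box[1])


def contained_fraction(inner: Box, outer: Box) -> float:
    """Fraction of `inner`'s area that overlaps `outer`."""
    overlap = (
        max(inner[0], outer[0]),
        max(inner[1], outer[1]),
        min(inner[2], outer[2]),
        min(inner[3], outer[3]),
    )
    inner_area = area(inner)
    return area(overlap) / inner_area if inner_area > 0 else 0.0


def drop_nested_boxes(boxes: List[Box], threshold: float = 0.9) -> List[Box]:
    """Keep large boxes; drop boxes mostly covered by an already-kept box.

    `threshold` is an assumed cut-off: a box is dropped when at least this
    fraction of its area lies inside a kept box.
    """
    kept: List[Box] = []
    # Visit the largest boxes first so paragraph-level boxes win over line-level ones.
    for box in sorted(boxes, key=area, reverse=True):
        if all(contained_fraction(box, k) < threshold for k in kept):
            kept.append(box)
    return kept


detections = [
    (10, 10, 200, 100),   # paragraph-level block
    (12, 12, 198, 30),    # a line inside the paragraph
    (12, 35, 198, 55),    # another line inside the paragraph
    (10, 120, 200, 200),  # a separate paragraph
]
filtered = drop_nested_boxes(detections)
```

In this example, `filtered` keeps only the two paragraph-level boxes. Lowering the threshold removes more partially overlapping boxes, at the risk of discarding genuinely distinct blocks that happen to overlap.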
