You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
What I see is that block detection is too sensitive - meaning instead returning one block of text, the result is instead four blocks of text. The input is an article from pubmed.
What is the best practice in such case ?
labling additional data and fine tuning the model ?
post analysis using the coordinates ? (too hacky)
is there any other model that is less sensitive
The text was updated successfully, but these errors were encountered:
I am using layoutparser '0.3.4' through
! pip install layoutparser torchvision && pip install "detectron2@git+https://github.com/facebookresearch/[email protected]#egg=detectron2"
in colabmy model is
What I see is that block detection is too sensitive - meaning instead returning one block of text, the result is instead four blocks of text. The input is an article from pubmed.
What is the best practice in such case ?
The text was updated successfully, but these errors were encountered: