DamonsJ/dhSegment

This branch is 79 commits behind dhlab-epfl/dhSegment:master.

dhSegment

dhSegment is a tool for historical document processing. Its generic approach makes it possible to segment regions and extract content from different types of documents. See some examples here.

The complete description of the system can be found in the corresponding paper.

It was created by Benoit Seguin and Sofia Ares Oliveira at DHLAB, EPFL.

Installation and usage

The installation procedure and examples of usage can be found in the documentation (see section below).

Demo

Try the demo: the demo.py script trains dhSegment (optional) and applies it to page extraction.

Documentation

Under construction

The documentation is available on readthedocs.

If you use this code in your research, you can cite the corresponding paper as:

@inproceedings{oliveiraseguinkaplan2018dhsegment,
  title={dhSegment: A generic deep-learning approach for document segmentation},
  author={Ares Oliveira, Sofia and Seguin, Benoit and Kaplan, Frederic},
  booktitle={Frontiers in Handwriting Recognition (ICFHR), 2018 16th International Conference on},
  pages={7--12},
  year={2018},
  organization={IEEE}
}

About

Generic framework for historical document processing
