Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

credit #22

Open
rlzijdeman opened this issue Apr 9, 2024 · 4 comments
Open

credit #22

rlzijdeman opened this issue Apr 9, 2024 · 4 comments

Comments

@rlzijdeman
Copy link

Could you please add in the readme a section crediting the creators and grants that made Loghi possible?

@carschno
Copy link

carschno commented Jul 1, 2024

Perhaps related: how to cite Loghi? Maybe add a citation file?

@rvankoert
Copy link
Collaborator

We only have a paper that is not published yet, but will be in august/september. I'll try to add citation info. For now you should be able to use the following in bibtex format:

@InProceedings{loghi,
author={van Koert, Rutger
and Klut, Stefan
and Maas, Martijn
and Koornstra, Tim
and Peters, Luke},
title={Loghi: an end-to-end framework for making historical documents machine readable},
year={2024},
publisher={Springer Nature},
abstract={Loghi is a novel framework and suite of tools for the layout analysis and text recognition of historical documents. Scans are processed in a modular pipeline, with the option to use alternative tools in most stages. Layout analysis and text recognition can be trained on example images with PageXML ground truth. The framework is intended to convert scanned documents to machine-readable PageXML. Additional tooling is provided for the creation of synthetic ground truth. A visualiser for troubleshooting the text recognition training is also made available. The result is a framework for end-to-end text recognition, which works from initial layout analysis on the scanned documents, and includes text line detection, text recognition, reading order detection and language detection. The Loghi pipeline has been used successfully in several projects. We achieve good results on the layout analysis and text recognition of both the handwritten and printed archives of the Dutch States General on resolutions spanning the 17th and 18th century. The CER on handwritten 17th century material is below 3 percent. Loghi is open source and free to use.},
numpages = {16},
keywords = {handwritten text recognition, layout analysis, pagexml},
location = {Athens, Greece},
series = {ARPC '24}
}

@carschno
Copy link

carschno commented Jul 8, 2024

Perfect, thanks!
I suggest to add that information in a citation file: #27

@stefanklut
Copy link

The citation file has been added here
As well as a bibtex entry in the README

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants