Commit a527b721 authored by Vít Starý Novotný's avatar Vít Starý Novotný
Browse files

Cite PERO OCR

parent 575258b8
Loading
Loading
Loading
Loading
Loading
+18 −3
Original line number Diff line number Diff line
@@ -37,9 +37,21 @@ Run `ahisto-ocr --help` from the command line for more information.

The development of the tool has been documented in the following two conference articles:

- [When Tesseract Does It Alone: Optical Character Recognition of Medieval Texts][8]
- [When Tesseract Brings Friends: Layout Analysis, Language Identification, and
   Super-Resolution for the Optical Character Recognition of Medieval Texts][3]
- [V Novotný: When Tesseract Does It Alone: Optical Character Recognition of
  Medieval Texts. RASLAN, 2020.][8]
- [V Novotný, K Seidlová, T Vrabcová, A Horák. When Tesseract Brings Friends:
  Layout Analysis, Language Identification, and Super-Resolution for the
  Optical Character Recognition of Medieval Texts. RASLAN, 2021.][3]

Since August 2022 (see commit 7e943d0), the tool uses PERO OCR from the Brno
University of Technology instead of Google Vision AI:

- [O Kodym, M Hradiš: Page Layout Analysis System for Unconstrained Historic
  Documents. ICDAR, 2021.][10]
- [M Kišš, K Beneš, M Hradiš: AT-ST: Self-Training Adaptation Strategy for OCR
  in Domains with Limited Transcriptions. ICDAR, 2021.][11]
- [J Kohút, M Hradiš: TS-Net: OCR Trained to Switch Between Text Transcription
  Styles. ICDAR, 2021.][12]

## Notes

@@ -56,3 +68,6 @@ from [the RASLAN 2021 article When Tesseract Brings Friends][3] is available in
 [7]: https://starfos.tacr.cz/en/project/TL03000365
 [8]: https://nlp.fi.muni.cz/raslan/2020/paper1.pdf
 [9]: https://docs.nvidia.com/datacenter/cloud-native/container-toolkit/install-guide.html#docker
 [10]: http://doi.org/10.1007/978-3-030-86331-9_32
 [11]: http://doi.org/10.1007/978-3-030-86337-1_31
 [12]: http://doi.org/10.1007/978-3-030-86337-1_32