| --- |
| license: mit |
| language: |
| - de |
| metrics: |
| - cer |
| library_name: transformers |
| tags: |
| - kurrent |
| - ocr |
| - htr |
| - 19th century |
| --- |
| # TrOCR Kurrent-Model 19th century |
| Base model: **microsoft/trocr-base-handwritten** |
|
|
| Train Lines: 292'997 |
| Eval Lines: 7'513 |
| Test Lines: 15'817 |
|
|
| Epochs: 19.66 / 20 |
| Eval CER: 0.02827 |
| Test CER: 0.02655 |
|
|
| Finetuned on Kurrent-dataset, containing: |
| - Material from the State Archives of Zurich ("Regierungsratsprotokolle"), provided by the State Archives of Zurich |
| - Lecture notes of Humboldt Lectures, provided by the Berlin-Brandenburgian Academy of Sciences |
| - Diary of Eugen Huber, provided by the University of Zurich |
| - Handwritting and Copies by and of Gottfried Semper |
| - Konzilsprotokolle, University of Greifswald (19th century) |
| - as well as many other smaller collections/examples |
|
|
| The model has not been extensively tested. |
| Potential biases are still to be identified. |