File size: 1,244 Bytes
1de99ec
 
 
 
 
 
 
 
 
 
 
 
 
 
 
19a60e5
 
 
 
 
1de99ec
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
dbec26a
1de99ec
e621559
 
1de99ec
 
 
c0f9bdb
 
 
 
7e8e1bf
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
---
library_name: PyLaia
license: mit
tags:
- PyLaia
- PyTorch
- atr
- htr
- ocr
- historical
- handwritten
metrics:
- CER
- WER
language:
- fr
- la
- it
- oc
- es
datasets:
- CATMuS/medieval
pipeline_tag: image-to-text
---

# PyLaia - CATMuS/medieval

This model performs Handwritten Text Recognition in Latin/Romance on historical documents.

## Model description

The model was trained using the PyLaia library on the [CATMuS/medieval](https://huggingface.co/datasets/CATMuS/medieval) dataset.

Training images were resized with a fixed height of {dimension} pixels, keeping the original aspect ratio. Vertical lines are discarded.

| set | lines | 
| :----- | ------: | 
| train | 15,2816 |
| val   |  19,402 |
| test  |  22,590 |

An external 6-gram character language model can be used to improve recognition. The language model is trained on the text from the CATMuS/medieval training set.

## Training Plot


![image/png](https://cdn-uploads.huggingface.co/production/uploads/6616d277007cf0aeb2600b77/6bJ-xbwuLHc2cU3m38YZf.png)

## How to use?

Please refer to the [PyLaia documentation](https://atr.pages.teklia.com/pylaia/usage/prediction/) to use this model.

## Demo

https://huggingface.co/spaces/johnlockejrr/yolov11_pylaia_catmus