File size: 840 Bytes
0b786d8
 
 
9e8ac57
 
 
 
 
 
 
 
0b786d8
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
---
# For reference on model card metadata, see the spec: https://github.com/huggingface/hub-docs/blob/main/modelcard.md?plain=1
# Doc / guide: https://huggingface.co/docs/hub/model-cards
language: fo
tag: text2text-generation
pipeline_tag: text2text-generation
widget:
- text: "l/ú veit eg tað várar í P'oroyum"
inference:
  parameters:
    max_length: 512
---

# Model Card for Model ID

<!-- Provide a quick summary of what the model is/does. -->
OCR post processing for Faroese.

## Model Details
This model is finetuned using a ByT5 model (base) trained on Icelandic OCR post-processing data: https://huggingface.co/atlijas/byt5-is-ocr-post-processing-modern-texts
The Faroese training data was created by extracting authentic errors from OCR-ed Faroese texts and applied to a corpus of Faroese, along with random character noise.