Ba2han's picture
Update README.md
ac5604c verified
---
license: mit
datasets:
- Ba2han/Turkish-noisy
language:
- tr
base_model:
- vngrs-ai/VBART-Medium-Base
---
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6324eabf05bd8a54c6eb1650/kXieKlHd03d5QWAhExCTB.png)
This model was trained on noisy and clean examples. The aim was to clean corrupted text (from OCR etc.) but the model hallucinates if the damage is too bad.
___________________
Bu model gürültülü ve temiz örnekler ile eğitildi. Amaç, (OCR kaynaklı vs.) bozuk yazıları tamir etmekti ama model hâlâ fazla hasarlı metinlerde halüsinasyon üretiyor.
Example of hallucination / halüsinasyon örneği:
![image/png](https://cdn-uploads.huggingface.co/production/uploads/6324eabf05bd8a54c6eb1650/Ju8mdnb-8hlZgCB5jGsDM.png)
Great for its' size but lacks a bit of contextual understanding.
Boyutuna göre iyi ama bağlamsal anlayış biraz eksik.