carlosdanielhernandezmena commited on
Commit
6e8da3e
1 Parent(s): 1d4211f

Specifying that Samrómur Milljón was used to train this model.

Browse files
Files changed (1) hide show
  1. README.md +1 -1
README.md CHANGED
@@ -125,7 +125,7 @@ model-index:
125
 
126
  The "whisper-large-icelandic-62640-steps-967h" is an acoustic model suitable for Automatic Speech Recognition in Icelandic. It is the result of fine-tuning the model [openai/whisper-large](https://huggingface.co/openai/whisper-large) for 62,640 steps with 967 hours of Icelandic data collected by the [Language and Voice Laboratory](https://huggingface.co/language-and-voice-lab) through the platform [Samrómur](https://samromur.is/).
127
 
128
- The specific data that was used to fine-tune the model is the result of the automatic verification of 1 million of recordings comming from the corpus ["Samromur Unverified 22.07"](http://hdl.handle.net/20.500.12537/265). It has to be pointed out that this model was trained with different data than our previous model [whisper-large-icelandic-30k-steps-1000h](https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h).
129
 
130
  The fine-tuning process was performed during June (2023) in the servers of the Language and Voice Laboratory (https://lvl.ru.is/) at Reykjavík University (Iceland) by [Carlos Daniel Hernández Mena](https://huggingface.co/carlosdanielhernandezmena).
131
 
 
125
 
126
  The "whisper-large-icelandic-62640-steps-967h" is an acoustic model suitable for Automatic Speech Recognition in Icelandic. It is the result of fine-tuning the model [openai/whisper-large](https://huggingface.co/openai/whisper-large) for 62,640 steps with 967 hours of Icelandic data collected by the [Language and Voice Laboratory](https://huggingface.co/language-and-voice-lab) through the platform [Samrómur](https://samromur.is/).
127
 
128
+ The specific data that was used to fine-tune the model is the corpus [Samrómur Milljón](https://huggingface.co/datasets/language-and-voice-lab/samromur_milljon), which is the result of the automatic verification of 1 million of recordings comming from the corpus ["Samromur Unverified 22.07"](http://hdl.handle.net/20.500.12537/265). It has to be pointed out that this model was trained with different data than our previous model [whisper-large-icelandic-30k-steps-1000h](https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-30k-steps-1000h).
129
 
130
  The fine-tuning process was performed during June (2023) in the servers of the Language and Voice Laboratory (https://lvl.ru.is/) at Reykjavík University (Iceland) by [Carlos Daniel Hernández Mena](https://huggingface.co/carlosdanielhernandezmena).
131