---
license: cc-by-4.0
language:
- is
datasets:
- language-and-voice-lab/samromur_milljon
tags:
- audio
- automatic-speech-recognition
- icelandic
- whisper
- whisper-large
- iceland
- reykjavik
- samromur
- faster-whisper
---
# whisper-large-icelandic-62640-steps-967h-ct2
This is a faster-whisper version of [language-and-voice-lab/whisper-large-icelandic-62640-steps-967h](https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-62640-steps-967h).
The model was converted to the CTranslate2 format as described in [faster-whisper](https://github.com/guillaumekln/faster-whisper/tree/master):
```bash
ct2-transformers-converter --model language-and-voice-lab/whisper-large-icelandic-62640-steps-967h \
--output_dir whisper-large-icelandic-62640-steps-967h-ct2 \
--quantization float16
```
# Usage
```python
from faster_whisper import WhisperModel
# Path to the converted model directory
model_size = "whisper-large-icelandic-62640-steps-967h-ct2"
# Run on GPU with FP16
model = WhisperModel(model_size, device="cuda", compute_type="float16")
# or run on GPU with INT8
# model = WhisperModel(model_size, device="cuda", compute_type="int8_float16")
# or run on CPU with INT8
# model = WhisperModel(model_size, device="cpu", compute_type="int8")
segments, info = model.transcribe("audio.mp3", beam_size=5)
print("Detected language '%s' with probability %f" % (info.language, info.language_probability))
for segment in segments:
    print("[%.2fs -> %.2fs] %s" % (segment.start, segment.end, segment.text))
```
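The segments printed in the usage example above can also be written out as subtitles. The following is a minimal sketch, assuming only that each segment exposes the `start`, `end`, and `text` attributes used above. The `Segment` tuple, `format_timestamp`, and `segments_to_srt` are hypothetical names introduced for illustration and are not part of faster-whisper.

```python
from collections import namedtuple

# Hypothetical stand-in for the segment objects yielded by model.transcribe();
# only the start, end, and text attributes from the usage example are assumed.
Segment = namedtuple("Segment", ["start", "end", "text"])


def format_timestamp(seconds):
    """Convert a time in seconds to the SRT timestamp form HH:MM:SS,mmm."""
    total_ms = int(round(seconds * 1000))
    hours, rest = divmod(total_ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    secs, ms = divmod(rest, 1000)
    return "%02d:%02d:%02d,%03d" % (hours, minutes, secs, ms)


def segments_to_srt(segments):
    """Render an iterable of segments as a single SRT-formatted string."""
    blocks = []
    for index, segment in enumerate(segments, start=1):
        blocks.append(
            "%d\n%s --> %s\n%s\n"
            % (
                index,
                format_timestamp(segment.start),
                format_timestamp(segment.end),
                segment.text.strip(),
            )
        )
    # A blank line separates consecutive subtitle blocks.
    return "\n".join(blocks)


# Illustrative data only; real segments would come from model.transcribe().
demo = [Segment(0.0, 2.5, " Takk fyrir."), Segment(2.5, 4.0, " Velkomin.")]
print(segments_to_srt(demo))
```

With a loaded model, passing the `segments` iterable from `model.transcribe(...)` to `segments_to_srt` in place of `demo` should give the same layout, provided the segment attributes match the assumption above.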
# BibTeX entry and citation info
*When publishing results based on this model, please refer to:*
```bibtex
@misc{mena2023whisperlarge62640icelandicct2,
title={Acoustic Model in Icelandic: whisper-large-icelandic-62640-steps-967h-ct2.},
author={Hernandez Mena, Carlos Daniel},
url={https://huggingface.co/language-and-voice-lab/whisper-large-icelandic-62640-steps-967h-ct2},
year={2023}
}
```