chrisjay/afrospeech-wav2vec-all-6

Audio Classification

afro-digits-speech


chrisjay committed on Oct 5, 2022

Commit 4c24ac7 • 1 Parent(s): 9f32578

updated README

Files changed (1)

README.md +5 -9

README.md CHANGED

@@ -25,8 +25,7 @@ model-index:
 # afrospeech-wav2vec-all-6
-This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on the [crowd-speech-africa](https://huggingface.co/datasets/chrisjay/crowd-speech-africa).
-It achieves the following results on the [validation set](VALID_all_interesred_6_audiodata.csv):
 - F1: 0.5787048581502744
 - Accuracy: 0.6205357142857143
@@ -35,21 +34,18 @@ The confusion matrix below helps to give a better look at the model's performanc
 ![confusion matrix](afrospeech-wav2vec-all-6_confusion_matrix_VALID.png)
-## Model description
-More information needed
-## Intended uses & limitations
-More information needed
-## Training and evaluation data
 - Size of training set: 1977
 - Size of validation set: 396
 ![digits-bar-plot-for-afrospeech](digits-bar-plot-for-afrospeech-wav2vec-all-6.png)
-## Training procedure
 ### Training hyperparameters

 # afrospeech-wav2vec-all-6
+This model is a fine-tuned version of [facebook/wav2vec2-base](https://huggingface.co/facebook/wav2vec2-base) on [crowd-speech-africa](https://huggingface.co/datasets/chrisjay/crowd-speech-africa), a crowd-sourced dataset collected using the [afro-speech Space](https://huggingface.co/spaces/chrisjay/afro-speech). It achieves the following results on the [validation set](VALID_all_interesred_6_audiodata.csv):
 - F1: 0.5787048581502744
 - Accuracy: 0.6205357142857143
 ![confusion matrix](afrospeech-wav2vec-all-6_confusion_matrix_VALID.png)
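
For readers who want to see how figures like the F1 and accuracy above relate to per-example predictions, here is a minimal sketch in plain Python. It assumes macro-averaged F1; the card does not say which averaging was used, so treat that as an assumption. The function names and the toy labels are hypothetical and are not taken from the validation set.

```python
from collections import Counter


def accuracy(y_true, y_pred):
    """Fraction of examples whose predicted label equals the true label."""
    correct = sum(t == p for t, p in zip(y_true, y_pred))
    return correct / len(y_true)


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 (assumed averaging, see lead-in)."""
    labels = sorted(set(y_true) | set(y_pred))
    tp, fp, fn = Counter(), Counter(), Counter()
    for t, p in zip(y_true, y_pred):
        if t == p:
            tp[t] += 1
        else:
            fp[p] += 1  # predicted class p, but it was wrong
            fn[t] += 1  # true class t was missed
    scores = []
    for label in labels:
        denom = 2 * tp[label] + fp[label] + fn[label]
        # F1 = 2TP / (2TP + FP + FN); define it as 0 when the class never appears
        scores.append(2 * tp[label] / denom if denom else 0.0)
    return sum(scores) / len(scores)


# Hypothetical toy labels, for illustration only
y_true = [0, 0, 1, 1, 2, 2]
y_pred = [0, 1, 1, 1, 2, 0]
print(accuracy(y_true, y_pred), macro_f1(y_true, y_pred))
```

Macro averaging weights every class equally, which can pull F1 below accuracy when some classes are predicted poorly; a confusion matrix shows where those errors fall.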
+## Training and evaluation data
+The model was trained on mixed audio data from 6 African languages: Igbo (`ibo`), Yoruba (`yor`), Rundi (`run`), Oshiwambo (`kua`), Shona (`sna`) and Oromo (`gax`).
 - Size of training set: 1977
 - Size of validation set: 396
+Below is the distribution of the dataset (training and validation):
 ![digits-bar-plot-for-afrospeech](digits-bar-plot-for-afrospeech-wav2vec-all-6.png)
 ### Training hyperparameters