{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Speech-to-Text Seq2Seq Whisper" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "Finetuned hyperlocal languages on pretrained HuggingFace models, https://huggingface.co/mesolitica" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", " | Size (MB) | \n", "malay-malaya | \n", "malay-fleur102 | \n", "singlish | \n", "Language | \n", "
---|---|---|---|---|---|
mesolitica/finetune-whisper-tiny-ms-singlish | \n", "151 | \n", "{'WER': 0.20141585, 'CER': 0.071964908} | \n", "{'WER': 0.235680975, 'CER': 0.0986880877} | \n", "{'WER': 0.09045121, 'CER': 0.0481965} | \n", "[malay, singlish] | \n", "
mesolitica/finetune-whisper-tiny-ms-singlish-v2 | \n", "151 | \n", "{'WER': 0.20141585, 'CER': 0.071964908} | \n", "{'WER': 0.22459602, 'CER': 0.089406469} | \n", "{'WER': 0.138882971, 'CER': 0.074929807} | \n", "[malay, singlish] | \n", "
mesolitica/finetune-whisper-base-ms-singlish-v2 | \n", "290 | \n", "{'WER': 0.172632664, 'CER': 0.0680027682} | \n", "{'WER': 0.1837319118, 'CER': 0.0599804251} | \n", "{'WER': 0.111506313, 'CER': 0.05852830724} | \n", "[malay, singlish] | \n", "
mesolitica/finetune-whisper-small-ms-singlish-v2 | \n", "967 | \n", "{'WER': 0.13189875561, 'CER': 0.0434602169} | \n", "{'WER': 0.13277694, 'CER': 0.0478108612} | \n", "{'WER': 0.09489335668, 'CER': 0.05045327551} | \n", "[malay, singlish] | \n", "