{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Speaker Vector" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", " | Size (MB) | \n", "Quantized Size (MB) | \n", "Embedding Size | \n", "EER | \n", "
---|---|---|---|---|
deep-speaker | \n", "96.7 | \n", "24.40 | \n", "512.0 | \n", "0.21870 | \n", "
vggvox-v1 | \n", "70.8 | \n", "17.70 | \n", "1024.0 | \n", "0.14070 | \n", "
vggvox-v2 | \n", "43.2 | \n", "7.92 | \n", "512.0 | \n", "0.04450 | \n", "
speakernet | \n", "35.0 | \n", "8.88 | \n", "7205.0 | \n", "0.02122 | \n", "