{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Voice Activity Detection" ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
| Model | Size (MB) | Quantized Size (MB) | Accuracy |\n", "
|---|---|---|---|\n", "
| vggvox-v1 | 70.800 | 17.700 | 0.809844 |\n", "
| vggvox-v2 | 31.100 | 7.920 | 0.819688 |\n", "
| speakernet | 20.300 | 5.180 | 0.734062 |\n", "
| marblenet-factor1 | 0.526 | 0.232 | 0.849187 |\n", "
| marblenet-factor3 | 3.210 | 0.934 | 0.838556 |\n", "
| marblenet-factor5 | 8.380 | 2.210 | 0.843541 |\n", "