{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# Noise Reduction\n", "\n", "Reduce background musics, noises and etc while maintain voice activities." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "
\n", " | Size (MB) | \n", "Quantized Size (MB) | \n", "SUM MAE | \n", "MAE_SPEAKER | \n", "MAE_NOISE | \n", "SDR | \n", "ISR | \n", "SAR | \n", "
---|---|---|---|---|---|---|---|---|
unet | \n", "78.9 | \n", "20.0 | \n", "0.862316 | \n", "0.460676 | \n", "0.40164 | \n", "9.173120 | \n", "13.92435 | \n", "13.20592 | \n", "
resnet-unet | \n", "96.4 | \n", "24.6 | \n", "0.825350 | \n", "0.438850 | \n", "0.38649 | \n", "9.454130 | \n", "13.96390 | \n", "13.60276 | \n", "
resnext-unet | \n", "75.4 | \n", "19.0 | \n", "0.811020 | \n", "0.447190 | \n", "0.36383 | \n", "8.992832 | \n", "13.49194 | \n", "13.13210 | \n", "