{ "cells": [ { "cell_type": "markdown", "id": "adapted-channel", "metadata": {}, "source": [ "# Speech Split PySPTK" ] }, { "cell_type": "markdown", "id": "accessory-relief", "metadata": {}, "source": [ "detailed speaking style conversion by disentangling speech into content, timbre, rhythm and pitch using PySPTK." ] }, { "cell_type": "markdown", "id": "incoming-willow", "metadata": {}, "source": [ "
\n", " | Size (MB) | \n", "Quantized Size (MB) | \n", "
---|---|---|
fastspeechsplit-vggvox-v2 | \n", "232.0 | \n", "59.2 | \n", "
fastspeechsplit-v2-vggvox-v2 | \n", "105.0 | \n", "411.0 | \n", "