{ "cells": [ { "cell_type": "markdown", "metadata": {}, "source": [ "# YouTube ASR + Diarization\n", "\n", "Suppose you want to transcribe long audio from YouTube and detect the speakers using TorchAudio; malaya-speech can do that." ] }, { "cell_type": "markdown", "metadata": {}, "source": [ "