ASR

A Whisper model trained on prosodic and phonemic transcriptions from JSUT5000 and a VOICEVOX-generated dataset.