ASR

Whisper trained on Prosodic&Phonemic transcription on JSUT5000 & VOICEVOX generated dataset.

audio_path

output

·

Built with Gradio logo

·