WhisperTemple Synthetic ASR Dataset Generator #897

gongouveia · 2024-07-03T10:53:42Z

WhisperTemple is a user-friendly GUI for capturing audio, transcribing it in real time or in batch mode, and managing the resulting dataset. It offers an intuitive interface for setting audio parameters and transcription options. It also simplifies exporting the generated and edited dataset to HuggingFace and retraining the Whisper model on custom data.
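For anyone curious what preparing such a dataset for export might involve, here is a minimal sketch. It is not WhisperTemple's actual code: the `write_metadata` helper and the `transcription` column name are hypothetical, and it assumes the recordings are laid out as a folder of audio files plus a `metadata.csv` with a `file_name` column, which is the layout the Hugging Face `datasets` library's `audiofolder` loader reads.

```python
import csv
from pathlib import Path


def write_metadata(dataset_dir, transcripts):
    """Write metadata.csv pairing each audio file with its transcription.

    Hypothetical helper, not part of WhisperTemple.
    transcripts: dict of {audio path relative to dataset_dir: text}.
    Returns the path of the written metadata file.
    """
    dataset_dir = Path(dataset_dir)
    dataset_dir.mkdir(parents=True, exist_ok=True)
    metadata_path = dataset_dir / "metadata.csv"

    with metadata_path.open("w", newline="", encoding="utf-8") as f:
        writer = csv.writer(f)
        # "file_name" is the column the audiofolder layout expects;
        # "transcription" is an assumed name for the text column.
        writer.writerow(["file_name", "transcription"])
        # Sort for a stable, reproducible row order.
        for file_name, text in sorted(transcripts.items()):
            writer.writerow([file_name, text.strip()])

    return metadata_path
```

With the `datasets` package installed, a folder prepared this way could then be loaded with `load_dataset("audiofolder", data_dir=...)` and uploaded with `push_to_hub(...)`. How WhisperTemple itself performs the export may differ.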

I developed this project in my spare time and would greatly appreciate any feedback or contributions. Please consider starring the repository to stay updated with future enhancements.

Replies: 0 comments