WhisperTemple Synthetic ASR Dataset Generator #897
gongouveia
started this conversation in
Show and tell
Replies: 0 comments
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
-
WhisperTemple is a user-friendly GUI designed for capturing audio and transcribing it in real-time or in batch mode, while also managing the resulting dataset. It offers an intuitive interface for setting audio parameters, transcription options, and managing datasets. Additionally, it simplifies exporting the generated and edited dataset to HuggingFace and retraining the Whisper model on custom data.
I developed this project in my spare time and would greatly appreciate any feedback or contributions. Please consider starring the repository to stay updated with future enhancements.
Beta Was this translation helpful? Give feedback.
All reactions