Changes
This update focuses on training again. It adds several preprocessing options that improve the quality of the dataset, and thus trained models, especially for languages other than English (which generally suffer from more artifacts).
In order to make use of it, please install Pandrator again (or download the biggest package again).
The new options for processing source audio include:
- trimming end silence,
- removal of breath sounds,
- fade-in and -out effect,
- discarding segments that still end abruptly, even after all preprocessing, to avoid "clicks" at the end of generated sentences.
If your source audio is professional (studio quality), don't use any preprocessing options except for trimming, fade and perhaps abrupt cut-off detection.
Self-contained packages
I've prepared packages (archives) that you can simply unpack - everything is preinstalled in its own portable conda environment. You can download them from here.
You can use the launcher to start Pandrator, update it and install new features.
Package | Contents | Unpacked Size |
---|---|---|
1 | Pandrator and Silero | 4GB |
2 | Pandrator and XTTS (CPU only) | 7GB |
3 | Pandrator and XTTS (Nvidia GPU Support) | 14GB |
4 | Pandrator, XTTS, RVC, WhisperX (for dubbing and training) and XTTS fine-tuning | 36GB |
Installer
You may use the installer/launcher below, which was created from the pandrator_installer_launcher.py
file in the repository, or use the source file directly. Please remember to run the executable as an administrator. It's possible that Windows or your antivirus software will flag it as a threat. You may whitelist it, or, if you're not comfortable doing that, review the code in the repository and install Pandrator manually.