v7.5.0
This release adds Speech to Speech RAG, new TTS models, and Generative Audio features.
See below for full details on the new features, improvements and bug fixes.
New Features
- Add Speech to Speech example notebook (#789)
- Add streaming speech generation (#784)
- Add a microphone pipeline (#785)
- Add an audio playback pipeline (#786)
- Add Text to Audio pipeline (#792)
- Add support for SpeechT5 ONNX exports with Text to Speech pipeline (#793)
- Add audio signal processing and mixing methods (#795)
- Add Generative Audio example notebook (#798)
- Add example notebook covering open data access (#782)
Improvements
- Address issue with language-specific transcription using txtai and Whisper (#593)
- Update TextToSpeech pipeline to support speaker parameter (#787)
- Update Text to Speech Generation Notebook (#790)
- Update hf_hub_download methods to use cached_file (#794)
- Require Python >= 3.9 (#796)
- Upgrade pylint and black (#797)
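
Since this release requires Python >= 3.9 (#796), it may help to check the interpreter version before upgrading. The following is a minimal sketch using only the standard library; the `meets_minimum` helper and the `MINIMUM` constant are hypothetical names introduced for illustration and are not part of txtai.

```python
import sys

# Minimum version stated in the release notes (#796).
# MINIMUM and meets_minimum are illustrative names, not txtai APIs.
MINIMUM = (3, 9)


def meets_minimum(version=None, minimum=MINIMUM):
    """Return True if (major, minor) of `version` is at least `minimum`."""
    # Default to the running interpreter when no version is passed in
    version = sys.version_info if version is None else version
    return tuple(version[:2]) >= tuple(minimum)


if __name__ == "__main__":
    if meets_minimum():
        print("Python version satisfies the >= 3.9 requirement")
    else:
        print("Python 3.9 or later is required; found %d.%d" % sys.version_info[:2])
```

Running the script under the interpreter that will host the upgraded package reports whether that environment satisfies the new requirement.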