v7.5.0

davidmezzetti released this 14 Oct 11:58

· 79 commits to master since this release

72c68c9

This release adds Speech to Speech RAG, new TTS models and Generative Audio features

See below for full details on the new features, improvements and bug fixes.

New Features

Add Speech to Speech example notebook (#789)
Add streaming speech generation (#784)
Add a microphone pipeline (#785)
Add an audio playback pipeline (#786)
Add Text to Audio pipeline (#792)
Add support for SpeechT5 ONNX exports with Text to Speech pipeline (#793)
Add audio signal processing and mixing methods (#795)
Add Generative Audio example notebook (#798)
Add example notebook covering open data access (#782)

Improvements

Issue with Language Specific Transcription Using txtai and Whisper (#593)
Update TextToSpeech pipeline to support speaker parameter (#787)
Update Text to Speech Generation Notebook (#790)
Update hf_hub_download methods to use cached_file (#794)
Require Python >= 3.9 (#796)
Upgrade pylint and black (#797)

Assets 2