Skip to content

v7.5.0

Compare
Choose a tag to compare
@davidmezzetti davidmezzetti released this 14 Oct 11:58
· 79 commits to master since this release

This release adds Speech to Speech RAG, new TTS models and Generative Audio features

See below for full details on the new features, improvements and bug fixes.

New Features

  • Add Speech to Speech example notebook (#789)
  • Add streaming speech generation (#784)
  • Add a microphone pipeline (#785)
  • Add an audio playback pipeline (#786)
  • Add Text to Audio pipeline (#792)
  • Add support for SpeechT5 ONNX exports with Text to Speech pipeline (#793)
  • Add audio signal processing and mixing methods (#795)
  • Add Generative Audio example notebook (#798)
  • Add example notebook covering open data access (#782)

Improvements

  • Issue with Language Specific Transcription Using txtai and Whisper (#593)
  • Update TextToSpeech pipeline to support speaker parameter (#787)
  • Update Text to Speech Generation Notebook (#790)
  • Update hf_hub_download methods to use cached_file (#794)
  • Require Python >= 3.9 (#796)
  • Upgrade pylint and black (#797)