关于FastSpeech2语音合成 #2141
-
用FastSpeech2合成音频时能顺便返回字级别音素边界有大佬帮忙帮忙提供一下思路吗 |
Beta Was this translation helpful? Give feedback.
Answered by
yt605155624
Jul 12, 2022
Replies: 1 comment 5 replies
-
参考 https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos/style_fs2 |
Beta Was this translation helpful? Give feedback.
5 replies
Answer selected by
lym0302
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
参考 https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/demos/style_fs2
里面调整速度就是获取了 phone 级别的 duration 再用 scale 调节 duration 实现的,那么你获取 phone 级别的 duration 后,稍微加点规则就能获得字级别的边界了(一般一个字是由一个声母+一个韵母,或者一个单韵母组成)