🤓 ASR 如何使用自己的语料在预训练模型上 finetune #1972

Jackwaterveg · 2022-05-26T14:13:32Z

Jackwaterveg
May 26, 2022
Collaborator

先说一下 example 中数据处理的步骤：

如果你已经可以生成自己的 manifest 文件了，但是自己构建的 manifest 文件使用的词表长度和预训练模型的词表长度不一致，而你还希望用完整的预训练模型。那么，你可以在生成 manifest 文件的时候使用预训练的词表。也就是说，处理数据的时候，使用如下脚本：

 bash local/data.sh --stage -1 --stop_stage -1    # 生成你使用数据的 manifest 文件
 bash local/data.sh --stage 0 --stop_stage 0      # 生成你使用数据的 mean_std
 bash local/data.sh --stage 2 --stop_stage 2      # 跳过构建你的 vocab.txt, 使用预训练模型的词表生成正式的 manifest 文件

J-ZZ · 2022-08-21T03:37:49Z

J-ZZ
Aug 21, 2022

请问预训练模型是怎么加载的呢是否可以选择性的加载其他的预训练模型

3 replies

zh794390558 Aug 22, 2022
Maintainer

可以，模型结构一致即可。

Jackwaterveg Aug 24, 2022
Collaborator Author

加载模型的代码位置：

PaddleSpeech/paddlespeech/s2t/utils/checkpoint.py

Line 115 in 5a58a27

model_dict = paddle.load(params_path)

如果你要加载模型继续训练，你可以先自己从头训练一段时间，产生了checkpoint 和相应的index 文件，然后你可以修改 index文件，或者把预训练模型改名为产生的checkpoint，再继续训练。

JoyceMind Oct 5, 2022

为什么会有一个原始的和正式的manifest？有.raw和没有的有什么区别？

yaleimeng · 2023-03-28T03:09:11Z

yaleimeng
Mar 28, 2023

哪个是index文件？就是1、2、3.json 这种吗？这种json文件里面就几个字段，需要改什么？
预训练模型只有*.pdprams，没有*.pdopt文件，只把pdparams改名放进去，没删除*.pdopt继续微调效果不好啊
有人微调成功吗？效果比预训练模型有提升吗？能不能贴一下微调前后CER成绩对比？

2 replies

lemondy Apr 7, 2023

朋友，你微调成功了吗？知道怎么改index吗？

Chuyaoyuan Jun 29, 2023

同问，你微调怎么样了？成功了吗，可以分享下经验吗，多谢

makeukus · 2023-04-06T02:01:56Z

makeukus
Apr 6, 2023

假如我有很多数据，如何才能不使用预训练模型，而完全基于自己的数据训练新的模型。

0 replies

yaleimeng · 2023-04-07T01:23:35Z

yaleimeng
Apr 7, 2023

@makeukus 默认情况就是完全基于自己的数据训练的。按照数据集目录->模型结构去找示例就行了。
使用预训练微调要麻烦一点。

4 replies

makeukus Apr 7, 2023

你有示例代码吗朋友关于【全新模型训练】，我跑了这个项目，https://aistudio.baidu.com/aistudio/projectdetail/5003396。但这个项目就是基于小样本预训练模型合成，

yaleimeng Apr 7, 2023

@makeukus 你发的链接是TTS小样本微调。。跟ASR完全是两码事。
在 https://github.com/PaddlePaddle/PaddleSpeech/tree/develop/examples 里面，有不同类型数据集处理的案例脚本。一般入门以aishell作为参考。把坑踩完，你就成功了。

makeukus Apr 7, 2023

好的，谢谢你，我look一下。

prcvoldermort Aug 7, 2023

大佬，能给个详细些的完全基于自己的数据训练ASR模型的文档或教程吗？

lemondy · 2023-04-07T05:34:50Z

lemondy
Apr 7, 2023

如果我用tts 文字转成语音，然后构造自己语音片段和文字对应关系数据集，不知这种tts生成的音频数据是否可以做微调？声音的音色音调这些会对模型效果有影响吗？

1 reply

yaleimeng Apr 7, 2023

公开的真人语音数据集都用不完，何必要用TTS生成的语音呢？音色音调对ASR影响比较小，但如果你连实际语音数据集都没有，最终的模型也别抱什么期望了

khushbookk · 2023-04-11T07:49:00Z

khushbookk
Apr 11, 2023

I want to fine tune an asr model trained on librispeech with my own dataset specific to a domain, since it's said to use the vocab of the existing model to keep it consistent, but I want to append my vocabulary to the existing one, how do I do it?

0 replies

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

🤓 ASR 如何使用自己的语料在预训练模型上 finetune #1972

{{title}}

Replies: 6 comments 10 replies

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

{{title}}

Select a reply

🤓 ASR 如何使用自己的语料在预训练模型上 finetune #1972

Jackwaterveg May 26, 2022 Collaborator

Replies: 6 comments · 10 replies

zh794390558 Aug 22, 2022 Maintainer

Jackwaterveg Aug 24, 2022 Collaborator Author

Jackwaterveg
May 26, 2022
Collaborator

Replies: 6 comments 10 replies

zh794390558 Aug 22, 2022
Maintainer

Jackwaterveg Aug 24, 2022
Collaborator Author