We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
包含自定义词表,以及自己实现的tokenize,detokenize。 pretrain_pipeline.py是流式输入数据。 各个程序直接使用Python运行即可,具体配置到代码里调整。