The documentation only lists support up to Qwen1.5, but quite a few people in the issues seem to be using it with Qwen2?
I want to use sequence parallelism on Qwen2 to train on long texts.
Sequence parallel needs transformers <4.43. Same issue in #935
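For anyone who wants to check their environment against that constraint before launching a run, here is a minimal sketch. The helper names `parse_major_minor` and `is_below_4_43` are made up for illustration and are not part of any library; the only thing taken from this thread is the `<4.43` bound.

```python
import re


def parse_major_minor(version: str) -> tuple[int, int]:
    """Extract (major, minor) from a version string such as '4.42.4'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError(f"Unrecognized version string: {version!r}")
    return int(match.group(1)), int(match.group(2))


def is_below_4_43(version: str) -> bool:
    """Return True if the version satisfies the '<4.43' bound from this thread."""
    # Tuple comparison avoids the string-comparison trap where '4.9' > '4.43'.
    return parse_major_minor(version) < (4, 43)


# Hypothetical usage, assuming transformers is installed:
#   import transformers
#   if not is_below_4_43(transformers.__version__):
#       raise RuntimeError("Sequence parallel needs transformers <4.43")
```

If the check fails, pinning the package with `pip install "transformers<4.43"` should bring the environment back under the bound.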
I trained one version, but the loss doesn't look quite right, and performance didn't improve either.