Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

LongVILA - compatibility with other LLMs #115

Open
orrzohar opened this issue Aug 20, 2024 · 1 comment
Open

LongVILA - compatibility with other LLMs #115

orrzohar opened this issue Aug 20, 2024 · 1 comment

Comments

@orrzohar
Copy link

Hi!
very impressed by your work with LongVILA. I would like to do long context with Qwen/LLaMA3.1, but currently, i only see support for Mixtral.

Any chance you plan on releaseing long context support for these models soon?

Best,
Orr

@orrzohar
Copy link
Author

Looking at the huggingface release -- it appears that the base LLM is LLaMA3.1?
If yes -- could you please advise how this is implemented? I want to use this on my own model for long-context support, is this some attention implementation add-on?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant