[Common Issue] Check here for some common issues you might encounter #130

Open
kcz358 opened this issue Aug 9, 2024 · 1 comment
Labels: documentation (Improvements or additions to documentation), question (Further information is requested)

kcz358 (Collaborator) commented Aug 9, 2024

Common Issue

More questions will be added.

Training Related

Q: Cannot fine-tune the existing LLaVA-Onevision checkpoints
A: We edited our model's config so that it can be served on sglang. To fine-tune from the existing LLaVA-Onevision checkpoints, you may first need to download all the weights and change the model_type in config.json from llava to qwen2.
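
A minimal sketch of the config edit described above, assuming the weights are already downloaded to a local directory (for example with huggingface_hub.snapshot_download). The helper name set_model_type and the example path are hypothetical and not part of the LLaVA-NeXT codebase; only the llava to qwen2 change comes from the answer.

```python
import json
from pathlib import Path


def set_model_type(config_path, new_type="qwen2", expected_old="llava"):
    """Rewrite model_type in a checkpoint's config.json.

    Returns the model_type that was found before the edit.
    Hypothetical helper: only the llava -> qwen2 change comes from the thread.
    """
    path = Path(config_path)
    config = json.loads(path.read_text(encoding="utf-8"))
    old_type = config.get("model_type")

    if old_type == new_type:
        # Already patched, nothing to do.
        return old_type
    if old_type != expected_old:
        # Refuse to touch a config that does not look like the released one.
        raise ValueError(
            f"expected model_type {expected_old!r}, found {old_type!r}"
        )

    config["model_type"] = new_type
    path.write_text(json.dumps(config, indent=2) + "\n", encoding="utf-8")
    return old_type
```

Usage would look something like set_model_type("/path/to/checkpoint/config.json"), where the path is a placeholder for wherever the checkpoint was downloaded. All other keys in the config are left unchanged.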

Q: Cannot download the LLaVA-NeXT data or LLaVA-Onevision data?
A: We have found that the installed version of the datasets library can sometimes cause issues. Try updating datasets before calling load_dataset.
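
One way to check this before loading, sketched under the assumption that the answer refers to the Hugging Face datasets package. The helper names installed_version and load_split are hypothetical, and the thread does not say which version is required.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(package="datasets"):
    """Return the installed version string of a package, or None if absent."""
    try:
        return version(package)
    except PackageNotFoundError:
        return None


def load_split(repo_id, split="train"):
    """Load one split of a Hugging Face dataset repo.

    The import is deferred so the version check above works even when
    datasets is missing. The default split name is an assumption.
    """
    from datasets import load_dataset

    return load_dataset(repo_id, split=split)
```

If installed_version() returns None or an old release, upgrading with pip install -U datasets and retrying load_split with one of the repo IDs from this thread is a reasonable first step.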

Q: Is the data incomplete?
A: Yes, the LLaVA-NeXT data is incomplete because we cannot release the 15k user data. Other data, such as video and multi-image data, is split across separate dataset repos on Hugging Face, for example lmms-lab/M4-Instruct-Data, lmms-lab/LLaVA-ReCap-CC3M, lmms-lab/ShareGPTVideo, lmms-lab/LLaVA-ReCap-558K, and lmms-lab/LLaVA-ReCap-118K.

kcz358 added the documentation and question labels Aug 9, 2024
kcz358 pinned this issue Aug 9, 2024

Luodian (Contributor) commented Sep 1, 2024

Q: What about the video data?
A: It will be released with @ZhangYuanhan-AI's next version, a more powerful video model.
For now, we have released the data yaml used in the onevision stage at onevision.yaml.

You can check out the three video data subsets: (1) sharegpt4video_255000.json (see sharegpt4video), (2) 0718_0_30_s_academic_mc_v0_1_all.json (to be released), and (3) academic_source_30s_v1_all.json (to be released).
