
[Feature] Support LLaVA #196

Merged
merged 218 commits into InternLM:main on Dec 26, 2023
Conversation

@LZHgrla (Collaborator) commented Nov 2, 2023

Docs: https://github.com/LZHgrla/xtuner/blob/lzh/llava/xtuner/configs/llava/README.md

Features

  • Support fine-tuning the LLaVA model, with LoRA/QLoRA/Full/Freeze modes for the LLM and LoRA/Full/Freeze modes for the visual encoder
  • Chat with the fine-tuned LLaVA model
  • Evaluate the fine-tuned LLaVA model on MMBench
  • Include the official LLaVA pretraining and fine-tuning datasets
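The LLM and the visual encoder can each be tuned in a different mode. A toy sketch (pure Python; the function and group names are hypothetical, not xtuner's actual API) of how such a scheme decides which parameter groups receive gradients:

```python
def trainable_groups(llm_mode, vit_mode):
    """Toy illustration of the LoRA/QLoRA/Full/Freeze combinations.

    Real code would toggle ``requires_grad`` on model parameters; here we
    just return the names of the groups that would stay trainable.
    """
    def groups(prefix, mode):
        if mode == "full":
            return {f"{prefix}.weights"}        # train all weights
        if mode in ("lora", "qlora"):
            return {f"{prefix}.lora_adapters"}  # only low-rank adapters
        if mode == "freeze":
            return set()                        # nothing trainable
        raise ValueError(f"unknown mode: {mode}")

    # Assumption: the projector bridging vision and language features is
    # always trained, as in standard LLaVA fine-tuning.
    return groups("llm", llm_mode) | groups("visual_encoder", vit_mode) | {"projector"}
```

For example, the common QLoRA-LLM / frozen-ViT setup trains only the LLM adapters and the projector.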

TODO

@LZHgrla LZHgrla marked this pull request as draft November 2, 2023 06:19
@pppppM pppppM merged commit 6b962e6 into InternLM:main Dec 26, 2023
1 check passed
llkn-2 pushed a commit to llkn-2/xtuner that referenced this pull request Jul 31, 2024
* v1

* add load_image

* update cfg image url

* del fig

* update

* temp

* update convert

* update chat_mm

* add exclude_frozen_parameters for deepspeed

* update chat

* update xtuner help msg

* fix bugs

* revert bf16 deepspeed

* fix bugs

* add visual_select_layer for chat

* improve pth_to_hf

* rename projecter_pth to pretrained_pth

* temp

* update requirements

* add cfgs

* update

* fix pre-commit

* optim chat

* optim chat

* Delete xtuner/model/unused.py

* move dispatch to a deeper folder

* add projector

* update

* del model/projector

* fix bugs

* add docs

* update

* update

* update

* update

* enhance resume for map_fn

* update import

* add llava_internlm_chat_7b_clip_vit_large_p14

* update dispatch

* update dispatch

* add link

* update max_length

* update max_length

* update hyp

* align

* move yi flash attn

* fix pre-commit

* update deepspeed requirements

* add mmbench script

* install openpyxl

* add entry_point for mmbench

* save args

* update mmbench

* update max_length

* add llama2 qlora

* update mmbench

* fix mmbench bugs

* use osp instead of os.path

* refactor pth_to_hf

* update chat and mmbench to support --llava

* align to chat

* update entry_point

* add vicuna template

* add vicuna_7b_v15

* fix pre-commit

* add vicuna_7b_v1.5 qlora

* skip_special_tokens for decode text

* remove do_sample

* add warmup

* fix pre-commit

* Update dataset_prepare.md

* Update dataset_prepare.md

* Add KEEP_SYSTEM for template

* remove

* fix vicuna template

* clean cfgs

* add cfgs

* fix pre-commit

* add --language for mmbench

* fix bugs

* fix pretrain bug

* support visual_encoder lora

* fix bugs

* add paramwise_cfg

* remove print_peft_model_trainable_parameters

* fix bugs

* add paramwise_cfg for DeepSpeedOptimWrapper

* fix engine deepspeed paramwise_cfg bug

* fix encode_fn bug

* fix

* fix pad_image_to_square bugs

* Add space for system to avoid mismatch of 'USER' token

* revert to adding bos_token at each conv

* revert for paramwise_cfg

* better cfgs?

* fix import bug

* fix import bug

* pretrain align

* update prepare_inputs_labels_for_multimodal

* 1792

* support length_grouped_samplers

* 1792

* remove KEEP_SYSTEM

* remove system in cfg

* update 336 cfg

* add torch_dtype for mmbench and chat

* group 50

* quant for pretrain

* update cfgs

* refactor cfgs

* add length for concat dataset

* update requirements

* fix typo

* add template for internlm pretrain

* no zh

* remove 20b cfgs

* fix pre-commit

* revert invalid input

* rename

* Update README.md

* Update README_zh-CN.md

* fix pre-commit

* remove llava_zh from docs

* qlora 512

* rename llava map_fn

* update cfgs

* update model urls

* add docs link

* add llava docs

* update docs

* update urls

* add citation

* fix README

* move

* update

* vicuna pretrain with prompt

* rename

* add results

* fix pre-commit

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* update

* Update README.md

* Update README_zh-CN.md

* Update README_zh.md

* Update README_zh.md

* Update README.md

* Update README_zh.md

* Update README.md

* Update README.md

* fix typo

* fix

* Update README.md

* Update README_zh-CN.md

* rename

* auto cn_string

* fix pre-commit

* rename

* remove language

* add VLMEvalKit

* rename VLLM to VLM

* add the download links of MMBench

* update

* update readme

* update

* update

* update merge

* fix cfg bug

* Update README.md

* Update README_zh.md

* update

* fix

* update requirements

* Update runtime.txt

* Update runtime.txt

* Update runtime.txt

* Update README.md

* Update README.md

* Update README_zh.md

* fix pre-commit

* fix

* update mmbench prompt

* fix bugs

* fix bugs

* update docs

* update

* update

* Update README.md
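Among the fixes above is `pad_image_to_square`, which pads a non-square input image to a square before it reaches the visual encoder. A minimal sketch of the idea on a plain 2D pixel grid (pure Python; the real helper operates on a PIL image, and this stand-in only illustrates the centering logic):

```python
def pad_to_square(image, fill=0):
    """Pad a 2D pixel grid (list of rows) to a square, centering the content.

    ``fill`` is the padding value; real code would use the image's mean
    background color instead of a constant.
    """
    h, w = len(image), len(image[0])
    size = max(h, w)
    # Pad each row to the target width, centering the original pixels.
    left = (size - w) // 2
    rows = [[fill] * left + row + [fill] * (size - w - left) for row in image]
    # Add blank rows above and below to reach the target height.
    top = (size - h) // 2
    blank_above = [[fill] * size for _ in range(top)]
    blank_below = [[fill] * size for _ in range(size - h - top)]
    return blank_above + rows + blank_below
```

Padding to a square (rather than stretching) preserves the aspect ratio, so objects are not distorted before the CLIP encoder resizes the image.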
Development

Successfully merging this pull request may close these issues.

  • will to add qwen-vl ? Thanks
  • will add multimodal model like minigpt,llava,mplug-owl ?
2 participants