The GPUs are RTX 3090s (eight visible). The training command is:

```bash
CUDA_VISIBLE_DEVICES=0,1,2,3,4,5,6,7 python finetune.py \
  --base_model 'yahma/llama-7b-hf' \
  --data_path 'dataset/openbookqa/train.json' \
  --output_dir ./finetuned_result/dora_r32_epoch_test1 \
  --batch_size 16 --micro_batch_size 16 --num_epochs 3 --scaling 4.0 \
  --learning_rate 2e-4 --cutoff_len 256 --val_set_size 120 --bottleneck_size 32 \
  --eval_step 80 --save_step 80 --adapter_name lora \
  --target_modules '["q_proj", "k_proj", "v_proj", "up_proj", "down_proj"]' \
  --lora_r 16 --lora_alpha 32 --use_gradient_checkpointing
```
**The resulting OBQA accuracy is 0.08333333333333333, far below the ~0.25 random-guess baseline for 4-choice OpenBookQA.**
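One detail worth noting: the command passes `--adapter_name lora` (with `--lora_r 16`), so this run appears to train plain LoRA rather than DoRA, despite the `dora_r32` output directory name. Purely as an assumption about how finetune.py wires these flags up, the arguments would correspond to a standard Hugging Face PEFT configuration along these lines (the actual script may construct its adapter differently, and the separate `--scaling 4.0` flag may override the usual alpha/r scaling):

```python
# Hypothetical sketch only: the PEFT config these flags would map to, assuming
# finetune.py builds a standard Hugging Face PEFT LoraConfig under the hood.
from peft import LoraConfig

config = LoraConfig(
    r=16,            # --lora_r 16
    lora_alpha=32,   # --lora_alpha 32
    target_modules=["q_proj", "k_proj", "v_proj", "up_proj", "down_proj"],
    lora_dropout=0.05,  # assumed default; not set on the command line
    bias="none",
    task_type="CAUSAL_LM",
)
```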