did anyone reproduce the transformer network with frozen GPT-2? #70
Comments
I want to know how the results of this evaluation are displayed. Can I get them without running train.py?
I use https://github.com/salaniz/pycocoevalcap to evaluate the results. I rewrite captions_val2014_fakecap_results.json in the "example" folder and run the command `python coco_eval_example.py`.
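For anyone else trying this, here is roughly what that evaluation looks like, based on pycocoevalcap's bundled example script; the file paths are assumptions, so point them at your own annotation and result files:

```python
from pycocotools.coco import COCO
from pycocoevalcap.eval import COCOEvalCap

annotation_file = 'captions_val2014.json'                  # ground-truth COCO captions
results_file = 'captions_val2014_fakecap_results.json'     # your generated captions

# load ground truth and generated results in COCO format
coco = COCO(annotation_file)
coco_result = coco.loadRes(results_file)

# restrict evaluation to the images you actually generated captions for
coco_eval = COCOEvalCap(coco, coco_result)
coco_eval.params['image_id'] = coco_result.getImgIds()
coco_eval.evaluate()

# prints BLEU, METEOR, ROUGE_L, CIDEr (and SPICE if installed)
for metric, score in coco_eval.eval.items():
    print(f'{metric}: {score:.3f}')
```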
I want to know whether you have reproduced the transformer results reported in the paper.
No, I have not reproduced this result.
I also trained the transformer-only model and evaluated it as you describe, and my result is similar to yours. It is not as good as the result in the paper. Have you solved it?
did anyone reproduce the transformer network with frozen GPT-2?
I enter the command:

```
python train.py --only_prefix --data ./data/coco/oscar_split_ViT-B_32_train.pkl --out_dir ./coco_train/ --mapping_type transformer --num_layers 8 --prefix_length 40 --prefix_length_clip 40
```
The model is trained on the MSCOCO dataset (train+val). On the test split I get BLEU-4 of 20.0 and CIDEr of 66.3, with the best result in the third epoch. This is well below what the paper reports: BLEU-4 of 33.53 and CIDEr of 113.08.
I am confused by this result. Did anyone reproduce it? Did I miss something?
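For context, here is a minimal sketch of the setup those flags select: GPT-2 kept frozen, with only a transformer mapping network trained to turn a CLIP embedding into a prefix. This is a simplification for illustration, not ClipCap's actual model.py (the real mapper also uses a learned prefix constant), and the dimensions are assumptions mirroring the command above:

```python
import torch
import torch.nn as nn
from transformers import GPT2LMHeadModel

class TransformerMapper(nn.Module):
    """Maps one CLIP embedding to a sequence of GPT-2 prefix embeddings.

    Simplified stand-in for ClipCap's transformer mapping network.
    """
    def __init__(self, clip_dim=512, gpt_dim=768, prefix_length=40, num_layers=8):
        super().__init__()
        self.prefix_length = prefix_length
        self.gpt_dim = gpt_dim
        # expand the single CLIP vector into prefix_length tokens
        self.linear = nn.Linear(clip_dim, prefix_length * gpt_dim)
        layer = nn.TransformerEncoderLayer(d_model=gpt_dim, nhead=8, batch_first=True)
        self.transformer = nn.TransformerEncoder(layer, num_layers=num_layers)

    def forward(self, clip_embed):                     # (batch, clip_dim)
        x = self.linear(clip_embed)
        x = x.view(-1, self.prefix_length, self.gpt_dim)
        return self.transformer(x)                     # (batch, prefix_length, gpt_dim)

gpt2 = GPT2LMHeadModel.from_pretrained("gpt2")
for p in gpt2.parameters():
    p.requires_grad = False   # frozen GPT-2: only the mapper gets gradients (--only_prefix)

mapper = TransformerMapper()
clip_embed = torch.randn(1, 512)   # placeholder for a real CLIP ViT-B/32 image embedding
prefix = mapper(clip_embed)        # prepended to caption token embeddings via inputs_embeds
```

During training, the prefix is concatenated with the caption token embeddings and fed to GPT-2, with the language-modeling loss computed on the caption tokens only.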