Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

did anyone reproduce the transformer network with frozen GPT-2? #70

Open
baiyuting opened this issue Jul 12, 2023 · 7 comments
Open

Comments

@baiyuting
Copy link

did anyone reproduce the transformer network with frozen GPT-2?

I enter command
python train.py --only_prefix --data ./data/coco/oscar_split_ViT-B_32_train.pkl --out_dir ./coco_train/ --mapping_type transformer --num_layers 8 --prefix_length 40 --prefix_length_clip 40

model is trained on mscoco dataset (train+val), the result on test split is
image

blue4 is 20.0 and cider is 66.3 , I got the best result in third epoch, but this result is less than what the paper gives, blue4 is 33.53 and cider is 113.08
image

I am confused about the result, did anyone reproduce the result? Did I miss something?

@rongtongxueya
Copy link

I want to know how the results of this evaluation are displayed, how can I not run train.py

@baiyuting
Copy link
Author

I use https://github.com/salaniz/pycocoevalcap to evaluate result, I rewrite the captions_val2014_fakecap_results.json in folder "example" and enter command "python coco_eval_example.py"

@cjc20000323
Copy link

I want to know have you reproduced the results of transformer reported in the paper

@baiyuting
Copy link
Author

no, I have not reproduced this result

@cjc20000323
Copy link

cjc20000323 commented Sep 25, 2023 via email

1 similar comment
@cjc20000323
Copy link

cjc20000323 commented Apr 10, 2024 via email

@qvqqa
Copy link

qvqqa commented Apr 12, 2024

I also trained the only-transformer model and evaluate as you say, and the result is similar to yours. It's not as good as the result in this paper. Have you solved it?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

4 participants