-
Notifications
You must be signed in to change notification settings - Fork 85
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Why Result is less than the one in the paper ? #12
Comments
Hello, @mohamedaboalimaa have you resolved this problem? |
No I couldn't get same results of the paper from the trained Model , Do you have an idea how ? |
Thanks for raising the concern, are they the models you train yourself or the model we provided? |
the model I used is the one I you provide "checkpoint_fashion200k.pth" ,the result that come out of this model (in test datset) is : ['1 ---> 2.7, |
@mohamedaboalimaa The provided checkpoint on Fashion200k is not well trained. I can get the same recall as the paper by do it myself.
Something you should add to your code to get the best result. You should set deterministic, save the checkpoint at the iteration with the lowest loss and remember to set train shuffle True.
I finally get it after 160k iterations. |
@huynhtruc0309 thank you for your response, how much time it takes to run 160K iterations? , because I tried once on my PC, 1 iteration almost takes one day. |
We updated HEAD so that the text model vocabulary is more deterministic now, please use it so that your model is saved correctly. Note that you still need the original training dataset to construct the model. I trained a new model for fashion200k
|
@lugiavn I would like to thank you very much, it works perfect now . |
Why Result is less than the one in the paper ?
The text was updated successfully, but these errors were encountered: