We read every piece of feedback, and take your input very seriously.
To see all available qualifiers, see our documentation.
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Hello
I want to fine tune the recognize-anything model to label images with tags for real people or cartoon characters. I have two questions:
Would fine tune just the ram++ be enough, or do I also need to work on the text2tag part?
Also, I'm not sure how to go about this step. Could you please provide a detailed explanation?
Prepare pretained Swin-Transformer, and set 'ckpt' in ram/configs/swin.
thanks.
The text was updated successfully, but these errors were encountered:
You can find some answers here: #173
I think you don't need the step "Prepare pretained Swin-Transformer". You just need to fine-tune the model. No need for steps 1 to 5.
I'm also trying to train the model. It is not an easy task!
Sorry, something went wrong.
No branches or pull requests
Hello
I want to fine tune the recognize-anything model to label images with tags for real people or cartoon characters. I have two questions:
Would fine tune just the ram++ be enough, or do I also need to work on the text2tag part?
Also, I'm not sure how to go about this step. Could you please provide a detailed explanation?
thanks.
The text was updated successfully, but these errors were encountered: