Some questions about fine-tuning recognize-anything model #174

Open

weijiafs opened this issue Apr 16, 2024 · 1 comment

weijiafs commented Apr 16, 2024

Hello,

I want to fine-tune the recognize-anything model to label images with tags for real people or cartoon characters. I have two questions:

1. Would fine-tuning just RAM++ be enough, or do I also need to work on the text2tag part?
2. I'm not sure how to go about the following step. Could you please provide a detailed explanation?

   "Prepare pretrained Swin-Transformer, and set 'ckpt' in ram/configs/swin."

Thanks.

adbmdp commented Apr 17, 2024

You can find some answers here:
#173

I don't think you need the "Prepare pretrained Swin-Transformer" step.
You only need to fine-tune the model, so steps 1 to 5 are not required.
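
If you do end up needing that step, here is a rough sketch of what "set 'ckpt'" could look like. It assumes the files under ram/configs/swin are JSON and contain a 'ckpt' key. The helper name `set_swin_ckpt` and the paths in the usage comment are hypothetical, so check them against your own checkout before relying on this.

```python
import json
from pathlib import Path


def set_swin_ckpt(config_path, ckpt_path):
    """Point the 'ckpt' entry of a JSON config at a pretrained checkpoint.

    Assumption: the config is a flat JSON object with a 'ckpt' key.
    All other keys are left unchanged.
    """
    config_file = Path(config_path)
    config = json.loads(config_file.read_text())
    config["ckpt"] = str(ckpt_path)  # overwrite only the checkpoint path
    config_file.write_text(json.dumps(config, indent=4))
    return config


# Hypothetical usage; both paths are placeholders, not real file names:
# set_swin_ckpt("ram/configs/swin/<your_config>.json",
#               "pretrain_model/<your_swin_checkpoint>.pth")
```

Editing the file by hand in a text editor does the same thing; the function is only useful if you switch checkpoints often.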

I'm also trying to train the model. It is not an easy task!
