Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Some questions about fine-tuning recognize-anything model #174

Open
weijiafs opened this issue Apr 16, 2024 · 1 comment
Open

Some questions about fine-tuning recognize-anything model #174

weijiafs opened this issue Apr 16, 2024 · 1 comment

Comments

@weijiafs
Copy link

Hello

I want to fine tune the recognize-anything model to label images with tags for real people or cartoon characters. I have two questions:

  1. Would fine tune just the ram++ be enough, or do I also need to work on the text2tag part?

  2. Also, I'm not sure how to go about this step. Could you please provide a detailed explanation?

Prepare pretained Swin-Transformer, and set 'ckpt' in ram/configs/swin.

thanks.

@adbmdp
Copy link

adbmdp commented Apr 17, 2024

You can find some answers here:
#173

I think you don't need the step "Prepare pretained Swin-Transformer".
You just need to fine-tune the model. No need for steps 1 to 5.

I'm also trying to train the model. It is not an easy task!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants