Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Test model training #292

Open
wants to merge 4 commits into
base: main
Choose a base branch
from
Open

Test model training #292

wants to merge 4 commits into from

Conversation

RunDevelopment
Copy link
Member

@RunDevelopment RunDevelopment commented Jul 13, 2024

This adds a util function to test that models can train and uses it to test all image models.

Out of 44 models, 10 fail. I added a TODO comment to all of those. We have to decide how to deal with them.


I also noticed that it's impossible to use the call API for training, which is a shame.

@joeyballentine
Copy link
Member

I don't think the call API should be used for training anyway imo. People training the arches should be using the raw output without any of the extra things we do like clipping, since they wouldn't be running losses on the actual model output

@joeyballentine
Copy link
Member

Alternatively, we could have an optional named arg for whether or not the call api is being used during training, and then not do eval if thats set to true?

@joeyballentine
Copy link
Member

But, i still kinda think that if you're training, you shouldnt be using our model descriptor and should instead just be loading the raw arches

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

2 participants