Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Classifier Performance Stats #8

Open
a-desmons opened this issue Apr 1, 2022 · 2 comments
Open

Classifier Performance Stats #8

a-desmons opened this issue Apr 1, 2022 · 2 comments

Comments

@a-desmons
Copy link
Collaborator

This issue lists the performance stats after adding a classifier layer to our SSML model.

Total number of objects in data set: 45,128
Number of objects with known tidal features: 187

  • Images were first encoded and the Classifier Layer was trained for 500 epochs
  • The Classifier outputs a single number between 0 and 1, with 0 signifying "object with no tidal features", and 1 signifying "object with tidal features"

From the 187 images with known tidal features, 12 were set aside for testing performance and testing for overfitting - These are shown here

All images with known tidal features (i.e. the 187) were found within the first 7038 images with a ranking closest to 1.

The table below shows:

  • Test image number: The image used for testing
  • Similarity index: Its ranking determined by the classifier, with 0 being the "best" and 45,128 being the "worst"
Test image number Similarity Index
1 3982
8 307
18 4030
27 3800
76 451
86 997
97 418
117 21
119 7037
161 869
163 113
177 391

And a histogram of the classifier's performance in finding the images with known tidal features:

71788D37-204A-4764-A1B8-E2A529CDBBFD_4_5005_c

@a-desmons
Copy link
Collaborator Author

Comparing the performance of just the model (with single galaxy as query) to model+classifier.

x-axis: Similarity index in thousands (i.e. 1 = 1000)
y-axis: Fraction of galaxies with tidal features found

download-1

Zooming in on first half of the plot:
download-2

@a-desmons
Copy link
Collaborator Author

Sampling results by taking 6 images every 10,000 similarity-ranked images

sample_by_10000

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant