You signed in with another tab or window. Reload to refresh your session.You signed out in another tab or window. Reload to refresh your session.You switched accounts on another tab or window. Reload to refresh your session.Dismiss alert
Creating plate clusters for training and validation splits
In order to split all the Stain2, Stain3, Stain4, and Stain5 (condition C) plates into clusters that are most similar to each other, I created a hierarchical cluster map based on the PC1 loadings of the mean aggregate profiles of these plates. First including all the outliers and then iteratively removing these until I find x clusters which are similar enough so that they can be used for training and validation.
Main takeaways
The final 7 clusters were chosen based on the largest clusters that can be seen in the final clustermap iteration.
Cluster number 5, 6, and 7 are the highest quality clusters, i.e. have the highest correlation of the PC1 loadings of the plates.
We already now that the model beats the baseline on cluster 1 for most plates, except on BR00113818 and BR00112199. However, note that this is actually one of the most diverse diverse clusters of the 7 I have created here.
Creating plate clusters for training and validation splits
In order to split all the Stain2, Stain3, Stain4, and Stain5 (condition C) plates into clusters that are most similar to each other, I created a hierarchical cluster map based on the PC1 loadings of the mean aggregate profiles of these plates. First including all the outliers and then iteratively removing these until I find x clusters which are similar enough so that they can be used for training and validation.
Main takeaways
Clustermap all plates
plate cluster 1: 3 plates
plate cluster 2: 67 plates
plate cluster 3: 9 plates
plate cluster 4: 1 plate
Clustermap remove iteration 3
plate cluster 1: 8 plates
plate cluster 2: 8 plates
plate cluster 3: 41 plates
plate cluster 4: 2 plates
plate cluster 5: 9 plates
plate cluster 6: 2 plates
plate cluster 7: 3 plates
Clustermap final iteration
Cluster numbering goes from top to bottom (where the bottom right cluster is number 7)
Clusters final iteration
The text was updated successfully, but these errors were encountered: