
The results of validation/test accuracy in NATS-Bench paper #38

Closed
Tommy787576 opened this issue Jan 27, 2022 · 4 comments
Labels: question (Further information is requested)

Comments

@Tommy787576

Tommy787576 commented Jan 27, 2022

Hi! I want to get the validation and test accuracy as Table 4 in "NATS-Bench: Benchmarking NAS Algorithms for Architecture Topology and Size" paper. I just want to check the following commands are correct or not:

After finishing the architecture search (I'm studying the weight-sharing approach), I get the genotype. Then, I get the arch_index via:
arch_index = api.query_index_by_arch('......genotype here......')
Then, for CIFAR-10 validation accuracy:
info = api.get_more_info(arch_index, 'cifar10-valid', hp=200)
for CIFAR-10 test accuracy:
info = api.get_more_info(arch_index, 'cifar10', hp=200)
for CIFAR-100 validation/test accuracy:
info = api.get_more_info(arch_index, 'cifar100', hp=200)
for ImageNet16-120 validation/test accuracy:
info = api.get_more_info(arch_index, 'ImageNet16-120', hp=200)
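
For reference, here is a minimal sketch of how I read the numbers back out of the returned info dict. The key names ('valid-accuracy', 'test-accuracy', 'valtest-accuracy') and passing hp as the string '200' are my assumptions based on the NATS-Bench README; please correct me if they differ:

```python
# Sketch only: key names are assumed from the NATS-Bench README.
info = api.get_more_info(arch_index, 'cifar10-valid', hp='200')
print('CIFAR-10 validation accuracy:', info['valid-accuracy'])

info = api.get_more_info(arch_index, 'cifar10', hp='200')
print('CIFAR-10 test accuracy:', info['test-accuracy'])

info = api.get_more_info(arch_index, 'cifar100', hp='200')
print('CIFAR-100 valid / test / valtest accuracy:',
      info['valid-accuracy'], info['test-accuracy'], info['valtest-accuracy'])

info = api.get_more_info(arch_index, 'ImageNet16-120', hp='200')
print('ImageNet16-120 valid / test / valtest accuracy:',
      info['valid-accuracy'], info['test-accuracy'], info['valtest-accuracy'])
```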

Following are some points I want to check:

  1. Do the CIFAR-10 test accuracy results use the train + valid set for training and the test set for testing? In that case, should I use 'cifar10' instead of 'cifar10-valid' to get the test accuracy?
  2. What does valtest-accuracy mean for CIFAR-100 and ImageNet16-120?
  3. For ImageNet16-120, I get an architecture with a validation accuracy higher than the Optimal values reported in the paper. Why?
    (screenshot attached)

Thanks a lot!

@D-X-Y D-X-Y self-assigned this Jan 27, 2022
@D-X-Y D-X-Y added the question Further information is requested label Jan 27, 2022
@D-X-Y
Owner

D-X-Y commented Jan 28, 2022

Thanks for your interest in NATS-Bench.

1. Yes, please use 'cifar10' to get the test accuracy.

2. valtest-accuracy means the accuracy on the union of the validation and test sets. Note that the validation and test sets here follow the split strategy in our paper, which is different from the original CIFAR-100 setting.

3. Possibly because you are using is_random=True, which randomly selects a seed. When we report the Optimal values, we use the average results over all seeds (see the short sketch below).
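
To make points 1 and 3 concrete, here is a minimal sketch, assuming the dictionary keys of the current nats_bench API (please double-check against your installed version):

```python
# is_random=False -> average over all trained seeds (matches the reported numbers);
# is_random=True (the default) -> the result of one randomly chosen seed.
info = api.get_more_info(arch_index, 'cifar10', hp='200', is_random=False)
mean_test_acc = info['test-accuracy']    # CIFAR-10: trained on train+valid, evaluated on the test set

info = api.get_more_info(arch_index, 'ImageNet16-120', hp='200', is_random=False)
mean_valid_acc = info['valid-accuracy']  # compare this averaged value against the paper's Optimal entry
```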

@Tommy787576
Author

Hi, thank you for the quick reply!
So I should set is_random=False to get the average results over all seeds. Am I correct?

@D-X-Y
Owner

D-X-Y commented Jan 28, 2022

Yes, when you use is_random=False, you get the average results.

If you are benchmarking your own NAS algorithm, we also suggest using the simulate_train_eval API, as in our examples (a short sketch follows): https://github.com/D-X-Y/AutoDL-Projects/blob/main/exps/NATS-algos/regularized_ea.py#L144
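
For completeness, a minimal sketch of that simulated workflow, assuming the argument names and the four returned values of the current nats_bench API (verify against your installed version):

```python
# Simulated train-and-eval query under the short (12-epoch) schedule.
# For CIFAR-10 the reported accuracy is on the NATS-Bench validation split.
# `genotype` is the searched architecture (the same string passed to query_index_by_arch).
valid_acc, latency, time_cost, total_time_cost = api.simulate_train_eval(
    genotype, dataset='cifar10', hp='12')

# valid_acc:        validation accuracy after the 12-epoch training schedule
# latency:          measured inference latency of this architecture
# time_cost:        simulated train + eval time of this single query
# total_time_cost:  accumulated simulated cost over all queries so far,
#                   used to plot accuracy-vs-search-time curves
```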

@Tommy787576
Author

Ok. Thank you so much!
Hope you have a wonderful day!

@D-X-Y D-X-Y pinned this issue Mar 5, 2022
@D-X-Y D-X-Y mentioned this issue Mar 5, 2022