Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Hyper-parameter #13

Open
mmnn-00 opened this issue Aug 28, 2024 · 3 comments
Open

Hyper-parameter #13

mmnn-00 opened this issue Aug 28, 2024 · 3 comments

Comments

@mmnn-00
Copy link

mmnn-00 commented Aug 28, 2024

Thanks for your codes. However, I have a question about setting hyper-parameters.
When using CIFAR10, the DQ paper has the following description:

image

According to the above description, I set the hyperparameters as below, although it is different from the sample code written inthe README. Are the following correct? (Changed '-se' from 0 to 10, and removed '--pretrained')
# Dataset bin generation (By default we use a bin number of 10) CUDA_VISIBLE_DEVICES=0 python -u quantize_sample.py \ --fraction 0.1 --dataset CIFAR10 --data_path ~/data_cifar \ --num_exp 10 --workers 10 -se 10 --selection Submodular --model ResNet18 \ -sp ../results/bin_cifar_010 \ --batch 128 --submodular GraphCut --submodular_greedy NaiveGreedy

@vimar-gu
Copy link
Collaborator

Thanks for pointing out the inconsistency between the paper and the implementation. In the paper we kept it consistent with the original Deepcore setting to conduct 10-epoch pre-training. And directly adopting pre-trained model is also practical. Please try running these two scripts and compare the performance.

@mmnn-00
Copy link
Author

mmnn-00 commented Aug 29, 2024

I got it. I will try. Thank you very much!

@haiduo
Copy link

haiduo commented Sep 5, 2024

I got it. I will try. Thank you very much!

Hi @mmnn-00 , what are your reproducible results, please?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants