
Evaluation on multiple GPUs #73

Open
ezhang7423 opened this issue Sep 22, 2024 · 1 comment

Comments

@ezhang7423

ezhang7423 commented Sep 22, 2024

Hi there! I am trying to run evaluation with multiple GPUs. I have run everything in ./scripts/minimal_example.sh. Running on 1 GPU works perfectly; however, when I pass in more than 1 GPU, all Hugging Face Generator workers still initialize only on the first GPU, leaving the rest unutilized. Unfortunately, --use-vlm does not appear to be working either, as an error is thrown that the T5 model is not supported. Any support would be greatly appreciated.
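For reference, a sketch of the kind of multi-GPU invocation in question, assuming the evaluate.py flags shown in the reply below (the data path, checkpoint, device list, and worker/GPU counts are illustrative placeholders, not a verified command):

```bash
# Hypothetical multi-GPU run: flags mirror the maintainer's command below;
# paths, checkpoint, and counts are placeholders for illustration.
# CUDA_VISIBLE_DEVICES restricts which physical GPUs the workers can see.
CUDA_VISIBLE_DEVICES=0,1 python prover/evaluate.py \
  --data-path data/leandojo_benchmark_4/random/ \
  --gen_ckpt_path kaiyuy/leandojo-lean4-tacgen-byt5-small \
  --split test \
  --num-workers 8 \
  --num-gpus 2
```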

@yangky11
Member

Hi,

I'm not able to reproduce the problem. I tried running `python prover/evaluate.py --data-path data/leandojo_benchmark_4/random/ --gen_ckpt_path kaiyuy/leandojo-lean4-tacgen-byt5-small --split test --num-workers 40 --num-gpus 8` and got the GPU utilization below. It seems all GPUs are running the evaluation.

[Screenshot: GPU utilization showing all 8 GPUs in use during evaluation]
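As a quick way to confirm the same thing locally, watching nvidia-smi while the evaluation runs shows whether every visible GPU picks up a worker (this is the standard NVIDIA tool, not part of this repo; the refresh interval is arbitrary):

```bash
# Refresh GPU utilization and memory every 2 seconds during evaluation;
# each GPU should show non-zero memory usage once the generator workers start.
watch -n 2 nvidia-smi
```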
