Release algoperf-benchmark-0.1.4 · mlcommons/algorithmic-efficiency

Upgrade CUDA version to CUDA 12.1:

Upgrade CUDA version in Dockerfiles that will be used for scoring.
Update Jax and PyTorch package version tags to use local CUDA installation.

Add flag for completely disabling checkpointing.

Note that we will run with checkpointing off at scoring time.

Update Deepspeech and Conformer variant target setting configurations.

Note that variant targets are not final.

Fixed bug in scoring code to take best trial in a study for external-tuning ruleset.

Added instructions for submission.

Changed default number of workers for PyTorch data loaders to 0. Running imagenet workloads with >0 may lead to incorrect eval results see #732.
Update: for speech workloads the pytorch_eval_num_workers flag to submission_runner.py has to be set to >0, to prevent data loader crash in jax code.

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

algoperf-benchmark-0.1.4