Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

fix: record progress in partial units [DET-3869] #1091

Merged
merged 1 commit into from
Aug 14, 2020
Merged

fix: record progress in partial units [DET-3869] #1091

merged 1 commit into from
Aug 14, 2020

Conversation

stoksc
Copy link
Contributor

@stoksc stoksc commented Aug 14, 2020

Description

This change alters trials to report units completed as a float64 so that partial units can be reported correctly. Previously, if the searcher was configured in terms of epochs and anything less than an epoch was completed in a workload, the reported completed model.Length was rounded to the nearest epoch; this usually ends up being 1 (except in the case of unusually small epochs or unusually large scheduling units) and makes progress appear completely wrong.

Test Plan

  • test locally with an example that was broken previously
  • add a unit test that checks searcher progress with records, batches and epochs.

Commentary

The weird thing about "add a unit test that checks searcher progress with records, batches and epochs" is that when an experiment completed we destroy the progress, so it seems impossible to write without being super flaky.

Copy link
Member

@rb-determined-ai rb-determined-ai left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

lgtm

@hkang1 hkang1 assigned stoksc and unassigned rb-determined-ai Aug 14, 2020
@stoksc stoksc merged commit 0b3b7e8 into determined-ai:master Aug 14, 2020
@stoksc stoksc deleted the fix-searchers branch August 14, 2020 22:16
@dannysauer dannysauer added this to the 0.13.0 milestone Feb 6, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants