Skip to content
This repository has been archived by the owner on Sep 18, 2024. It is now read-only.

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

correctly deal with job retries on openpai #919

Closed
QuanluZhang opened this issue Mar 26, 2019 · 1 comment
Closed

correctly deal with job retries on openpai #919

QuanluZhang opened this issue Mar 26, 2019 · 1 comment
Assignees
Labels
enhancement New feature or request nnidev
Milestone

Comments

@QuanluZhang
Copy link
Contributor

QuanluZhang commented Mar 26, 2019

Short summary about the issue/question: a job on openpai may be retried. In the current version, nni is not aware of such event. This may induce potential issues, for example, an assessor may find such a trial's learning curve is strange, leading to incorrect behavior.

Brief what process you are following: normal

How to reproduce it: when a trial is retried on openpai

nni Environment:

  • nni version: 0.5.2
  • nni mode(local|pai|remote): pai
  • OS: ubuntu
  • python version: 3.5
  • is conda or virtualenv used?: no
  • is running in docker?: no

Anything else we need to know:
Related to #863 and #865.

@leelaylay
Copy link
Contributor

leelaylay commented Mar 26, 2019

I think it is an important bug need to be fixed. Related to #863 and #865.

@scarlett2018 scarlett2018 added the enhancement New feature or request label Apr 10, 2019
@ultmaster ultmaster added this to the Backlog milestone Oct 20, 2019
@microsoft microsoft locked and limited conversation to collaborators Jun 9, 2021

This issue was moved to a discussion.

You can continue the conversation there. Go to discussion →

Labels
enhancement New feature or request nnidev
Projects
None yet
Development

No branches or pull requests

5 participants