Releases: determined-ai/determined
Releases · determined-ai/determined
0.28.1
Release Notes
Changelog
- f6cb624 chore: bump version: 0.28.1-rc3 -> 0.28.1
- baaa3bd docs: add release notes for 0.28.1 (#8861)
- fbf9df4 chore: bump version: 0.28.1-rc2 -> 0.28.1-rc3
- a965f15 ci: fix broken ci due to queue version change (#8853)
- d91e8b0 chore: bump version: 0.28.1-rc1 -> 0.28.1-rc2
- 3129d33 fix: revert config work from #8765 and #8789 due to feature regressions (#8849)
- 1888b90 chore: bump version: 0.28.1-rc0 -> 0.28.1-rc1
- c443073 fix: show error message from backend API for workspace deletion (#8848)
- 1fc1496 fix: job queue test failures (#8843)
- a74685f chore: bump version: 0.28.1-dev -> 0.28.1-rc0
- 5b2e32d chore: cleanup the last traces of experiment git fields. [MD-258] (#8830)
- 92a380f feat: Generic task restore (#8802)
- b0fa7dc feat: generic tasks: support startup hooks (#8840)
- ca80022 chore: bunify postgres_checkpoints and add tests (#8783)
- a4dbc03 chore: fix error on terminating experiments on restart (#8837)
- aa98d82 chore: agent state wasn't getting deleted and logged error (#8838)
- bb469fa fix: update hew with bugfixes (#8839)
- 393cfde Fix broken ref (#8836)
- 7a13863 perf: improve GetExperiments + SearchExperiments counting (#8801)
- d8d9965 chore: remove unused SetAllocationName (#8829)
- 1946d9a docs: Update slurm install (#8832)
- 1fd21e7 fix: Fix small typo in Webhook documentation (#8820)
- e341e27 feat: Generic Tasks (#8724)
- fff85e3 fix: handle helm templating in older go template versions (#8828)
- f300d97 chore: hide genai helm values config and fix var name (#8821)
- 6206bde feat: add streaming updates core functionality and project streaming (#8669)
- ed61121 fix: stop truncating log timestamps to avoid missing logs [WEB-1791] (#8815)
- 43d3f21 fix: check for models before deleting workspace (#8804)
- bb59fa2 ci: wait longer for performance test db to startup (#8796)
- cfffe96 docs: Remove legacy pages (#8818)
- 1c3f3c4 fix: mitigate many unnecessary api calls in user management table (#8816)
- 4612c41 fix: agent config precedence (#8656)
- 762fcef feat: Deploy GenAI in Helm (#8727)
- 8e067d9 fix: remove possible hang from ship_logs.py [MLG-1565] (#8803)
- 1daf9d3 docs: remove duplicated note (#8813)
- 56e7000 fix: remove extra quotes around IdentifyTask (#8792)
- 3805ebd chore: add testing for k8s informer panic (#8810)
- a35696d refactor: condense trial update functions (#8808)
- 45c578b chore: bump version: 0.28.0-dev0 -> 0.28.1-dev
- 6520629 chore: add docs dropdown link for new version
- ed2136d docs: add release notes for 0.28.0 (#8807)
- 8258565 chore: bump version: 0.27.2-dev0 -> 0.28.0-dev
- c5afb6c fix: fetch experiment in case config data is not contained (#8789)
- 4e17ef7 chore: differentiate between programmatic and web page requests (#8795)
- a1a6e20 chore: add ee helm chart changes to oss (#8799)
- 65c811c docs: Add mention of RPMs to on-prem _index.rst (#8773)
- ad765d4 docs: adds/corrects EE changes, merges to OSS (#8788)
- f1a45ae perf: update proto_checkpoint_view to use index (#8793)
- abd590d Revert "docs: Update oidc and saml docs (#8777)" (#8791)
- bb88b01 fix: improve trial log request cancelling (#8787)
- 17f305f ci: make perf tests only alert on failure (#8790)
- 422f5aa perf: avoid loading model def in experiment model (#8742)
- 7698452 perf: improve GetExperiments showTrialData performance (#8753)
- e801cfe perf: add index to checkpoints_v2 id (#8758)
- 71db4e1 perf: add indexes to tasks and allocations (#8757)
- e0e6cf0 perf: improve get_workspaces query (#8751)
- ef656bc perf: improve resource agg performance (#8735)
- e873381 fix: retry watcher failure causes infinite loop (#8786)
- ba2f190 fix: replace experiment config (#8765)
- 85d1053 chore: rename postgres_command_intg_test.go (#8785)
- 40a70cf test: performance test CI work (#8761)
- 36a2e29 chore: bunify db/postgres_tasks.go (#8764)
- 07494cf fix: update hew to a version without broken documentcard prompts (#8782)
- 9502059 feat: GCS client should retry on
TooManyRequests
. (#8780) - ec850ae test: add intg tests for db/postgres_tasks.go (#8750)
- 9ec2f7d chore: update gke version to comply with latest release for e2e tests (#8781)
- cefa242 chore: persist checkpoint storage backend ID (#8690)
- 905e449 chore: migrate db schema trials to runs (#8723)
- dfbb926 chore: clean up leftover debug print statements (#8755)
0.28.0
Release Notes
Changelog
- 7f9b082 chore: bump version: 0.28.0-rc4 -> 0.28.0
- ed1b7f0 docs: add release notes for 0.28.0 (#8807)
- c4b6f57 chore: bump version: 0.28.0-rc3 -> 0.28.0-rc4
- 27ce0a2 chore: add ee helm chart changes to oss (#8799)
- 959a096 chore: add ee helm chart changes to oss (#8799)
- f513174 chore: bump version: 0.28.0-rc2 -> 0.28.0-rc3
- 083c314 chore: bump version: 0.28.0-rc1 -> 0.28.0-rc2
- e272bf0 chore: bump version: 0.28.0-rc0 -> 0.28.0-rc1
- 6080d39 chore: bump version: 0.27.2-rc4 -> 0.28.0-rc0
- 3cbce1d chore: bump version: 0.27.2-rc3 -> 0.27.2-rc4
- 89df98b docs: adds/corrects EE changes, merges to OSS (#8788)
- 2e27c71 chore: bump version: 0.27.2-rc2 -> 0.27.2-rc3
- 1abe34f chore: bump version: 0.27.2-rc1 -> 0.27.2-rc2
- e23e162 fix: improve trial log request cancelling (#8787)
- 55b5bd4 chore: bump version: 0.27.2-rc0 -> 0.27.2-rc1
- 5edfd81 fix: retry watcher failure causes infinite loop (#8786)
- 6a21d44 fix: update hew to a version without broken documentcard prompts (#8782)
- 74e341d chore: bump version: 0.27.2-dev0 -> 0.27.2-rc0
- ea9e903 chore: lock published urls to preserve redirects
- 0321e1f chore: lock api state for backward compatibility check
- 3783f2b docs: Update oidc and saml docs (#8777)
- 141afa4 docs: update dependency version in contributing readme (#8776)
- 994527f fix: Text filter on ProjectMoveModal (#8775)
- aa65c07 chore: use vite-plugin-svg-to-jsx package (#8772)
- 98c61f3 test: do not import model_hub test requirements (#8771)
- 1e2da10 ci: retry git fetch for early stopping checks (#6318)
- c73712b docs: Replace basic quickstart (#8770)
- fda515d fix: python requirements for pytest and moto (#8769)
- 78929c0 fix: Filter value resets when switching column types [WEB-1949] (#8731)
- 7ddf965 docs: Fix minor issues (#8768)
- 31f6f99 fix: add default transport to proxy connection (#8767)
- 149b7fa build(deps): bump slackapi/slack-github-action from 1.24.0 to 1.25.0 (#8766)
- 719169a docs: Fix dropdown url (#8763)
- 56406a2 Update helm chart config ref (#8762)
- 5973f8e chore: bump version: 0.27.1-dev0 -> 0.27.2-dev0
- 260c2bc chore: add docs dropdown link for new version
- 4b4d14a docs: add release notes for 0.27.1 (#8746)
- 7841d9e feat: the new quick start guide link (#8759)
- 995311a feat: expconf flag to force scheduling on a single node/container/pod (#8743)
- 64d588f refactor: use hew Tree and Divider components [WEB-1920] (#8736)
- f771acb fix: cease many model fetch api calls in checkpoint tab (#8749)
- 96b9064 docs: Add qs for webui users (#8754)
- d68ffaa docs: API deprecate returning config for bulk endpoints (#8732)
- f21a516 tests: cover queries inside internal/users/postgres_users.go (#8729)
- 90a57cb fix: Experiment table, right-click context menu [WEB-1942] (#8756)
- 2ffc18f chore: import missing EE helm chart change [ci skip] (#8747)
- 87b6cf3 fix: use the new genai docker repo (#8745)
- 7c3650f chore:
make devcluster
to rebuild bindings before harness and webui. (#8748) - 7f3ddfb feat: Add a modal to enable/disable Agents [WEB-1718] (#8721)
- bd0a9ea fix: pagination fix in model detail page (#8744)
- 43c074e feat: helm option to mount shared_fs checkpoints to master (#8741)
- 9f06d35 fix: use selected checkpoints when registering (#8739)
- 6db8c06 test: cover agent_state.go SQL queries (#8740)
- eb48302 test: cover db.GroupCheckpointUUIDsByExperimentID (#8508)
- d661404 fix: compress data from API for the page load performance improvement (#8720)
- b71da7a fix: batch metric writes to TensorBoard [MLG-990] (#8688)
- bd78ec1 feat: Preserve 'redirect' query during logout [GAS-489] (#8728)
- 62941a2 refactor: remove antd App component [WEB-1922] (#8713)
- bf5b1d1 chore: fix unused-imports warning in protos build. (#8726)
- 0bda0d9 fix: use hew Alert [WEB-1918] (#8711)
- 1c21f6a chore: Move from internal glide-table-grid to v6.0.0 [WEB-1945] (#8725)
- f32e015 fix: local checkpoint download path fix (#8722)
- 190af1d docs: [FE-270] add PBS known issue - Cluster tab does not display GPU information (#8719)
- 6d744f7 feat: content-length for tar checkpoint downloads (#8684)
- 11e3ba9 chore: upgrade
[email protected]
(#8718) - 92fe3a6 docs: [FE-269] Add documentation detailing configuration steps to set the values for ngpus. (#8714)
- b69a49c chore: update github path in docker docs (#8687)
- faea553 chore: codecov reports to match go coverage reports (#8696)
- 0782c35 chore: standardize oidc/saml group & display attribute names in helm config (#8689)
- acca434 chore: update oss/ee oidc & saml helm config (#8680)
- 7188b69 fix: use Hew dropdown on FilterGroup [WEB-1938] (#8715)
- a410c45 chore: Upgrade to vite 5 (#8676)
- dbeb458 fix: support
CommandState
for experiment icon (#8709) - 83fe474 docs: fix references on children of "training reference" root (#8708)
- 71eaa5a chore: Replace antd reset.css with modern-normalize (#8706)
- e0e08b6 fix: Update hew for chart fix, avoid error from Typography.Label (#8712)
- fef93a4 build(deps): bump actions/cache from 3 to 4 (#8710)
- 4aedded docs: Update docs to pass linter (#8705)
- 00c2746 fix: restore original user store poll on leaving workspace details (#8702)
- 73760cd Revert "docs: Update docs to pass linter" (#8704)
- 4b7b705 [docs] Update docs to pass linter (#8703)
- 2402133 docs: Update Docker Installation Instructions (#8659)
- f8a2434 docs: Update Linux distros, add WSL, and archs to Quickstart (#8662)
- 132919f docs: Overhaul WSL deployment instructions (#8658)
- e7dc7aa chore: Replace custom archived note with Hew badge (#8695)
- f2899cc fix: fix CreateExperiment for Remote Users (#8700)
- f00768f chore: remove unused files (#8698)
- 2e60167 chore: TrialsComparisonModal style fixes [WEB-1919] [WEB-1909] (#8674)
- e8d6448 Revert "fix: restore original user store poll on leaving workspace details"
- 6d3f9ff fix: playwright fix (#8699)
- c869ce7 fix: restore original user store poll on leaving workspace details
0.27.1
Release Notes
Changelog
- e05d57d chore: bump version: 0.27.1-rc1 -> 0.27.1
- 94316ae docs: add release notes for 0.27.1 (#8746)
- b8a772a feat: helm option to mount shared_fs checkpoints to master (#8741)
- 3abd7c7 fix: use selected checkpoints when registering (#8739)
- c479a28 chore: bump version: 0.27.1-rc0 -> 0.27.1-rc1
- 6e58dca fix: playwright fix (#8699)
- 204c32e fix: use Hew dropdown on FilterGroup [WEB-1938] (#8715)
- f2baf39 fix: support
CommandState
for experiment icon (#8709) - 7a9b0fd fix: Update hew for chart fix, avoid error from Typography.Label (#8712)
- 4085b42 fix: fix CreateExperiment for Remote Users (#8700)
- 23574cf chore: bump version: 0.27.1-dev0 -> 0.27.1-rc0
- e2cf980 chore: bumpenvs 0.27.1 (#8701)
- 4fffb09 docs: Update references to RHEL and CentOS to Enterprise Linux (#8660)
- 6ccc7dc docs: Fix broken ext link (#8697)
- 06d7669 docs: fix a few hyperlinks in the Python SDK reference. (#8693)
- e0558f9 fix: Workspace icons display redundant tooltips [WEB-1912] (#8677)
- 0329667 fix: docs version switcher [MLG-1524] (#8692)
- 130ebb7 fix: Trial APIs
--local
should respect.detignore
. [MLG-1352] (#8683) - 3d69af2 test: make compute_stats have a min tolerance of 3 seconds (#8691)
- 4a30ce8 chore: improve error message for continuing a completed hp search [DET-10041] (#8636)
- 11a0f27 fix: add zmq heartbeat in DistributedContext [MLG-1133] (#8681)
- 10aaf02 chore: update helm image path to use main branch (#8686)
- 8671fbd feat: set jupyter notebook file browser root to
/
. (#8678) - 2fdda37 fix: update hf trainer api example (#8685)
- 4118806 fix: HP ScatterPlots scrollbar (#8682)
- 248e88c fix: update helm chart for new logo (#8675)
- 4e8d9a9 feat: Truncate and pad cell values in glide table [WEB-1778] (#8665)
- 38e6825 fix: Remove out of place spinner at dashboard page (#8668)
- df67efe fix: use FixedSizeList for column picker to fix jitter (#8628)
- 127cf98 fix: break out stopping... states from stopped states in webui (#8672)
- 766577d feat: add health check for genai deployments (#8613)
- 0d93855 feat:
det (notebook|tensorboard|shell|command) ls --sort-by ...
[DET-6126] (#8649) - 39ad129 fix: wait on algolia search upload (#8661)
- f5bebc3 chore: delete unused project model (#8673)
- c99bfc3 fix: Helm master-service clusterip mngt. openshift (#8546)
- 4354c8e chore: reinstate original DuplicateError to postgres_users.go (#8657)
- 67a2c40 chore: bump version: 0.27.0-dev0 -> 0.27.1-dev0
- 8446c43 chore: add docs dropdown link for new version
- 8651f41 docs: add release notes for 0.27.0 (#8671)
- 25f0ba9 chore: Add warning to Patch Mast config to specify changes are ephemeral (#8577)
- fd4c1a0 fix: don't drop active_user on expired tokens [MLG-1494] (#8653)
- 9dc4fc8 feat: add
det e unpause
alias. (#8562) - 7d9f295 style: reformat
det (n|t|s|c)
arguments code. (#8652) - c1dbed9 chore: Remove antd usage from determined [WEB-1723] (#8605)
- 0c9e19f feat: conditionally add genai to the sidebar (#8496)
- 4f4ef5a fix: stop using distutils (for python 3.12) [MLG-1519] (#8667)
- 1e8baee fix: Add theme to TableFilterDropdown [WEB-1952] (#8663)
- 24cfbb8 docs: Add section on viewing topology (#8638)
- ef26ad2 fix: Revert "chore: Job/task displays Running instead of Scheduled (#8335)" (#8654)
- 926a6de chore: delete unused experiment fields in db (#8639)
- 33dcfbd fix: replace View Logs link with useNavigate (#8655)
- 87f713f fix: Check if loading before saying no workspaces / projects [WEB-1904] (#8627)
- ae072f7 docs: Reorder the tutorials for visibility (#8650)
- 5774f5c chore: improve string_to_bool help text of cli [MLG-1208] (#8651)
- ab08345 chore: move postgres_command.go to command module (#8648)
- c1a7461 chore: add Go coverage ratchet to unit and integration tests (#8602)
- c0614a8 fix(cli/deploy/simple-rds): set template default snapshot (#8645)
- 2403cd5 fix: Sort order in users.active column (#8646)
- b60f8de fix: Trial metric chart series always have unique colors (#8626)
- d1b585c test: fix
test_tf_keras_parallel
. (#8641) - 5f19a53 feat(cli/deploy): add db snapshot flag for simple-rds [INFENG-269] (#8443)
- 2e6a634 chore: print TB status message to logs [MLG-1119] (#8642)
- c8d50a3 fix: ResourcepoolDetails polls for job stats [WEB-1914] (#8635)
- 2604235 chore: Update hew to 0.6.22 (#8634)
- 34891a1 docs: Add oidc note about uniqueness (#8637)
- 001d827 chore: extend CLI NTSC timeout [MLG-870] (#8632)
- 86aab14 docs: Document scim display name attribute (#8606)
- 134c2e1 fix: reduce tfkeras iris const batch size (#8633)
- a5e4b70 docs: Mention examples in the git readme (#8623)
0.27.0
Release Notes
Changelog
- eb0eae8 chore: bump version: 0.27.0-rc4 -> 0.27.0
- b51bfc5 docs: add release notes for 0.27.0 (#8671)
- 15783b5 Revert "adding 0.27 release notes"
- d765c6e revert doc changes
- 13727ad Revert doc changes:
- e962977 Revert "revert doc changes"
- a5e2849 revert doc changes
- e8ee69d adding 0.27 release notes
- bc7c75b chore: bump version: 0.27.0-rc3 -> 0.27.0-rc4
- 69b93a2 fix: Add theme to TableFilterDropdown [WEB-1952] (#8663)
- bb12b93 chore: bump version: 0.27.0-rc2 -> 0.27.0-rc3
- f323409 fix: ResourcepoolDetails polls for job stats [WEB-1914] (#8635)
- 1785b44 chore: bump version: 0.27.0-rc1 -> 0.27.0-rc2
- 35e4b49 chore: Update hew to 0.6.22 (#8634)
- b899482 chore: bump version: 0.27.0-rc0 -> 0.27.0-rc1
- 633d455 chore: bump version: 0.27.0-dev0 -> 0.27.0-rc0
- 6e6a23e chore: add docs dropdown link for new version
- 0bc1644 chore: bump version: 0.26.8-dev0 -> 0.27.0-dev0
- f0e4c0c chore: added release note for breaking CLI experiment creation change (#8630)
- eea7ca5 fix: remove UNSPECIFIED values from end-user enums (#8367)
- 9de3e4b fix: os.uname() not found on windows (#8629)
- 78068ee chore: cli describe workspaces no longer also lists projects (#8609)
- 79df355 chore: delete unused trials_augmented_view (#8588)
- edd71b3 chore: update hew to 0.6.20 (#8625)
- 1818cb4 ci: fix test split for skipped tests (#8624)
- fa8bf66 chore: fix a typo in RRI (#8612)
- fac9fa0 fix: update experiment hp search param quoting (#8620)
- 601a98f docs: Fix mmdetection readme link (#8607)
- b50a810 build(deps): bump golang.org/x/crypto from 0.14.0 to 0.17.0 (#8604)
- 8d175e6 chore(deps): bump flake8-commas from 2.0.0 to 2.1.0 (#4338)
- 9b88de4 build(deps): bump actions/setup-go from 4 to 5 (#8556)
- 590c0b9 build(deps): bump actions/setup-python from 4 to 5 (#8555)
- 27e3c1d feat: Replace useModal with updated Modal component [WEB-1828] [WEB-1891] (#8578)
- 63e44b0 Add network connectivity diagram (#8446)
- 83f5642 chore: Model Version pages to Hew UI [WEB-1814] (#8603)
- b462a54 chore: bump version: 0.26.7-dev0 -> 0.26.8-dev0
- 14ba6bd docs: add release notes for 0.26.7 (#8601)
- c2e7d1d build(deps): bump github/codeql-action from 2 to 3 (#8594)
- 490f537 chore(deps): bump actions/download-artifact from 3 to 4 (#8598)
- ded49ab chore(deps): bump actions/upload-artifact from 3 to 4 (#8599)
- 7ddcb92 fix: Show experiment loading state before first loading (#8583)
- 2c29f99 chore: Remove attrdict from model hub (#8554)
- 7d0afb7 chore: Upgrade hew to 0.6.19 (#8597)
- 0f5a16d chore: Change CLI and constants from lore to genai (#8593)
- ac9ee99 fix: prevent theme settings conflict in multi-client/window scenario (#8596)
- 2cd8b3e refactor: Replace antd imports with UI kit usage [WEB-1723] (#8582)
- 576ffb5 fix: antd.Typography with expansion tooltips moved to hew [WEB-1862] (#8513)
- e37ca82 chore: drop raw_checkpoints table (#8584)
- 3719968 fix: Change slot number back to 2 in Keras example (#8595)
- 22a17d4 chore(deps): bump github.com/docker/docker from 20.10.24+incompatible to 24.0.7+incompatible (#8316)
- 909c0d5 fix: allow for
model_dir
to be None when submitting experiments (#8422) - 79321f7 chore: encode path parameters in generated bindings [MLG-781] (#8543)
- ce9fe43 chore: update byExternalToken signature (#8587)
- dccc526 test: skip more
mmdetection
tests [DET-10036] (#8592) - 6fe2c91 feat: add local checkpoint storage download through server (#8505)
- e125955 chore: fix api state (#8589)
0.26.7
Release Notes
Changelog
- a125dd4 chore: bump version: 0.26.7-rc1 -> 0.26.7
- cc161d5 docs: add release notes for 0.26.7 (#8601)
- ed4d11a chore: bump version: 0.26.7-rc0 -> 0.26.7-rc1
- a138b0a fix: Change slot number back to 2 in Keras example (#8595)
- 47c885b chore: fix api state (#8589)
- dc175a1 chore: bump version: 0.26.7-dev0 -> 0.26.7-rc0
- 0584703 chore: lock published urls to preserve redirects
- 40d6c32 chore: lock api state for backward compatibility check
- 6e224ce chore: add docs dropdown link for new version
- e1c1749 fix(tasks): persist rendezvous readiness (#8545)
- 150eae4 chore: bump version: 0.26.6-dev0 -> 0.26.7-dev0
- 660de26 feat: Update the "Continue Single trial experiment" workflow (#8526)
- 940d1d7 fix: master cannot download s3 from us-east-1 (#8558)
- dac3fe8 build: avoid installing playwright by
make build
(#8569) - 005634b ci: unit-test login scenarios, and others (#8471)
- 4a808c3 chore: remove --dry-run option from det deploy aws [MLG-983] (#8542)
- aa71929 chore: implement GetSlot, GetSlots, and GetAgent for K8s rm (#8464)
- 791e532 chore: release notes for 0.26.6 (#8566)
- 51f9f71 chore: fix bad arg validation for det deploy aws up (#8576)
- e5566ee docs: Rename submit experiment (#8570)
- c05eab1 chore: include cluster name in stack deletion confirmation (#8575)
- 2a55fc1 docs: Update oidc group claim name notes (#8573)
- c9ba474 chore: add lore help and experimental disclaimer (#8563)
- 8527011 feat: filter out inactive users from list and add option to see all users (#8421)
- 431dec0 chore: Model metadata sections move to Surface [WEB-1813] (#8557)
- 6263695 Revert "docs: Add a link to mldes trial (#8561)" (#8572)
- 3371746 chore: Port smaller Antd.Modals to standard Hew Modal (#8567)
- 799d6d4 ci: make backend own codeowner of go.mod / go.sum (#8568)
- 6232ce2 chore: bump version: 0.26.5-dev0 -> 0.26.6-dev0
- faa9dee docs: add release notes for 0.26.5 (#8564)
- 05c5320 test: store test results for
test-e2e-gke
. (#8560) - 921826a docs: Fix workspaces projects left nav (#8565)
- 4bccbce fix: prevent workspace list race condition (#8524)
- ef9a686 test: add more Go db tests to internal/db (#8553)
- 17666a4 docs: Add a link to mldes trial (#8561)
- f461c74 Mention det python sdk demo (#8550)
- 8d3f6fb chore: Rename show_ssh_command in CLI and add message for VSCode WSL Users (#8387)
- f988566 chore: replace readme logo with svg [WEB-314] (#8559)
- f543c41 chore: remove OIDC Config from OSS (#8534)
- 12e31af build(deps): bump golang.org/x/net from 0.7.0 to 0.17.0 in /master (#8124)
- f9fc1d1 docs: Describe remote user management webui (#8479)
- bb37b53 test: skip all failing
mmdetection
tests. (#8540) - 3fff399 test: fix failing nightly test splits (#8549)
- ff97304 test: add more tests to db/postgres_experiments.go (#8537)
- c08613a fix: correct docs link in empty project [WEB-1879] (#8548)
- b9caae5 docs: Improve webhooks tasklog (#8544)
- 816d203 chore: Use Hew SplitPane [WEB-1682] (#8482)
- 76bdfe9 test: make
DET_MASTER
configurable in the perf testMakefile
. (#8533) - a6ae0c7 feat: add the slots property to the props (#8498)
- ef4b787 fix(api): delete experiment error handling corrections (#8510)
- 203477d chore: remove CI go unit tests and rename CI integration target (#8530)
- 65227fa ci: add Go coverage regex match so we can require some functions to be tested (#8514)
- 22af6f7 fix: allow slots per trial to be 0 [WEB-1871] (#8521)
- 0771185 test: add more Go db tests (#8519)
- 1c9575f fix: callback webhook action modals to re-fetch webhooks after successful call [WEB-1869] (#8522)
- 3d18c13 fix: trial spinner (#8528)
- ad55716 feat: Enable Retry for multi-trial exp with errored trials (#8518)
- 88784cc fix: Model delete redirect only needed if inside the model [WEB-1867] (#8512)
- 9a09d15 feat: add lore redirect (#8492)
- 6b4f52e test: remove
cifar10_tf_keras
tests and examples. (#8444) - 0590e80 fix: ensure filter columns are valid when selecting special columns (#8517)
- 9f14c6e fix: typo (#8516)
- 0d59196 chore: add ElementsMatch to usergroup test (#8501)
- 5fff579 feat: add experiment id to SDK Trial objects (#8499)
- 8e6252f refactor: replace resource pool looping with direct call (#8503)
- b37403f fix: fetch projects after archive/unarchive (#8504)
- 7da2777 chore(deps): bump actions/setup-java from 3 to 4 (#8507)
- 7f5de0f fix(agentrm): resource pools must filter agents.list() by name (#8509)
- e33ebb7 chore: remove defunct yogadl dependency (#8450)
- 72ef508 test: fix failing test_delete_experiment_removes_tensorboard_files (#8511)
- 9abdbf1 fix: Overwrite omnibar antd modal with Hew standard modal [WEB-1830] (#8476)
- 17f1efb chore: re-enable CI metrics upload (#8506)
- c8804e5 chore: take out submodule update cmd (#8502)
- 42b38fb feat: Enable multi-trial "retry" for errored/cancelled exp (#8495)
- 0b855c7 feat: hide hyperparameter search for unmanaged multi tiral experiments (#8497)
- 7077ab4 fix: Update Trial Download Error Message (#8398)
- 05e5974 ci: add date to python cache, fix moto linting issue (#8493)
- bf79a43 ci: fix Go build cache by not making it run get-deps (#8483)
- ec90590 feat: allow creation of tasklog webhooks in webui + docs (#8434)
- 324f148 chore: add UpdateUsergroupMembership (#8489)
- 66b7225 chore: run migration moving trials to view and rename [DET-9989] (#8440)
0.26.6
0.26.5
Release Notes
Changelog
- cfa7730 chore: bump version: 0.26.5-rc3 -> 0.26.5
- 2755e5e docs: add release notes for 0.26.5 (#8564)
- 617fb0c chore: bump version: 0.26.5-rc2 -> 0.26.5-rc3
- 5eff3b4 chore: bump version: 0.26.5-rc1 -> 0.26.5-rc2
- 54aa7a8 feat: add the slots property to the props (#8498)
- 52adafb fix: allow slots per trial to be 0 [WEB-1871] (#8521)
- 6eb6bf0 fix(api): delete experiment error handling corrections (#8510)
- 54f2cec fix: trial spinner (#8528)
- 5cfc4cc chore: bump version: 0.26.5-rc0 -> 0.26.5-rc1
- 5f4a7f4 fix: ensure filter columns are valid when selecting special columns (#8517)
- 189b4c1 fix(agentrm): resource pools must filter agents.list() by name (#8509)
- f47bde5 ci: add date to python cache, fix moto linting issue (#8493)
- 22d6d25 chore: bump version: 0.26.5-dev0 -> 0.26.5-rc0
- 275ea84 chore: lock api state for backward compatibility check
- cea9ff4 fix: Experiment state now is an ExperimentState (#8457)
- 06b7b79 fix: tqdm logs within wrap_rank [MLG-1236] (#8488)
- f50f7db fix: use explicit e.state in bulk experiment delete query (#8491)
- 0f65698 fix(rm): tasks shouldn't hang on restore failures (#8486)
- f847b26 chore: Revert "test: quarantine GPU execution of test_task_logs (#8261)" (#8484)
- f085e10 fix: failing custom searcher due to ExtraEnvVars being overwritten (#8490)
- 88f64f6 chore: update libraries (#8463)
- 2e48a6f chore: migrate detaileduser and experimentitem types to io-ts (#8477)
- d8ed945 chore: add aliases to det dev commads (#8156)
- a2730cb feat: adding PACHD_ADDRESS and DEX_TOKEN to task env (#8473)
- 837bc29 chore: Update Hew Version to 0.6.12 (#8481)
- 1ee9b81 chore: clean up version dropdown update script (#8415)
- 398f879 docs: Add requirement and known issue for singularity-suid (#8478)
- 64e299e fix: wrong skip experiment config regex for log policies (#8475)
- 0b4e1d2 chore: cleanup some spurious cluster logs (#8468)
- 8c9dfbf fix: add delete cascade to generic metrics (#8469)
- 0b28148 ci: register unit pytest marks (#8470)
- 815f5ae fix: Kill task permission on interactive page (#8358)
- e67807d chore: preserve CI logs when bringing an AWS cluster down (#8461)
- 50d40bd chore: update trial complete or early exit to always notify searcher (#8466)
- 6bdf061 Update k8s install info (#8465)
- e06b472 chore: export AddUserTx (#8458)
- 16ea5a0 chore: introduce and use observables with improved update checking (#8405)
- 5805df2 chore: set up ownership for .circleci [skip ci] (#8402)
- c2a211d fix(api): handle delete experiment failures correctly (#8459)
- dfb4dc5 chore(actors): remove pkg/actor (#8452)
- 167e237 chore: add error check to KillNTSC (#8441)
- 4edbc7f chore: log RestoreAllCommands error (#8454)
- 9d71abd Fix minor issues including hard coded reference (#8427)
- 756a79c chore: bump CI node version to 20.9.0 (#8455)
- 3090e42 chore: use ResourcePool info for consistent capacity calculation [WEB-1796] (#8447)
- 192a2b3 chore(actors): remove pkg/actor usage from agentrm (#8395)
- 5ded38d chore: bump version: 0.26.4-dev0 -> 0.26.5-dev0
- 4339f67 docs: add release notes for 0.26.4 (#8451)
- a25e4f5 fix: add back pin icon in experiment list header (#8429)
- adb5191 ci: store npm log artifacts (#8449)
- e29006b fix: det slot task name for no-permissions RBAC users (#8416)
- 695a648 fix: SDK list_checkpoints not defaulting to searcher metric sort (#8448)
- 1ddad7d feat: add Topology into the RP details page (#8276)
- 2f7dda6 ci: cache install Python (#8426)
- cde18df fix: Calculate allocation bar stats same as overview [WEB-1822] (#8431)
- 9a48ff1 docs: Update upgrade instructions (#8346)
- e9a199a fix: k8s autoscaling nodes not counted towards RP (#8439)
- bd19e7a chore: command actor refactor & add intg test [DET-9660] (#8136)
- 99c4cea test: create a test for delete-tensorboards via
det e delete
(#8336) - 7b7d1eb feat: Add remote user settings to Users table [WEB-1798] (#8397)
- 8051039 ci: fix linting with responses==0.24.1 (#8436)
- b087c10 chore: add version dropdown url for previous release (#8437)
- 8e9c505 test: fix model registry rbac wrong user regression (#8420)
- 292c75d fix: new experiment list tooltip styling (#8433)
- 1b47bf0 ci: delete broken fixture (#8428)
- bf07e61 fix: Wrap older modals in theme class [WEB-1824] (#8432)
- 0c8fad9 chore: filterformstore comment re: change tracking (#8386)
- 1041e56 fix: replace antd select with hew select (#8424)
- aa34aa7 feat: add workspace/project creation/deletion (#8430)
- 2e0a5a2 feat: client gets list_models, too. (#8425)
- 69df80e Revert "feat: Client gets list_models, too."
- d1343ca feat: Client gets list_models, too.
- a1e660d chore: converting SearchGroupsWithoutPersonalGroups into tx (#8419)
- 6ba688d test: fix TestAddAndRemoveBindings flake (#8423)
- d4c4195 chore: update Column and Row from Hew (#8412)
- d1d09e7 docs: Update non root container instructions (#8273)
0.26.4
Release Notes
Changelog
- bf665ae chore: bump version: 0.26.4-rc4 -> 0.26.4
- 2f86950 docs: add release notes for 0.26.4 (#8451)
- f2ef0fe chore: bump version: 0.26.4-rc3 -> 0.26.4-rc4
- f0a37a9 fix: Calculate allocation bar stats same as overview [WEB-1822] (#8431)
- 9acfbf2 chore: bump version: 0.26.4-rc2 -> 0.26.4-rc3
- 9dd0211 fix: k8s autoscaling nodes not counted towards RP (#8439)
- 3bf6647 chore: bump version: 0.26.4-rc1 -> 0.26.4-rc2
- 47397a4 fix: new experiment list tooltip styling (#8433)
- 680ac02 ci: fix linting with responses==0.24.1 (#8436)
- d4200d2 chore: add version dropdown url for previous release (#8437)
- 0a4b6bc test: fix model registry rbac wrong user regression (#8420)
- 242ff97 fix: Wrap older modals in theme class [WEB-1824] (#8432)
- e3c109a chore: bump version: 0.26.4-rc0 -> 0.26.4-rc1
- 22e18ae fix: replace antd select with hew select (#8424)
- 00d349a feat: add workspace/project creation/deletion (#8430)
- 9f727fe feat: client gets list_models, too. (#8425)
- b8c1be7 chore: update Column and Row from Hew (#8412)
- 4e6fd52 chore: bump version: 0.26.4-dev0 -> 0.26.4-rc0
- e9a457d chore: lock published urls to preserve redirects
- 2fae9ba chore: add docs dropdown link for new version
- 6c3bf84 chore: make insert-dropdown-url.sh executable (#8418)
- b5ca7f4 chore: fail deployment if launching part of the service fails (#8409)
- 8498674 fix: allow --json in det master config CLI command (#8413)
- d123932 fix: Place modal inside of ResourcePoolCard (#8414)
- ff19924 chore: Add eslint rule for ?? operator (#8410)
- d56b3ae chore: convert DOS line endings to Unix (#8411)
- c1219eb fix: Hide stats card when 0 on cluster page (#8359)
- da77efb fix: added permission check on GetAllocation (#8281)
- 3b0550c chore: Bumpenvs 0.26.4 (#8407)
- e48d03d fix: user flag to prompt for password during user requests (#8158)
- 513e6d7 fix: Project and Workspace cards wrap modal divs (#8378)
- 2497d84 chore: export AddUserTx (#8403)
- ad764f0 refactor: implement Glossary component from Hew (#8385)
- 52326d1 feat: change cli command for patch master log config DET[9720] (#8054)
- 1e9155d chore(type): stricter tsconfig (#8349)
- 16f18cc chore: revert task obfuscation lint failures (#8406)
- dde3156 chore: Implement Theming updates in Determined [WEB-1726] (#8388)
- 4edfc3c ci: move packaging test to test-e2e-longrunning (#8381)
- d3c208a ci: cache go modules deps and build cache (#8383)
- 8924996 chore: temporarily disable CI upload job (#8399)
- 356f651 Revert "chore: temporarily disable upload_test_results job step"
- 6dd9701 chore: temporarily disable upload_test_results job step
- ba49dbd ci: up parallelism for slowest test_e2e premerge tests (#8374)
- 5f3e556 ci: finish removing growforest (#8389)
- 62084e2 fix: NTSC task and slot viewing obscured for RBAC users with no Viewer Permissions (#8311)
- 0254f7d chore: fix nil ptr on allocation.Proto() (#8372)
- 119e759 chore: fix profiler test in CI (#8382)
- b428d5e feat: add hide column header menu item to explist (#8342)
- 7ae0501 chore: update the lore service port (#8375)
- 052cf8d feat: Cluster historical usage charts move to UI Kit LineChart [WEB-1786] [WEB-1764] (#8327)
- 819948d feat: clear filter from experiment table header (#8376)
- a590999 test: fix slow delete_checkpoint test (#8377)
- b0505db chore: Job/task displays Running instead of Scheduled (#8335)
- 1d64941 chore: short dsat e2e tests (#8288)
- 6afa836 chore: fix CI mnist_pytorch (#8364)
- 4d3eaab chore: Update Horovod Cycle Time (#8362)
- d3b01cb docs: Add det pach tutorial (#8082)
- 7cebc30 fix: adjust card size on workspaces page (#8370)
- 5c93cb0 chore: enable more Go linters (#8333)
- a279967 fix: aws deployment can deploy priority scheduler (#8345)
- 3d9293c fix: fixed bug in error handling in experiment.go (#8339)
- 194bfd5 fix: Cell can be undefined in experiment list table (#8360)
- 1da92aa chore: bump environment images to ubuntu 18.04 [MLG-1194] (#8356)
- 990c56f chore: add list_experiments to experimental.client (#8361)
- 3a7d9ea fix(tests): lower e2e_gpu_quarantine parallelism (#8363)
- 4c48458 fix: patched remote users were able to login with password (#8337)
- baf5c96 chore: port over PyTorch example to use Trainer API [MLG-1181] (#8292)
- 235bd8f feat: delete TB files from the SDK (#8329)
- 2fe3d99 chore: update Typography from UI kit (#8323)
- 2b23674 fix: prevent carriage return in env from crashing deepspeed launcher (#8321)
- 461c307 chore: Remove DesignKit since it's now maintained in Hew [WEB-1790] (#8338)
- 5ee87ec fix: Set group name and number columns to handle Safari [DET-9948] [DET-9949] (#8355)
- 10deef9 fix(experiments): transient errors shouldn't leave trial hung (#8352)
- 512b9f3 chore: remove accidental mock commit (#8354)
- 9d17dbf feat: Show "-" for null values in data cells for experiment list (#8343)
- ea50987 fix: properly interpret flag values (#8326)
- 8b6fc68 fix: Allow SAML and OIDC logins to work differently [WEB-1797] (#8308)
- 274288e docs: fix linting failure (#8351)
- 73bf0e8 docs: log policies (#8302)
- 8418029 chore: ft slot capacity check for each trial [DET-9897] (#8213)
- 494ca57 fix: replace TODO with ctx for deleteTensorboard (#8332)
- cfde2f6 docs: Docs Version Dropdown Automation (#8340)
- 8e69941 chore: Remove examples/legacy (#8153)
- af995ba fix: cli is not a library! (#7891)
- bf0a03d test: fix
ray.air.session
import. (#8344) - 9bb10cc ci: mypy fix for responses>=0.24.0 (#8341)
- b924b25 fix: add pin icon in dropdown (#8324)
- 62b7f3b chore: remove fit-content from
TimeAgoc
(#8328) - 86d6962 chore: update determined-ui to hew (#8334)
- f580385 fix: metric group charts have more than one color (#8304)
- 1966373 feat: Add tensorboard delete command to CLI (#8227)
- 656c8b2 chore: bump version: 0.26.3-dev0 -> 0.26.4-dev0
- af43248 docs: add release notes for 0.26.3 (#8322)
- b262a3d chore: Update lore.yaml to use the new version
- d64a0ac chore: use a single .golangci.yml file (#8320)
- ad94d20 chore: Add progress bar from UI Kit [WEB-1675] (#8181)
- b3b5be0 feat: implement CodeSample from UI Kit [WEB-1677] (#8270)
- d723b7f docs: fix typo in user edit release note (#8319)
- 6e5d840 chore: initial experiment actor refactor (#8229)
- 8a1ff58 chore: use a single root level go mod (#8285)
- 3511abf chore: delete dead code (#8313)
- d0e6375 chore: add a new deployment type for aws (#8279)
- 3929e8c chore(actors): remove ctx usage in agent_state.go (#8267)
- 5bf1b87 ci: delete broken wait_for helper (#8312)
- 50535f1 test: quarantine GPU execution of test_task_logs (#8261)
- d5b8e80 chore: deployment's --dry-run option doesn't print template (#8303)
- ac89d44 fix: allow experiments with directory checkpoint storage to parse (#8310)
- 306c0c3 fix: Project info not presists when forking (#8307)
- dc1b131 chore: sort out issues after bringing EE e2e_tests into OSS (#8084)
- d182abe chore: slurm support for blocklist (#1111)
- efdf62b fix: return correct location URL for /Users SCIM API endpoint (#1115)
- 37a84d1 fix: ruamel.yaml fixes for EE
- 0ce925a chore: Update nightly tests that use legacy cifar10_pytorch (#1102)
- 6cad296 fix: update for error message change in product (#1098)
- 1e302b5 chore: update e2e tests affected by examples_pruning (#1100)
- ad3dcda chore: cleanup model registry rbac test
- a3ffb5d test: enable command run tests for PBS (#1073)
- 9dd0e42 test: enable command and deepspeed tests run on slurm/pbs (#1044)
- ea4f4c4 chore(templates): ee fixes for template rbac
- c48e48d fix: Test test_slurm_verify_home fails with podman and it shouldn't [FE-136] (#1028)
- 760a738 test: Add pytorch2 distributed e2e tests on slurm [FE-168] (#1007)
- b5aee79 chore: use longer running no op experiment when seeding workspace (#994)
- facbda9 test: run test_hpc_job_pending_reason only on gcp vm (#996)
- 393d0b5 ci: FE-133 Configure non agent slurm/pbs tests to skip without explicitly listing test names in circleci. (#977)
- fd15535 ci: add ee-only files to the import-restrictions linter exclusions.
- 9cf2a26 test: slurm/pbs test for pending reason (FE-90) (#960)
- b3c2ca3 chore(actors): allocation.go, ee side
- eb7d1a1 test: [ALLGCP] Add e2e test for HPC that verifies that user HOME is preserved (#972)
- f3a8b0e test: fix test_slurm.py lint error (#949)
- 71896f3 chore: FE-91: Update base images (slurm/pbs) to include a populated singularity_image_cache (#943)
- 5891567 feat: add rbac to
api/v1/master/config
[DET-9633] (#931) - 0a5c32e ci: FE-72: Add test-e2e-pbs-*-gcp tests (#941)
- 4c233c0 feat: add rbac for strict job queue control (#927)
- 6e23aa2 chore: removed admin dependency from delete model/version (#912)
- 7c6c59e feat: rbac for templates (#909)
- e89cc08 ci: DET 9622: (ee) test_slurm.py::test_cifar10_pytorch_distributed failures (#919)
- c6ee094 fix: test_rbac goes to wrong url (#918)
- 6b34e0d fix: DET-9483 successfully run e2e_slurm_preemption tests as part of nightly workflow (#903)
- 4f6277d ci: FE-14 Migrate test-e2e-slurm to GCP slurmcluster (#879)
- f4507f7 tests: fix a miss indentation leading to missing project err (#878)
- 5cad9bb chore: fix a missing check for global permissions in jq (#874)
- bca3848 feat: add rbac support for reading job queue (#871)
- 5c79474 chore: update how we wait for tasks to be ready (#863)
- b292862 test: fix
test_master_host
[DET-9482]. (#851) - 5375a08 ci: quarantine flaky slurm tests (#850)
- 8e44c6c fix: Patch groups test [DET-9473] (#845)
- 49d2e08 fix: fix bug with launching tensorboards on trials (#842)
- d4dcbe5 test: Fix and add e2e_slurm_preemption tests to nightly workflow [D...
0.26.3
Release Notes
Changelog
- bd74446 chore: bump version: 0.26.3-rc3 -> 0.26.3
- 162de31 docs: add release notes for 0.26.3 (#8322)
- a472745 chore: bump version: 0.26.3-rc2 -> 0.26.3-rc3
- bab1dad fix: allow experiments with directory checkpoint storage to parse (#8310)
- ec438f2 fix: adjust width size in group table (#8309)
- 0d49b53 chore: fix job service panic when workspace does not exist (#8306)
- 13525e8 fix: check externalConfig is enabled before setting det_jwt as auth header (#8298)
- 101f279 fix: undefined handling in
CreateGroupModal
(#8301) - b9e64ab docs: quick fix for version dropdown (#8300)
- 48e915f chore: bump version: 0.26.3-rc1 -> 0.26.3-rc2
- 3c2cb53 ci: update wrapper config to always run
- 55c8a3a chore: bump version: 0.26.3-rc0 -> 0.26.3-rc1
- c138065 chore: bump version: 0.26.3-dev0 -> 0.26.3-rc0
- 862b41e chore: bump version: 0.26.2-dev0 -> 0.26.3-dev0
- b2e4b02 docs: add release notes for 0.26.2 (#8245)
- 989a0e3 fix: update bumpversion cfg for new CircleCI config (#8293)
- 54a12b8 fix: fix docs linting (#8291)
- 108cca7 docs: add documentation for Keras and PyTorch profilers [MLG-1094] (#8253)
- a7dfd47 chore: lock published urls to preserve redirects
- 8ae316c chore: lock api state for backward compatibility check
- d0c8273 chore: refactor filterformstore (#8239)
- 2992b82 feat: Updated LineChart in UI Kit [WEB-1700] (#8105)
- c6c3235 fix: login error message (#8240)
- 949ecf9 feat: support PyTorch Profiler in DeepSpeed trials [MLG-1095] (#8251)
- b2ea839 fix: use proper default cpu env image in the helm chart. (#8287)
- ccc78dd chore: install request as dev dep to fix proxy.js (#8286)
- af40f2c fix: only train for one batch in PyTorch Trainer test mode (#8260)
- cbab7e3 fix: dsat with all yaml formats (#8284)
- d17a2cc fix: migrate CLI from deprecated SDK methods (#8282)
- f43acd8 feat: directory checkpoint storage [DET-9594] (#8255)
- f375fbb feat: log policies (#8145)
- 862951f chore(actors): remove slots, slot proxy hacks from agentrm (#8266)
- 8d79c17 feat: webhook type task logs (#8175)
- b086982 test: fix delete experiments potential flake (#8283)
- fe06a0a fix: update copy for Agent UID/GID modal in the user mgmt UI. (#8278)
- 1fbb9f0 chore: restore set resource pool using job service (#8280)
- 0bea20b chore(actors): remove actors from resource aggregation (#8265)
- 76b4bfe feat: update add members to group (#8262)
- f24748c docs: Fix a 404 error (#8277)
- e4701b2 fix: remove Topology from the ResourcePoolDetail page (#8274)
- bec5edd chore(actors): refactor k8s rm without actors (#8264) [DET-9658]
- 46b9360 fix: icon size in TaskBar (#8263)
- 5c12463 test: don't commit Go mocks (#8258)
- a33d2b8 chore: ci should always setup python venv for caching (#8257)
- c09c529 ci: wart removal (#8147)
- 65219ca docs: fix inaccuracies in
bind_mounts
docs. (#8254) - d3bd2cb feat: unify new/edit group modals [WEB-1741] (#8236)
- 3179505 build(deps): bump actions/setup-node from 3 to 4 (#8230)
- e0a7efc fix: support ruamel.yaml>=0.18.0 (#8237)
- 2365df6 docs: check for dropped urls in PRs (#8247)
- f9c6402 fix: deepspeed e2e_tests fail when environment variable contains a newline character [FE-256] (#8154)
- ec7e004 fix: prevent passing login r= param to relayState (#8244)
- ab4cf83 feat: Update "Add Members" to Workspace experience (#8195)
- 25adf7d fix: dont suggest registering or deleting a checkpoint that is already deleted (#8246)
- c52ef71 feat: add new CLI command to edit multiple fields at once (#8075)
- 1d86343 chore: move files from /Clusters into /Cluster [WEB-1730] (#8231)
- 15ee78f docs: Add article for using detached mode (#8217)
- 115d7c2 fix: GetTasks doesn't respect rbac (#8233)
- 47b37eb chore: update Avatar component in UI kit [WEB-1055, WEB-1734] (#8178)
- 4c59393 chore: indirect import for jobs to avoid import cycle (#8238)
- aa77386 chore: add back missing commit in Python SDK (#8206)
- 9a80d99 ci: alternate mechanism for running nightly tests (#8221)
- 27a279b refactor: use Nameplate component in NavigationSidebar [WEB-1057] (#8152)
- c0e3062 revert: fix: support ruamel.yaml==0.18.0 (#8235)
- ae839a8 fix: support ruamel.yaml==0.18.0 (#8228)
- bb7020a fix: inaccurate task queue time [DET-9912] (#8225)
- 12e23b0 fix: pin ruamel.yaml<0.18.0 (#8232)
- 164b920 feat: Update users and groups tab to reflect count (#8224)
- 2ac9661 chore: shorten distributed-quarantine name (#8234)
- c964987 docs: Apply minor edits (#8215)
- f61d250 chore: support redirect on auth failure for echo routes (#8196)
- f251add docs: update the status of TLS security for notebooks. (#8191)
- 54551c5 refactor:
CliError
does not neede_stack
property. (#8179) - 7c6b2d0 chore: deprecate
apex
support. (#7526) - bfda78d docs: quick fix for version dropdown (#8223)
- 5929161 docs: Fix broken Slack links (#8220)
- b076179 chore: refactor actor system out of internal/job (#8174)
- facdc30 fix: remove 'contents' parameter from remove_notes (#8209)
- 448162d docs: Edit setup checklist (#8214)
- 164579f feat: update group table (#8194)
- 8f5e883 fix: Multi-trial visualizations switch from metricType to group string [DET-9896] (#8137)
- 834bd2f chore: remove WebSocket actor (#7552)
- b6e28d7 fix: prevent extra updates when observing settings store (#8212)
- 707f77d fix: Trigger function updates new user modified_at without error (#8210)
- f9c8a86 chore(actors): refactor k8s' resource_pool.go without actors (#8186) [DET-9657]
- ea3d6bf docs: sort articles by weight custom extension (#8208)
- 464fb54 docs: Create new advanced setup section (#8203)
- dd1c6f0 docs: toctree tile css (#8207)
- 53b9bdb fix: CLI uses default where pagination not included in args [DET-9908] (#8192)
- 079304a fix: filter agents/nodes by poolName (#8205)
- 3af965f chore: move ui kit to separate repo (#8104)
- 8735acf docs: remove references to PyTorchTrialContext.from_config() (#8187)
- 03383f2 fix: fix issue with db migrations (#8193)
- 2ff18bc fix: Display byte axis values using humanReadableBytes [DET-9906] (#8189)
- bd5e628 feat: add "Topology" section to the cluster UI (#8108)
- 6dfde99 fix: icons should appear in safari (#8190)
- 36ad3ca chore(actors): refactor pods.go without actors (#8170) [DET-9901]
0.26.2
Release Notes
Changelog
- 85b5135 chore: bump version: 0.26.2-rc4 -> 0.26.2
- 25e578d docs: add release notes for 0.26.2 (#8245)
- 1f7945e chore: bump version: 0.26.2-rc3 -> 0.26.2-rc4
- 87c11fb fix: inaccurate task queue time [DET-9912] (#8225)
- 07ea3b2 fix: pin ruamel.yaml<0.18.0 (#8232)
- 6e8a762 docs: quick fix for version dropdown (#8223)
- 8896d23 fix: remove 'contents' parameter from remove_notes (#8209)
- b3fdf9e chore: bump version: 0.26.2-rc2 -> 0.26.2-rc3
- 5585f38 fix: prevent extra updates when observing settings store (#8212)
- 826f633 chore: bump version: 0.26.2-rc1 -> 0.26.2-rc2
- 829c3bf fix: Trigger function updates new user modified_at without error (#8210)
- 0b0d268 fix: CLI uses default where pagination not included in args [DET-9908] (#8192)
- b0071c2 chore: bump version: 0.26.2-rc0 -> 0.26.2-rc1
- f049bd6 fix: fix issue with db migrations (#8193)
- 1f4bbe2 fix: icons should appear in safari (#8190)
- 5d3a80d chore: bump version: 0.26.2-dev0 -> 0.26.2-rc0
- 57fee58 chore: lock published urls to preserve redirects
- 883135a chore: various deprecations to standardize SDK get/list/iter (#8165)
- 0ab908e chore: add release notes for Python SDK (#8184)
- 2c09e10 docs: fix redirects (#8188)
- bfd20f1 docs: update experiment config reference for records_per_epoch in PyTorchTrials (#8185)
- 1fb50cd fix: pin icon should adapt to theme colors (#8183)
- aff81a5 chore: migrate SDK to a generic OrderBy [MLG-1056] (#8171)
- 88549ee Revert "feat: use max_results in SDK's list_trials (#8173)" (#8182)
- c29d086 fix: tensorboard sync for profiler data, Core API v2 managed mode [MLG-1063] (#8163)
- 8324bf2 fix: Change oicd client secret env var name to comply with naming convention (#8113)
- 3946185 docs: Restore path to singularity file (#8180)
- d12373f chore: reinstate core_api example e2e tests (#8148)
- 16ef0bb feat: use max_results in SDK's list_trials (#8173)
- ee2e633 refactor: add delete cascade to tables affected by experiment deletion (#8016)
- 404831b chore: final trial actor refactor (#8164)
- eb98aeb fix: case insensitive member search (#8166)
- b76dd90 fix: Learning curve point click (#8155)
- 95f2ad7 chore: task resources actor refactor (#8157)
- 1bff62e chore: configure a training port offset (#8125)
- e4d5ad1 docs: Modify the info architecture (#8100)
- 556022a chore: change CSS values (#8151)
- 88d9533 fix: Theme dropped after page refreshing (#8139)
- 515f128 refactor: update UI Kit Icon component [WEB-1699] (#8122)
- 82b4d31 chore: Post pruning hotfixes (#8141)
- 67f6dc1 Python SDK v1 (#8005)
- 3a24611 docs: fix python-sdk reference syntax (#8146)
- e7247a5 docs: Restore core api integer incrementing tut (#8115)
- 2a7f141 chore: add option to proxy requests to internal service (#8044)
- d40dee3 fix: Move setting of playwright browsers path (#8144)
- 31c2515 chore: update metrics documentation (#8118)
- b973356 chore: Add Message component to UI kit [WEB-1056] (#8133)
- 5df3d57 chore: remove actors from resource manager interface (#8126)
- f9a61b4 chore: upgrade to node v20 and npm audit [WEB-1662] (#8036)
- 38ff7b5 docs: update link in prometheus docs (#8138)
- ea8d760 chore: Examples pruning (#8140)
- 8b9ec03 fix: move couldn't-connect-to-master message into ship_logs.py (#8127)
- f52f7c6 docs: Apply style guide edits (#8134)
- 2f65eec chore: bump version: 0.26.1-dev0 -> 0.26.2-dev0
- ee3c478 docs: add release notes for 0.26.1 (#8131)
- 5d55da1 ci: dannys/reenable docs autoassign (#8012)
- 017e0f2 docs: Remove ref to fluent bit (#8128)
- 42535c2 chore: add hf context to all dist e2e [DET-9893] (#8119)
- 7e6689b chore: make agent.yaml readable by default (#8095)
- 8d2b5ce fix: hotfixes for ship_logs.py (#8116)
- 233cd95 feat: add allocation exit status to db and implementation for get allocation (#7897)
- 22f1f92 chore: remove task allocation group actor ref (#7853)
- 42bfb72 fix: Return ResourcePools with a fixed order (#8103)
- 3b42b91 chore: write log shipper in python [MLG-993] (#7974)
- ae7c415 fix: DET_CERT_MASTER_FILE=noverify det shell open (#8110)
- 3e1b83e docs: redirects.py moves subdirectories properly (#8111)
- 29aa772 fix: fix primary key for allocation_accelerators table (#8106)
- f9e9a45 build: print sphinx-build command on make build (#8109)
- a51ef7c fix: update WorkspaceMemberAddModal when new user or group is created [WEB-1111] (#8069)
- ff9b77a fix: trigger
autoupdate_users_modified_at
byusername
change (#8093) - 4899df7 feat: Edit experiment from list (#8086)
- d638303 feat: Dont allow users to add deactivated users to a workspace (#8073)
- f559eee fix: single point different axis ranges (#8096)
- 0c54010 fix: Empty / NotEmpty operators for descriptions, tags [WEB-1751] (#8090)
- 469c2ae perf: improve task stats IMAGEPULL performance (#8067)
- e7751fe chore: auth task logs [DET-7554] (#8089)
- 588d817 chore: cleanup allocation exit logic (#8088)
- e2b51e7 feat: add implementation for get and set acceleration data api and intg test [DET-9748] (#7856)
- 5483d12 fix: ignore
last_auth_at
to updatemodified_at
on users table (#8091) - 0f8c1d4 fix: Trial data loading state (#8083)
- 6a91c3d fix: Metric type is blank in comparison chart (#8085)
- 1819810 chore: allow attaching select dropdown to select container (#7940)
- e5b2e35 refactor: removed reference removed agents/./slots/. endpoint (#7424)
- ecc0f6b chore: support "formData" in swagger bindings parsing (#8078)
- 9dfc0a3 feat: Hide deactivated user in ws members list (#8077)
- 89c8da2 feat: introduce batch actions in user management page (#8056)
- ca63335 docs(performance): add sections to help with getting started (#8079)
- 9ea05a1 chore: change master config to k8s secret (#8053)
- 58fab08 fix: Experiment name settable in fork config (#8081)
- 6006d42 fix: Show experiment loading state (#8066)
- 8073a2d fix: order of API paths in proto for AssignMultipleGroups (#8080)
- 5be764b feat(performance): implement slack report send [INFENG-234] (#8045)
- 18d7286 fix: shell open gives unfriendly message on terminated (#8074)
- fc2e8bf test: fix test_pytorch_parallel logs check (#8064)
- ca763a4 fix:
det deploy aws list
results were incomplete. (#8062) - c8edfab chore: cleanup pod informer logging (#8072)
- 14bd162 docs: quick fix for version dropdown (#8070)
- bcfdeac chore(tests): workaround data races in uptrace/bun (#8065)
- 5d56e95 chore(tests): fix data races in webhook integrations (#8061)
- ce5bcf2 ci: backend owns e2e_tests folders cluster, command, and template (#8037)
- 3075820 chore(tests): fix data races in streaming API intg tests (#8059)