Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Review the concept of connections #613

Open
viravit opened this issue Aug 19, 2021 · 0 comments
Open

Review the concept of connections #613

viravit opened this issue Aug 19, 2021 · 0 comments

Comments

@viravit
Copy link

viravit commented Aug 19, 2021

ODAHU Connections Manager allows to manage access to variety of external systems like VCS (git), Docker Registry, Object Storages (AWS S3, GCP GCS, etc.. ) But Connections Manager is not designed as a dataset registry/catalog that allows to manage multiple versions of datasets needed during ML multiple experiments.

The dataset management is to be a separate service/component that has features :

  • API, UI and CLI to register dataset. The data cab be located in external Data Storage / DB.
  • Pluggable Data Storages / DBs.
  • Metadata (version, notes, labels, comments, schema, etc) is to be associated with datasets
  • Access credentials is to be associated with datasets. (reuse connections manager service - TBD)
  • Dataset access permissions for user and user groups
  • ODAHU Model Training Services is to consume data during training job execution from/through dataset registered in ODAHU.
@viravit viravit changed the title Review the concept of connection Review the concept of connections Aug 19, 2021
@viravit viravit added the 1.7 label Aug 19, 2021
@dsuslov dsuslov added the epic label Aug 19, 2021
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

No branches or pull requests

2 participants