Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

How to Catalog Data Duplication Settings per Dataset #1589

Open
Jonessmj opened this issue Feb 7, 2024 · 1 comment
Open

How to Catalog Data Duplication Settings per Dataset #1589

Jonessmj opened this issue Feb 7, 2024 · 1 comment
Labels
dcat future-work issue deferred to the next standardization round

Comments

@Jonessmj
Copy link

Jonessmj commented Feb 7, 2024

Dear DCAT team,

I have a question on how to properly use DCAT to capture metadata about instructions for copying a dataset. The instructions/configurations are per dataset. The scenario is that there is a dataset in a data catalog that could be copied to an AWS Redshift Cluster. It hasn't been copied yet, but if certain application-level things happen then a service will copy the data to one or many AWS Redshift Clusters. Prior to this happening though, the owner of the dataset will specify default DIST and SORT configurations to be used for the duplicated dataset.

Since these parameters/configurations are being set per source dataset and the duplicated dataset doesn't exist when these parameters/configurations are being defined I was thinking that it should be a property of the source dataset, but I'm not sure what dcat terms or extensions of dcat I should use. Alternatively, should these settings be some first class entity of their own with a prov relationship to the source dataset?

@dr-shorthair
Copy link
Contributor

This looks like a specialization for a particular application.
SO probably out of scope for DCAT per se.

You can propose an extension, else develop your own application profile with additional elements connected to the standard DCAT base.

@riccardoAlbertoni riccardoAlbertoni added dcat future-work issue deferred to the next standardization round labels Apr 22, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
dcat future-work issue deferred to the next standardization round
Projects
None yet
Development

No branches or pull requests

4 participants