Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Create retire-unpublished.md #26

Merged
merged 3 commits into from
May 16, 2022
Merged

Create retire-unpublished.md #26

merged 3 commits into from
May 16, 2022

Conversation

hot007
Copy link
Contributor

@hot007 hot007 commented Apr 29, 2022

I have not added this file to the toc until it's been reviewed!!! Please add on merge if approved @paolap or @chloemackallah

I am not happy with the number rendering here where I've done 'custom' things, if you wish to rework the numbered lists I think it'd help?

This commit attempt to address most of Sharon's issues raised in #10

I have not added this file to the toc until it's been reviewed!!! Please add on merge if approved @paolap or @chloemackallah 

I am not happy with the number rendering here where I've done 'custom' things, if you wish to rework the numbered lists I think it'd help?

This commit attempt to address most of Sharon's issues raised in #10
Copy link
Contributor

@paolap paolap left a comment

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I only fixed one misspelling.
My main concern with this is that is looking at a very specific use case
"plethora of data that is associated with published data creation."
and it is looking at this mostly from a model run point of view.
I think it would benefit from being more generalised. We do have a current issue with lots of data, both model and analysis generated, that is re-use by the community. So basically unpublished data which is not versioned but has been used in publications.
In conclusion, I'm happy for this to be merged as it is, but then we should discuss at the meeting if we want to generalise this example or add another "unpublished" use case to cover what I described above. In first instance it might even encourage publication.

Another action we should add regardless is "documenting the process" if this data isn't published how we document its provenance. It could be included in associated published data documentation, or you might need a metadata record somewhere which refers specifically to this a separate dataset.

@paolap paolap merged commit 7f859de into main May 16, 2022
@hot007
Copy link
Contributor Author

hot007 commented May 16, 2022

I very much agree - I think it'd be great to generalise or if it's substantially different, add a whole extra example.
The second use case you describe certainly might be used as a lever to encourage publication for the purposes of reuse!

And yes, we haven't actually captured provenance here, and i wonder if it belongs more in the "creating" section than "publishing" or "retiring", but it maybe should be mentioned throughout??

@paolap
Copy link
Contributor

paolap commented May 16, 2022

The provenance I'm thinking of is about documenting the "retirement" step. It is something that should be added to any use case to complete the dataset information, in a way that if someone asks about the data it's easy to query it and let them know what happen to the data and why, and if there's an alternative way to retrieve it, i.e. from deep storage

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

3 participants