Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Guidance on OCIDs for re-publishers #413

Closed
timgdavies opened this issue Jan 12, 2017 · 3 comments
Closed

Guidance on OCIDs for re-publishers #413

timgdavies opened this issue Jan 12, 2017 · 3 comments
Labels
Focus - Documentation Includes corrections, clarifications, new guidance, and UI/UX issues
Milestone

Comments

@timgdavies
Copy link
Contributor

timgdavies commented Jan 12, 2017

We are seeing a number of organisations re-publishing data from others.

We need to work further on guidance around how OCIDs should be constructed and managed in these cases.

(Internal reference to helpdesk issue 1543)

@mireille-raad
Copy link

are they re-publishing as is? or are they adding data as well? any links you can share?

you might want to add to the guidance on OCID some guidance on mentioning the dates for when the dataset was re-published. (I can't imagine everyone is already doing real time integration/republishing)

@jpmckinney jpmckinney added the Focus - Documentation Includes corrections, clarifications, new guidance, and UI/UX issues label Jul 27, 2017
@jpmckinney jpmckinney added this to the 1.2 milestone Dec 27, 2017
@jpmckinney
Copy link
Member

jpmckinney commented Sep 18, 2018

Having read the mentioned issue, there are a few scenarios we've seen, in which the original source is not in OCDS format:

  1. A CSO publishes data in OCDS format, using available data from a government source that includes process identifiers.
  2. A CSO publishes data in OCDS format, authoring new data based on a government source that has no structured data or process identifiers.
  3. An aggregator publishes data in OCDS format, using data aggregated from many sources that are not in OCDS format, and that may not have process identifiers.

Going through each case:

  1. There is the choice of (1) issuing a prefix representing the CSO as publisher or (2) issuing a prefix representing the source as 'ideal' or 'formal' publisher. The latter might mean that, if the source adopts OCDS, then its OCIDs might match the CSO's – an advantage in terms of interoperability. However, there's no guarantee that the mapping to OCDS would be identical, which is a larger barrier to interoperability. The government source might, also, want to avoid any confusion or issues around a CSO publishing data that is not guaranteed to be identical to its own, under the same prefix.
  2. There's no opportunity to promote interoperability in a way similar to 1, so a prefix is issued for the re-publisher only.
  3. There's limited opportunity to promote interoperability in a way similar to 1, so a prefix is issued for the re-publisher only.

Given the issues with 1(2), I think a simpler way to promote interoperability, in such cases, is for data users to use the common process identifier to match each publisher's data.

The CRM issue only considers scenarios where the source publisher isn't using OCDS. There may be different considerations if the source is in OCDS format, for which a new issue can be opened.

I'm closing this issue as the answer, when the original publication is not in OCDS format, is to simply issue an OCID prefix for the downstream publisher alone.

@jpmckinney
Copy link
Member

Related issue is #364

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Focus - Documentation Includes corrections, clarifications, new guidance, and UI/UX issues
Projects
None yet
Development

No branches or pull requests

3 participants