Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

ILI status annotations #8

Open
goodmami opened this issue Feb 2, 2021 · 7 comments
Open

ILI status annotations #8

goodmami opened this issue Feb 2, 2021 · 7 comments

Comments

@goodmami
Copy link
Member

goodmami commented Feb 2, 2021

There is the idea that an ILI can be proposed and deprecated/superseded, but I don't see where this status would be annotated in, e.g., ili.ttl. I don't know if the existing ontologies have some relevant property (xyz:status or something) or if we need to make something up, but we also need an inventory of possible statuses. Vossen, Bond, and McCrae 2016 describes actions taken on existing ILIs (deprecate, supersede, split, and fork), but not the current status of an ILI. How about the following:

  • provisional (from something proposed via ili="in" in WN-LMF, given some provisional identifier in CILI)
  • active (accepted and in use; maybe this is the default, unannotated value?)
  • deprecated (sometimes accompanied by a separate superseded link to something else)

I don't know if we'd need something like removed after deprecated with the distinction that deprecated ILIs may still be in use, but their continued use is discouraged, and removed ILIs are no longer recognized (maybe we clear the descriptions, but need to keep the IDs so they doesn't get recycled). But I think simpler is better, in general.

@fcbond
Copy link
Member

fcbond commented Feb 2, 2021 via email

@goodmami
Copy link
Member Author

goodmami commented Feb 2, 2021

Thanks! So these vocabularies are defined in OWL and schema.org? For the latter, I saw the following:

The meta section contains terms primarily designed to support the implementation of the Schema.org vocabulary itself. It includes terms such as Class, Property, domainIncludes and supersededBy. They are not currently advocated for widespread use across the web.

So perhaps we should think about our own vocabulary for this one, or at least provide documentation explaining our use of the term, since it's not intended to be used outside of schema.org itself.

And those two terms don't cover the "provisional" status, so we still need something else.

@jmccrae
Copy link
Member

jmccrae commented Feb 2, 2021

Yes, could be a good idea to define our own vocabulary here.

@goodmami
Copy link
Member Author

goodmami commented Feb 4, 2021

I'm not familiar with RDF conventions, but could this be a new relation ("predicate"?) with a controlled vocabulary of values ("objects"?), e.g.:

<i48540>	a	<Instance> ;
	skos:definition	"a fictional mouse created by Lewis Carroll"@en ;
	dc:source	pwn30:02451912-n ;
	ili:status	ili:deprecated .

<i18263>	a	<Concept> ;
	skos:definition	"not at all; certainly not; not in any circumstances"@en ;
	dc:source	pwn30:00020997-r ;
	ili:status	ili:deprecated ;
	ili:supersededBy	<i18262> .

@goodmami
Copy link
Member Author

Is the ili namespace prefix used for ILI IDs themselves? That is, is i48540 more explicitly ili:i48540? If so, then using the ili namespace for these statuses doesn't seem appropriate.

@jmccrae
Copy link
Member

jmccrae commented Sep 28, 2021

There are some oddities in the Turtle file that probably need to be fixed with regards to the namespaces. Currently the namespace maps ili to http://globalwordnet.org/ili/ which I don't think is a URL that ever works. It also uses base namespace so we get the mapping of <i123> to http://globalwordnet.org/ili/ili.ttl#i123

None of these URLs actually work at the moment anyway (I have to contact Piek).

I will make a PR to fix the file to the normal URL schema.

@goodmami
Copy link
Member Author

goodmami commented Sep 28, 2021

Thanks for the fixes in the PR!

I guess what I meant is that if each ILI ID is in the namespace as, e.g., ili:i48540, then the properties and values mentioned above (ili:status, ili:proposed, etc.) are sharing that namespace. I think a collision is unlikely, but they aren't really the same kind of thing. I'm not really sure what is standard practice in linked data or if that's a problem.

If it is a problem, then what if we had, e.g., http://globalwordnet.org/ili/ as the general namespace and http://globalwordnet.org/ili/concept/ as the one containing IDs? Maybe like this:

@prefix ili: <http://globalwordnet.org/ili/> .
@base <http://globalwordnet.org/ili/concept/> .

Or if it's not a problem, then I guess we need some rules and conventions for ILI identifiers. E.g., status, proposed, active, deprecated, and superseded are reserved; all identifiers must match the regex [A-Za-z][1-9][0-9]* so we have, e.g., i1 for CILI, g1 for GeoNames, and so on.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants