angelobo edited this page Jan 13, 2016 · 1 revision

Task 3

… of the Semantic Publishing Challenge 2015.

Motivation

Much of the information about papers published at CEUR-WS.org also exists in other datasets. Our goal is to interlink entities such as publications, authors and events with the same entities as they appear in those datasets.

Persons acting, e.g., as authors of a publication or editors of a workshop, and their affiliations, might already appear in other LOD datasets, e.g. DBLP. Similarly, events such as conferences and workshops might appear in those datasets or in COLINDA, and their venues in DBpedia. All those entities should be identified, disambiguated and interlinked.

This way, the knowledge regarding CEUR-WS proceedings is extended beyond the dataset's boundaries. Participants are required to identify the entities in the CEUR-WS dataset that also appear in other datasets, and to interlink the CEUR-WS.org linked dataset with relevant datasets already existing in the Linked Open Data cloud. Task 3 can be accomplished as a named entity recognition and disambiguation task, as an entity interlinking task, or as a combination of both. Moreover, because triples are generated from different sources and by different activities, tracking provenance information becomes increasingly important.
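To make the entity interlinking option more concrete, here is a minimal sketch in Python. It is only an illustration, not the challenge's reference method: it assumes that each dataset has already been reduced to a mapping from entity URI to a human-readable label, and it links two entities with `owl:sameAs` only when their normalized labels are identical and unambiguous. The function names and the `example.org` URIs are hypothetical.

```python
import unicodedata

OWL_SAME_AS = "http://www.w3.org/2002/07/owl#sameAs"


def normalize(name):
    """Lower-case a label, strip accents and punctuation, collapse whitespace."""
    decomposed = unicodedata.normalize("NFKD", name)
    no_accents = "".join(c for c in decomposed if not unicodedata.combining(c))
    cleaned = "".join(c if c.isalnum() else " " for c in no_accents.lower())
    return " ".join(cleaned.split())


def interlink(source, target):
    """Return (source_uri, target_uri) pairs whose labels normalize identically.

    A source entity is skipped when its label matches more than one target
    URI, because exact matching alone cannot disambiguate such cases.
    """
    index = {}
    for uri, label in target.items():
        index.setdefault(normalize(label), []).append(uri)

    links = []
    for uri, label in source.items():
        candidates = index.get(normalize(label), [])
        if len(candidates) == 1:
            links.append((uri, candidates[0]))
    return links


def to_ntriples(links):
    """Serialize the links as owl:sameAs statements in N-Triples syntax."""
    return [f"<{s}> <{OWL_SAME_AS}> <{t}> ." for s, t in links]


# Hypothetical sample data; real URIs would come from the datasets themselves.
ceur_persons = {
    "http://example.org/ceur/person/a": "José García",
    "http://example.org/ceur/person/b": "Ann Lee",
}
other_persons = {
    "http://example.org/other/Jose_Garcia": "Jose Garcia",
    "http://example.org/other/Ann_Lee_1": "Ann Lee",
    "http://example.org/other/Ann_Lee_2": "Ann  Lee",
}

links = interlink(ceur_persons, other_persons)
triples = to_ntriples(links)
```

In this sample, "Ann Lee" matches two candidates and is therefore left unlinked. A real submission would need additional evidence (co-authors, affiliations, venues) to disambiguate such cases, and would likely record where each link came from to satisfy the provenance requirement.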

Training Datasets

The input consists of a set of datasets, taken from the Linked Open Data cloud and interlinked with the CEUR-WS dataset.

CEUR-WS

Metadata about CEUR-WS.org Workshop Proceedings

homepage: http://ceur-ws.org/

data dump: https://github.com/ceurws/lod/blob/master/data/ceur-ws.ttl

Triple Pattern Fragments: http://data.linkeddatafragments.org/ceur-ws

COLINDA

Metadata about events announced at Eventseer and WikiCfP

homepage: http://www.colinda.org/

data dump: https://github.com/ceurws/lod/blob/master/data/colinda.nt

endpoint: http://data.colinda.org/endpoint.html

Triple Pattern Fragments: http://data.linkeddatafragments.org/colinda

DBLP

Metadata about Computer Science publications in the DBLP collection

homepage: http://dblp.l3s.de/dblp++.php

data dump: http://dblp.l3s.de/dblp-2015-02-14.sql.gz

endpoint: http://dblp.l3s.de/d2r/sparql

Triple Pattern Fragments: http://data.linkeddatafragments.org/dblp

Semantic Lancet Triplestore

Metadata about papers published in the Journal of Web Semantics by Elsevier

homepage: http://www.semanticlancet.eu/

data dump: https://github.com/ceurws/lod/blob/master/data/lancet.ttl

endpoint: http://two.eelst.cs.unibo.it:8181/sparql.tpl

Triple Pattern Fragments: http://data.linkeddatafragments.org/lancet

Semantic Web Dog Food

homepage: http://data.semanticweb.org/

data dump: http://data.semanticweb.org/dumps/

Triple Pattern Fragments: http://data.linkeddatafragments.org/dogfood

Springer LD

Metadata about conference proceedings published by Springer

homepage: http://lod.springer.com

data dump: https://github.com/ceurws/lod/blob/master/data/springer.nt

endpoint: http://lod.springer.com/sparql

Triple Pattern Fragments: http://data.linkeddatafragments.org/springer
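Several of the datasets above list a SPARQL endpoint. As a rough sketch of how one might prepare a request against such an endpoint, the Python snippet below builds a GET URL following the SPARQL protocol convention of passing the query in a `query` parameter. It only constructs the URL and sends nothing over the network. Whether a given endpoint is still online, and which vocabulary it uses, are not stated here, so the query is deliberately vocabulary-neutral; the helper name `build_sparql_url` is hypothetical.

```python
from urllib.parse import urlencode

# Vocabulary-neutral probe query: fetch any ten triples.
PROBE_QUERY = "SELECT ?s ?p ?o WHERE { ?s ?p ?o } LIMIT 10"


def build_sparql_url(endpoint, query):
    """Build a SPARQL protocol GET URL with the query URL-encoded."""
    return endpoint + "?" + urlencode({"query": query})


# Endpoint taken from the Springer LD entry above; availability is not guaranteed.
url = build_sparql_url("http://lod.springer.com/sparql", PROBE_QUERY)
```

The resulting URL could then be fetched with any HTTP client, typically with an `Accept` header requesting a SPARQL results format such as `application/sparql-results+json`.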

Evaluation Datasets