GitHub - oeg-upm/lubm4obda: Inference and Meta Knowledge Benchmarking of OBDA Systems

The LUBM4OBDA Benchmark extends the popular LUBM Benchmark to evaluate Ontology-Based Data Access (OBDA) engines over relational databases. In addition, LUBM4OBDA benchmarks meta knowledge (also called reification or statement-level metadata). The main characteristics of LUBM4OBDA are:

- SQL data dumps for MySQL and PostgreSQL.
- Data generator for custom scaling factors.
- Original LUBM query set (queries 1-14).
- Meta knowledge query set for standard reification, singleton property and SPARQL-star (queries 15-18).
- R2RML and RML mappings.

Citing LUBM4OBDA: please cite the JWE paper:

@article{arenas2024lubm4obda,
  title     = {{LUBM4OBDA: Benchmarking OBDA Systems with Inference and Meta Knowledge}},
  author    = {Arenas-Guerrero, Julián and Pérez, María S. and Corcho, Oscar},
  journal   = {Journal of Web Engineering},
  publisher = {River Publishers},
  issn      = {1544-5976},
  year      = {2024},
  volume    = {22},
  number    = {8},
  pages     = {1163–1186},
  doi       = {10.13052/jwe1540-9589.2284}
}

Data

There are two options to obtain the SQL data dumps:

- Download the SQL data dumps for scaling factors 1, 10, 100 and 1000 from Zenodo.
- Use the Docker container with the data generator to produce the data with custom scaling factors.
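
Once a dump has been obtained, it has to be loaded into the target database. The following is a minimal sketch of how that could be scripted, not part of the benchmark itself. The dump file name `lubm4obda_1.sql` and the database name `lubm4obda` are hypothetical placeholders, and connection options such as user, password and host are left out on the assumption that the client tools are already configured.

```python
import subprocess
from pathlib import Path
from typing import List


def load_command(engine: str, dump: Path, database: str) -> List[str]:
    """Build the client command line that loads a SQL dump.

    The dump path and database name are supplied by the caller; the names
    used in the example below are placeholders, not files from the benchmark.
    """
    if engine == "postgresql":
        # psql executes the statements in the file passed with -f.
        return ["psql", "-d", database, "-f", str(dump)]
    if engine == "mysql":
        # The mysql client reads statements from standard input, so the dump
        # is passed as stdin when the command is run (see load_dump).
        return ["mysql", database]
    raise ValueError(f"unsupported engine: {engine}")


def load_dump(engine: str, dump: Path, database: str) -> None:
    """Run the client command; requires psql or mysql to be installed."""
    argv = load_command(engine, dump, database)
    if engine == "mysql":
        with dump.open("rb") as handle:
            subprocess.run(argv, stdin=handle, check=True)
    else:
        subprocess.run(argv, check=True)


# Hypothetical file and database names, for illustration only.
print(load_command("postgresql", Path("lubm4obda_1.sql"), "lubm4obda"))
```

Calling `load_dump` with the same arguments would execute the command, provided the database already exists and the client can connect to it.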

Mappings

The mappings directory of this GitHub repository contains all the R2RML and RML documents. The following mappings are provided:

R2RML:
- Original, without meta knowledge.
- Standard reification.
- Singleton property.
- R2RML-star.
RML:
- Original, without meta knowledge.
- Standard reification.
- Singleton property.
- RML-star.
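
To make the difference between the three meta knowledge approaches concrete, the sketch below prints how one annotated statement could be encoded under each of them. It is a generic illustration written with plain strings, not the output of the benchmark mappings: the `ex:` terms, the blank node label and the singleton property name are invented for the example, and `rdf:singletonPropertyOf` is the property name commonly used in the singleton property proposal.

```python
from typing import List


def standard_reification(s: str, p: str, o: str, meta_p: str, meta_o: str,
                         stmt: str = "_:stmt1") -> List[str]:
    """Describe the statement with a node of type rdf:Statement."""
    return [
        f"{stmt} rdf:type rdf:Statement .",
        f"{stmt} rdf:subject {s} .",
        f"{stmt} rdf:predicate {p} .",
        f"{stmt} rdf:object {o} .",
        f"{stmt} {meta_p} {meta_o} .",
    ]


def singleton_property(s: str, p: str, o: str, meta_p: str, meta_o: str,
                       singleton: str) -> List[str]:
    """Use a property unique to this statement and annotate that property."""
    return [
        f"{s} {singleton} {o} .",
        f"{singleton} rdf:singletonPropertyOf {p} .",
        f"{singleton} {meta_p} {meta_o} .",
    ]


def rdf_star(s: str, p: str, o: str, meta_p: str, meta_o: str) -> List[str]:
    """Annotate the quoted triple directly."""
    return [f"<< {s} {p} {o} >> {meta_p} {meta_o} ."]


# Illustrative terms only; they are not taken from the benchmark data.
triple = ("ex:student1", "ex:takesCourse", "ex:course1")
meta = ("ex:since", '"2020"')

for line in standard_reification(*triple, *meta):
    print(line)
for line in singleton_property(*triple, *meta, singleton="ex:takesCourse_1"):
    print(line)
for line in rdf_star(*triple, *meta):
    print(line)
```

The same statement takes five triples with standard reification, three with the singleton property, and a single annotated triple with RDF-star, which is why each approach needs its own mapping and its own version of the queries.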

Ontology

The Univ-Bench ontology is available in the ontology directory of this GitHub repository.

Queries

The queries are available in the queries directory of this GitHub repository. Use the original mappings (without meta knowledge) for queries 1-14. Queries 15-18 come in three versions, one for each meta knowledge approach (standard reification, singleton property or RDF-star), and each version is meant to be run with the mapping for the same approach.
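
The pairing between queries and mappings described above can be sketched as a small lookup. This is only an illustration of the rule: the function name `mapping_for` and the returned labels are assumptions made for the example and do not correspond to file names in the repository.

```python
from typing import Optional

# Labels for the meta knowledge approaches (illustrative, not file names).
APPROACHES = ("standard reification", "singleton property", "RDF-star")


def mapping_for(query_number: int, approach: Optional[str] = None) -> str:
    """Return which kind of mapping a query should be run with."""
    if 1 <= query_number <= 14:
        # Original LUBM queries use the mapping without meta knowledge.
        return "original"
    if 15 <= query_number <= 18:
        # Meta knowledge queries need the mapping of the chosen approach.
        if approach not in APPROACHES:
            raise ValueError(f"queries 15-18 need one of {APPROACHES}")
        return approach
    raise ValueError("query number must be between 1 and 18")


print(mapping_for(7))
print(mapping_for(16, "singleton property"))
```

A benchmark driver could use such a check to avoid running a meta knowledge query against a mapping generated for a different approach.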

CSV & Apache Parquet

The benchmark can also be run with CSV and Apache Parquet files. The resources for these data sources are available on Zenodo and are described in an ESWC paper.
