Query optimisation. #1240

pulquero · 2021-11-09T21:34:57Z

No description provided.

sonarcloud · 2021-11-09T21:35:53Z

Kudos, SonarCloud Quality Gate passed!

0 Bugs
0 Vulnerabilities
0 Security Hotspots
0 Code Smells

No Coverage information
No Duplication information

osma · 2021-11-10T08:59:51Z

Thanks for the PR @pulquero !

Can you explain what you did here in a bit more detail?

Which operation was slow, and how much does the optimisation help?

Are there any side effects that you are aware of?

pulquero · 2021-11-10T12:39:22Z

For my data it was milliseconds vs minutes.

Added my observations as comments below:

 SELECT ?object ?label (GROUP_CONCAT(STR(?dir); separator=' ') AS ?direct)
 WHERE {
   <$uri> a skos:Concept .
   OPTIONAL {
     # ?object may not be bound here, but it looks like we only care about the
     # case where ?object is bound. What is the reason for this being in an OPTIONAL?
     <$uri> $propertyClause* ?object .
     OPTIONAL {
       ?object $propertyClause ?dir .
     }
   }
   OPTIONAL {
     # Only has an effect if ?object is bound; otherwise it has no correlation
     # with the non-optional part.
     ?object skos:prefLabel ?label .
     FILTER (langMatches(lang(?label), "$lang"))
   }
   $otherlang
 }
 GROUP BY ?object ?label

osma · 2021-11-12T13:25:24Z

Thanks for the details. It's still not entirely clear to me which operation was slow from the user's perspective. The function in question (generateTransitivePropertyQuery) is a rather low-level one. It is used, indirectly, at least to generate the SPARQL query for the breadcrumb paths in the web UI, but also for some of the REST API methods. It would be good to know, for example, which direction is relevant here: transitive broaders (as in the breadcrumbs) or transitive narrowers?

Also, what does your data look like? Is the hierarchy somehow big or complicated since the query ends up taking minutes? This hasn't been a big performance issue in the past for us, that's why I'm asking.

Also, which triple store? We're using Fuseki mostly, but are you perhaps using GraphDB as in your other PR?

pulquero · 2021-11-12T14:30:57Z

I'm using GraphDB, and my vocabulary consists of a million skos:Concept instances. It is in the default graph alongside other vocabularies and is cross-referenced with them. I think the average tree depth is about 3. I believe

 <$uri> a skos:Concept .
 OPTIONAL {
   # Only has an effect if ?object is bound; otherwise it has no correlation
   # with the non-optional part.
   ?object skos:prefLabel ?label .
   FILTER (langMatches(lang(?label), "$lang"))
 }

results in a Cartesian product, with ?object skos:prefLabel ?label matching everything in the entire default graph.
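
One way to sidestep the suspected Cartesian product is to make the pattern that binds ?object a required part of the query, so that the label lookup always starts from a bound ?object. The following is only a minimal sketch of that idea, not necessarily the change made in this PR. It assumes skos:broader as the property, a hypothetical concept URI http://example.org/concept, and "en" as the language, and it leaves out the $otherlang fragment of the original template.

```sparql
PREFIX skos: <http://www.w3.org/2004/02/skos/core#>

SELECT ?object ?label (GROUP_CONCAT(STR(?dir); separator=' ') AS ?direct)
WHERE {
  # Hypothetical concept URI, standing in for <$uri> in the template.
  <http://example.org/concept> a skos:Concept .

  # Required rather than OPTIONAL: the zero-or-more path always matches
  # at least the start node itself, so ?object is bound from here on.
  <http://example.org/concept> skos:broader* ?object .

  # Direct parents of each node on the path (may be absent for top concepts).
  OPTIONAL { ?object skos:broader ?dir . }

  # Label lookup, now evaluated with ?object already bound.
  OPTIONAL {
    ?object skos:prefLabel ?label .
    FILTER (langMatches(lang(?label), "en"))
  }
}
GROUP BY ?object ?label
```

Because a zero-or-more property path with a fixed subject always yields the subject itself as one solution, dropping the outer OPTIONAL should not remove any rows as long as the concept exists. Whether this actually changes the query plan depends on the triple store's optimiser, so it is worth checking against the real data.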

Query optimisation.

363114a

osma added bug performance labels Nov 12, 2021
