Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

bad triples in KELM #5

Open
Shimorina opened this issue Mar 14, 2022 · 0 comments
Open

bad triples in KELM #5

Shimorina opened this issue Mar 14, 2022 · 0 comments

Comments

@Shimorina
Copy link

Hello,

Thank you for the dataset!

It seems to me that in KELM there are lots of ill-formed triples (if you look at the "triples" part).

Examples:

  1. Splitted relations
['Cyndia Sieden', 'voice', 'type', 'Soprano'] 
['Joseph Campbell', 'academic', 'degree', 'Bachelor of Arts']

should be

['Cyndia Sieden', 'voice type', 'Soprano'] 
['Joseph Campbell', 'academic degree', 'Bachelor of Arts']
  1. Badly parsed subjects/objects
["Mark English ( athlete )", "2014 European Athletics Championships", "-- Men 's 800 metres award received", "Bronze medal"]

should be with the award received subproperty

["Mark English ( athlete )", "2014 European Athletics Championships -- Men 's 800 metres", "award received", "Bronze medal"]
  1. This two triples should be one (bad parsing of the object?)
["Duke Peter of Oldenburg", "sibling", "Princess Marie"], ["Duke Peter of Oldenburg", "Countess", "of", "Neipperg"]

should be

["Duke Peter of Oldenburg", "sibling", "Princess Marie, Countess of Neipperg"]
  1. Bad subproperty detection?
["Ghasem Dehnavi", "Saba Qom F.C. start time", "01 January 2015"]

should possibly be (?)

["Ghasem Dehnavi", "member of sports team", "Saba Qom F.C."], [ "Ghasem Dehnavi", "Saba Qom F.C.", "start time", "01 January 2015"]

etc...

All such examples can be derived when checking for the second element to be in the Wikidata property list (or for the third element in the case of subproperty).

Is there a way to correct all those bad triples?

Thank you!

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant