In general I would totally agree with cleaning up data.
-
Improving the quality of the spot data on https://hitchmap.com has been planned for a long time.
Currently spots can be placed anywhere, which has led to the current state of duplicate and misplaced spots.
We want to discuss the cleaning measures one by one, each in its own issue. The issues are tagged `cleaning`, see: https://github.com/Hitchwiki/hitchmap-data/labels/cleaning. You are free to add more issues for this purpose.
For each measure we first clarify whether to implement it and then how. Sometimes we have to parameterize a measure; parameters are marked with an "x". The data belongs to all of us, so changing it is a severe intervention. The more real-world hitchhiking experience we can incorporate, the better. So I am hoping for your participation and that we will find common ground.
For each measure there should also be a way to ensure that new data adheres to it.
Giving an example:
We will apply the measure to the current dataset of spots and give all resulting changes that are not minor to community members to review and approve.
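To make the idea of a parameterized measure a bit more tangible, here is a minimal sketch. It assumes a purely hypothetical measure, "flag spots that are closer than x meters to another spot as possible duplicates", and that a spot is just a `(lat, lon)` tuple. The function names and the measure itself are illustrations only, not what the code in the repository does.

```python
import math

def haversine_m(lat1, lon1, lat2, lon2):
    """Great-circle distance in meters between two (lat, lon) points in degrees."""
    r = 6_371_000  # mean Earth radius in meters
    phi1, phi2 = math.radians(lat1), math.radians(lat2)
    dphi = phi2 - phi1
    dlmb = math.radians(lon2 - lon1)
    a = math.sin(dphi / 2) ** 2 + math.cos(phi1) * math.cos(phi2) * math.sin(dlmb / 2) ** 2
    return 2 * r * math.asin(math.sqrt(a))

def find_close_pairs(spots, x_m):
    """Hypothetical measure: index pairs of spots closer than x_m meters.

    Plain O(n^2) comparison, fine for illustration but not for the full dataset.
    """
    pairs = []
    for i in range(len(spots)):
        for j in range(i + 1, len(spots)):
            if haversine_m(*spots[i], *spots[j]) < x_m:
                pairs.append((i, j))
    return pairs

def violates_measure(new_spot, spots, x_m):
    """Same rule applied to a new spot, so that new data adheres to the measure."""
    return any(haversine_m(*new_spot, *s) < x_m for s in spots)
```

The same parameter x is used both for cleaning the existing data (`find_close_pairs`) and for checking new spots (`violates_measure`), which is the pairing described above. The flagged pairs would then go to review rather than being merged automatically.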
The code for cleaning resides here https://github.com/Hitchwiki/hitchmap-data/tree/main/cleaning.
To make this more concrete, we gave it a first try with the measures described here https://github.com/Hitchwiki/hitchmap-data/blob/main/cleaning/README.md. The result is https://github.com/Hitchwiki/hitchmap-data/blob/main/cleaning/map/map_Germany.html, which you can view in your browser.
You are very welcome to request some analysis on the data or do it yourself to boost our discussion.
A further idea is to aim at eventually integrating our spot data into OSM. This implies that spots are assigned to real-world features, e.g. the road or gas station you are hitchhiking at. This would increase data quality a lot. On the other hand, it would take away the simplicity and freedom that the current "just-place-your-spot-anywhere" policy provides. We are super happy to discuss this below :)