-
Notifications
You must be signed in to change notification settings - Fork 15
Analysing overlapping features in OSM using tile-reduce #145
Comments
This is a great idea! OSM QA Tiles aren't the best for this research though because - no relations, tile edges, multipolygon handling. @amishas157 Let's use osmium. Take a look at osmlazer for a sketch. |
@geohacker Yes, that would be great. Now how I am thinking of approaching this problem is following: Find the overlapping features with an area greater than a threshold value and find the different kind of overlapping and overlapped features. And then doing a deeper analysis on the combination of two. |
Analysing Feature Overlaps Idea behind doing this analysis is to get an idea what kind of overlaps exist in OSM , get an estimate of the numbers for the same and also to differentiate a good overlaps from the bad ones. The example of good ones can be building overlapping with landuse feature but the bad ones can be water feature overlapping over buildings. The above process is carried out as following:
Analysis for Monaco, a small city in Europe is done initially.
Results from overlaps:
The first JSON object represents primary tags present in feature1 and second represents primary tags present in feature 2. The third parameter gives the list of count of such overlaps. In the above case: highest overlaps is found with Next action is to perform the same process for a large city and see what all comes out. |
Wow! Awesome analysis Amisha! |
@amishas157 💥 ! this analysis is amazing. Is it possible to list down what next actions look like to you here? |
Next actions:
|
Cleaned up the JSON @amishas157 posted ^ into a csv for 👀 better It is super-interesting to see The highest overlap of |
Here is the updated JSON object. https://gist.github.com/amishas157/ec0f042d7e69a576a337d156742547f5 after removal of few dups and improving a bit of logic. Thanks @bkowshik for the CSV. 🙇♀️
Yes, correct. But this seems to be a legit overlap kind no 🤔 ? |
Per voice with @manoharuss and @amishas157 Priority
Rendering
Percentage of overlap
Noise
|
Updates
|
Analysis for overlaps between Total number of overlaps found: 37 Based on eyeballing these overlaps, can be categorized as following:
Learnings:
|
I still think it is a valuable addition, especially in light that while Case 1 is not found very often in the map - it is some bad vandalism we've seen before. Apart from water I think this will helpful to help detecting Pokemon users adding new parks on top of buildings. |
Also the detailed documentation how you approach this problem is an inspiring example! Thanks for digging into this. |
Really enjoying how this is moving, awesome work @amishas157 🎉 |
Awesome work @amishas157. @krishnanammala and I reviewed 32 changesets out of which 3 were found to have been actionable. Hitrate: 9.3% Observations:Overlap feedback
|
@amishas157 This changeset was flagged with 3 features for Feature overlap comparator. https://osmcha.mapbox.com/48936480/
|
Review Feedback 6th JuneI went on by reviewing unchecked changesets by feature overlap comparator in OSMCha and captured notes on the noise observed
Let us consider this as an exception for leisure = parks vs buildings, but the park originated from an experienced user with 3k changesets. So maybe we should think about adding new user condition to the comparator. I am have a doubt on max zoom as well - https://github.com/mapbox/osm-compare/blob/master/comparators/feature_overlap.js#L14
|
Thank you @manoharuss! So looks like we have two major problems to address here:
|
Ref : Currently we have a feature overlap comparator to flag all newly added (version 1) water bodies which overlaps with any of the existing features.
Following are the uncertainities discussed by @bkowshik in the referenced post.
Per discussion w/ @bkowshik ,to study more on above discussed issues, we can perform tile-reduce to find out existing overlapping features in OSM and visualize the different overlapping combination and count of such features. It would help us in getting a list of feature types which can help us tighten this compare function.
How I am seeing it is we will get a count as following:
Overlapping feature type | Overlapped feature type | Count
Here is the tile-reduce script I am working on. So this script finds out all waterbodies in a tile and then checks for other features in the same tile overlapped by it.
One issue in above process is we will miss counts from the relations as mb tiles used in tile-reduce doesn't contains relation type features.
Would be glad to feedbacks from the team.
cc @batpad @geohacker @planemad @lukasmartinelli @ian29
The text was updated successfully, but these errors were encountered: