-
Greetings! I have been looking into the Overture data, specifically the buildings layer. One problem I see is that some of the buildings coming from Microsoft are not actually buildings. They appear to be a mix of containers, trucks, vans and other objects that have been classified as buildings. For example, in this area in Helsinki, almost all of the small buildings come from the Microsoft source and are not actually buildings. Right now there are no attributes (e.g. ML detection reliability) that I could use to filter these out. The issue is very widespread, and I have seen it in multiple geographies. These false buildings are very distracting for a map user, especially when they are placed on roads. Would you have any thoughts on how this issue could be addressed? I have also opened a ticket upstream, but haven't received any feedback there: microsoft/GlobalMLBuildingFootprints#87
-
Thanks, @tjukanovt. And thanks for also posting over on the upstream repo. We've had some discussions around how to collect and process these kinds of issues. @cbeddow Is this the kind of thing you're thinking about being able to handle?
-
I have had the same experience, having worked with the Microsoft building data in Sweden before and now seeing that it is part of the Overture data as well. Here there are so many false positives that you simply cannot use it with confidence. I also see containers, cars, etc., which is understandable. But I also get buildings on bare cliffs by the ocean, at random spots in the forest, and in the mountains in the north of Sweden. They are just not reliable. I have also reported this directly to the Microsoft building project, but it seems our region is not a priority, since there is never a reply. Still, I think it is important for those working on the Overture project to know that the Microsoft building data is not reliable.
Thanks. We actually do some of that now: ML buildings that overlap water bodies or roads from OSM are not included in the Overture distribution. We noticed that many stationary boats are classified as buildings in the ML datasets, so they are removed because they sit inside a water polygon (water polygons are also included in Overture). But as you point out, that doesn't fix the issue for standalone false buildings. For those, we'd have to collect reports in a separate database, since there's no way to flag them in OSM or elsewhere. And that collection system could itself be abused by bad actors who want to incorrectly flag certain things. We're actively thinking about this.
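For readers who want to try a similar filter on their own extract, here is a minimal sketch of the idea described above: dropping ML-derived footprints that fall on water polygons or roads. It is not Overture's actual pipeline. The record layout (`id`, `source`, `ring`), the function names, and the `road_buffer` parameter are all hypothetical, coordinates are assumed to be planar (e.g. metres in a projected CRS), and the check is simplified to the footprint's vertex-average centroid rather than a true polygon overlap, which a geometry library would handle properly.

```python
from math import hypot

def centroid(ring):
    """Vertex average; a rough stand-in for the true polygon centroid."""
    return (sum(p[0] for p in ring) / len(ring),
            sum(p[1] for p in ring) / len(ring))

def point_in_polygon(pt, ring):
    """Ray casting: toggle 'inside' for each edge crossed by a ray going right."""
    x, y = pt
    inside = False
    for i in range(len(ring)):
        x1, y1 = ring[i]
        x2, y2 = ring[(i + 1) % len(ring)]
        if (y1 > y) != (y2 > y):  # edge straddles the ray's y level
            x_cross = x1 + (y - y1) * (x2 - x1) / (y2 - y1)
            if x < x_cross:
                inside = not inside
    return inside

def distance_to_segment(pt, a, b):
    """Shortest distance from pt to the segment a-b."""
    dx, dy = b[0] - a[0], b[1] - a[1]
    seg_len_sq = dx * dx + dy * dy
    if seg_len_sq == 0:
        return hypot(pt[0] - a[0], pt[1] - a[1])
    t = ((pt[0] - a[0]) * dx + (pt[1] - a[1]) * dy) / seg_len_sq
    t = max(0.0, min(1.0, t))  # clamp the projection onto the segment
    return hypot(pt[0] - (a[0] + t * dx), pt[1] - (a[1] + t * dy))

def filter_ml_buildings(buildings, water_polygons, roads, road_buffer):
    """Keep non-ML buildings; drop ML buildings centred on water or near a road."""
    kept = []
    for b in buildings:
        if b["source"] != "ml":  # hypothetical source tag
            kept.append(b)
            continue
        c = centroid(b["ring"])
        on_water = any(point_in_polygon(c, w) for w in water_polygons)
        on_road = any(
            distance_to_segment(c, line[i], line[i + 1]) <= road_buffer
            for line in roads
            for i in range(len(line) - 1)
        )
        if not (on_water or on_road):
            kept.append(b)
    return kept

# Made-up sample data in planar units.
water = [[(0, 0), (10, 0), (10, 10), (0, 10)]]
roads = [[(20, 0), (20, 30)]]
buildings = [
    {"id": "boat", "source": "ml", "ring": [(4, 4), (6, 4), (6, 6), (4, 6)]},
    {"id": "truck", "source": "ml", "ring": [(19, 10), (21, 10), (21, 14), (19, 14)]},
    {"id": "shed", "source": "ml", "ring": [(40, 40), (44, 40), (44, 44), (40, 44)]},
    {"id": "osm-pier", "source": "osm", "ring": [(1, 1), (3, 1), (3, 3), (1, 3)]},
]

kept_ids = [b["id"] for b in filter_ml_buildings(buildings, water, roads, 3.0)]
print(kept_ids)
```

In this sample the `shed` record survives because it touches neither water nor a road, which mirrors the limitation discussed in the thread: a standalone false positive cannot be caught by an overlap check alone.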