Sensor Location Obfuscation #27
Replies: 8 comments 1 reply
-
I found this article when looking for if anyone already had guidance on managing location data within citizen science. Recommendations include:
Another resource to take a look at is PurpleAir's privacy policy:
Other citizen-science examples: |
Beta Was this translation helpful? Give feedback.
-
For some context about spatial resolution and location precision, NOAA's High-Resolution Rapid Refresh (HRRR) model which is used to make those smoke maps you may have seen, is at 3 km spatial resolution. Another high resolution data product is NASA/ORNL's DayMet which is 1 km spatial resolution. To get 1 kilometer spatial resolution, that is about 0.01 degrees of precision. Another advantage to "snapping" sensor locations to a 0.01 degree (or whatever level of precision) grid, is that we won't have to see location drift as the GPS quality changes over time. That would make using archived data easier if a true stationary sensor actually has fixed coordinates rather than showing a bunch of noise (and therefore false movement) due to GPS signal quality. |
Beta Was this translation helpful? Give feedback.
-
Wow @spestana this is really helpful to have numbers to ground this conversation and decision. It definitely seems like a 1 km spatial resolution is adequate for the state-of-the-art science at the moment based on your research. As you've also pointed out, quantizing (snapping) the GPS coordinates to a grid-like that definitely resolves the drift issue as well! It seems like that level of obfuscation would be a nice way to alleviate any privacy concerns as well. @spestana do you think it would be adequate to just round the GPS Lat / Long to that 0.01 degrees of precision? I would vote for keeping the altitude at the highest precision we have available :) If that seems good to you @spestana and @brianstrauch this should be trivial to implement on the sensor itself, so more precise GPS data never hits a database anywhere :). |
Beta Was this translation helpful? Give feedback.
-
@yuzhang0302 fyi |
Beta Was this translation helpful? Give feedback.
-
Adding to this discussion: if we are rounding to the nearest 0.01, should we be taking average of datapoints that are within the same 1km square? For instance, you have 2 sensors located on the same block, should we be displaying multiple sensors on the map or would we want to take the average of these sensors' readings? We might not need to address this right away, it would be an edge case, hopefully the sensors are more spread out. |
Beta Was this translation helpful? Give feedback.
-
FYI I've created Issue 41 in the sensor repo to implement this change. |
Beta Was this translation helpful? Give feedback.
-
I've just deployed this change all. You can see here that the coordinates are now rounded to 2 decimal points: |
Beta Was this translation helpful? Give feedback.
-
The distance is about 318 meters away from the actual location of my sensors here. I'm wondering if that's a bit too far? Perhaps 3 decimal places would be sufficient? That would be about 111 meters of drift vs the 1 km of accuracy right now. What do you all think? |
Beta Was this translation helpful? Give feedback.
-
In #19 we started discussing whether the precise location of the sensors should be obfuscated. Let's continue that discussion here.
Beta Was this translation helpful? Give feedback.
All reactions