Verifying Crowdsourced Spatial Data Without Central Gatekeepers
source link: https://blog.foam.space/verifycrowdsourcedspatial-29067e15435d
Go to the source link to view the article. You can view the picture content, updated content and better typesetting reading experience. If the link is broken, please click the button below to view the snapshot at that time.
Verifying Crowdsourced Spatial Data Without Central Gatekeepers
State of the Map-US 2018, OpenStreetMap and FOAM’s approach to the challenge of geospatial data verification
From Oct. 5–7th in Detroit, FOAM had the opportunity to participate in State of the Map-US (SOTMUS) as both sponsors and speakers. SOTMUS is an annual gathering for stakeholders of the open-source mapping community based on OpenStreetMap (OSM) in the US. You can learn more about what the organisation does here and here)
I attended the conference as a FOAM community member hoping to see the perspective of people deeply involved in the mapping industry. In this article I will:
- Give a general overview of OSM and current data verification processes
- Look back on our experience at the conference along with feedback and reactions from participants
- Give a call to action for the OSM community on how FOAM can return verification back to local volunteers
WHAT IS OSM AND DATA VERIFICATION?
OpenStreetMap is an initiative started in 2004 with the intent to provide an open and free alternative to other existing mapping solutions. From Wikipedia:
“The creation and growth of OSM has been motivated by restrictions on use or availability of map information across much of the world, and the advent of inexpensive portable satellite navigation devices.”
People from all around the world have participated in mapping their local areas for the past 14 years. By and large, it has worked remarkably well. The OSM community has seen steady growth up to the present and over 3 million changes added per day to the global set.
One reason for OSM’s continued success in data collection has been the fact that anyone can join and edit.
“Contributors include enthusiast mappers, GIS professionals, engineers running the OSM servers, humanitarians mapping disaster-affected areas, and many more.”
However, precisely because of this open nature, there is always the problem of malicious users adding problematic data points; this includes data that is false, malicious or from copy-written sources (more on vandalism here). The most recent significant issue occurred at the end of August, where an OSM user changed the label for New York City to “Jewtropolis”. This was only one change of many, per the NYT:
“One of the [OSM] volunteers made more than 80 anti-Semitic or hate-oriented edits to locations around the world”
The malicious data ended up in front of end consumers that use the OSM base map, an outcome which they certainly would have liked to avoid. Problems of data vandalism, paired with other issues like raised rates elsewhere, have encouraged the growth of enterprise verification on top of OSM. Services like this clean and sanitise data before it makes it to subscribing customers. One of the more prominent companies performing verification is Mapbox, which sells access to their verified data to corporate entities, including Snap, the NYT, and Mastercard.
ENTERPRISE SOLUTIONS TO VERIFICATION
At SOTMUS, Mapbox presented a few of their solutions, one of which included quadrupling the number of human reviewers for every change they incorporate. Another encouraged the use of a central repository called OSMCha that many enterprise stakeholders would be able to use to cross-reference. Other strategies under development include AI and machine learning tools.
It remains to be seen whether these solutions will be both effective and financially sustainable in the long-term. Zooming out, we can see a larger issue, and one which the community has been aware of for a while now: the ethical dilemma of monetising access to data mostly provided by volunteers.
How can local knowledge be accurately represented and updated? What mechanisms can ensure fresh POI data? In his talk at SOTMUS, Lukas Martinelli from Mapbox acknowledged the paramount importance of local knowledge:
“In the end the local contributor knows best. there are languages we don’t speak, … cultural references only you get, and that’s where the community comes in and map gardening needs to get better, we need better tools for that .. that the community can see issues, flag them, fix them, and also mark those bad edits and their hotfixes.”
FOAM agrees — mappers need to have the best tools and most effective frameworks to be successful as the community grows.
The focus on enterprise solutions over time shifted the narrative away from the original OSM ethos of ground truth. Speaking with many attendees at the conference, it was clear that many others felt the same way — while great enterprise work was being done, it seemed like the passionate volunteers who had done much to grow the ecosystem had less and less influence. Are there alternative models that can return value to the users who form the backbone of the community?
WHAT IS FOAM’S APPROACH?
We were happy to share the larger FOAM ideals with the people who stopped by our booth. At its core, FOAM is about keeping data verification and the economic incentives in the hands of the geospatial community, in line with OSM’s original vision.
FOAM is about keeping data verification and the economic incentives in the hands of the geospatial community
To watch Ryan’s lightning talk on the FOAM map at SOTMUS, check out the first talk here:
Our first products are the Spatial Index and the recently released FOAM Map. Both components are complementary within the FOAM vision to construct a new Web3.0 compatible location protocol.
How does the POI mechanism work? Simply put, FOAM tokens are staked by mappers to a point, representing their confidence in the veracity of the POI data. Inaccurate POI data can be challenged by other cartographers. Successful challenges are then rewarded by distributing the previous stake to the participants that voted on the challenge side. For a more detailed dive on mechanics and incentives, check out this writeup:
By requiring an economic stake for mappers adding info, vandalism is not as likely to crop up — hopefully mitigating the previously discussed enterprise centralisation. The opportunity of data validation gets returned to the edges of the network; the users and volunteers. Most significantly, map stakeholders gain an unprecedented ability to participate in network governance — how parameters are set and the data integrity maintained. Active mappers now have a seat at the table, a virtuous cycle that compounds directly back into the network.
The FOAM model is open and free for anyone to use or contribute to — hopefully this article piqued your interest in participating. We would love to have anyone take a look at our first live product at map.foam.space.
Learn more about the issues and abuses of user location data in Web2.0 through this reading list:
Aggregate valuable and interesting links.
Joyk means Joy of geeK