« Hipster Books and Old CDsSidewalk traces »

Open data isn't


Open data isn't


On the open government mailing list, my friend Thomas Lord made an observation about the ownership and management of the data generated by our engagement with mashups of services that we tend to assume are "free".  The key insight is a trend toward privitization of access through the pre-eminence in the marketplace of these services, which results in the removal of public sector alternatives or a diminuation of incentives to create public services. 

I asked for permission to cite his thoughts in full, and he graciously agreed.  There are ramifications for most online services, such as Google Books, which are rarely considered.  Obviously these commercial services are useful, often enlightening, but we need to define and exert expectations for privacy and management of these data.

Below, then, his words -


[W]hat about those increasingly popular "crime maps"?   They take public data from the police and, via "mash-ups", not only format and deliver that data to Google, but also deliver a real-time history of interest in that data.   Does an upward trend in detailed exploration of a crime map by IP addresses correlated with the region indicate a rising concern about crime that suggests an upward trend in crime and degradation of the neighborhood?  Or is it just mass hysteria?   Or just a spike curiosity or civic engagement?  The metaphysically true answers to such questions may matter less than how those in possession of the intel treat it:  If lenders decide "that probably means property values there are shaky" then that's a self-fulfilling prophecy.

It would be harder and more expensive to build the same functionality for citizens in a less (not un-) exploitable way.   For example, in mash-ups, data is formatted and given to Google and a fine-grained record of clickstreams is formatted and given to Google.   An alternative is for a jurisdiction to host its own servers, use its own maps, and avoid Google altogether.

There is a principle here, therefore:

Making open data more accessible to software does not make the data "more open" if the new uses that are created can only be created by immediately combining the data with proprietary data and services. That's a privatization of a civic function, not an "opening".

The crime map services are not a democratization of access to the data: they are a privatization of asymmetric, preferential access to the data.

See it?

Prior to these maps the main way that neighborhood crime data propagated in Berkeley was through registered neighborhood groups, each of whom got police department contacts in exchange for their registration.   It was a two-way channel between all the stake-holders.

The rush to put data on-line and perhaps turn a quick buck or get some resume fodder with a mash-up has helped to sever the two-way channel and meanwhile invited in Google, marketers, and others to lurk around and spy on the propagation of the information.

-t

(C) Copyright 2008, Thomas Lord (lord [at] emf dot n-e-t)

This work is licensed under the Creative Commons Attribution-Noncommercial-No Derivative Works 3.0 United States License. To view a copy of this license, visit http://creativecommons.org/licenses/by-nc-nd/3.0/us/ or send a letter to Creative Commons, 171 Second Street, Suite 300, San Francisco, California, 94105, USA.

 

Jun 25, 2008 | Categories: DLF, Identity, Privacy, Community | pbrantley

3 comments

Comment from: Paul Courant [Visitor] Email · http://paulcourant.net
The choice between well-funded, far-seeing, wise, intelligent government and the world that we live in is a false choice. We never had such a government. Neither did ancient Athens.

It has always been the case that private entities (R.H. Polk, for example) have used public data and reused them to provide detailed geographic information to those who were willing to pay for it. Is that better or worse than the example with Google maps? I would argue the old way was less democratic, and less open, although we are still, alas, short of being able to get our news from the brow of the philosopher-monarch.

More generally, if private entities can add value (preferably not by monopolizing something, but by actually adding value) to public data, that's capitalism at its best. Along these lines, I have an entry on my blog (Au Courant at paulcourant.net) that explains why I choose to have an attribution only Creative Commons license on the blog. I may, of course, be wrong about this, but my position derives from the notion that we want date to be used in ways that people value.

Again, the choice is between levels of imperfection, not between the ideal and something less.
06/26/08 @ 03:24
Comment from: Raymond Yee [Visitor] Email · http://blog.dataunbound.com
What does Thomas Lord suggest that we who would otherwise use a commercial service like Google maps to do? Not create mashups at all? Create mashups that involve no commercial services? (I'm writing as someone interested in both mashups and helping people make sense of public data.)

06/26/08 @ 07:20
Comment from: Samuel Generoux [Visitor]
I actually think that this is an excellent and highly relevant observation. In terms of Google maps and Google books, we might be looking at different types of things, but the pattern is similar. In terms of access to content, this is really looking more like an issue of degree or maybe type of access. Google is indeed providing more access to material than was previously available, but there are costs and trade-offs. There is certainly no harm in saying and discussing this point. The solution might be to continue to work towards mapping and articulating these types of issues and promoting discussion as a way of understanding how information and culture can be and should be shared in a networked digital environment. In order to make the most informed decisions, libraries, archives, museums and holders of public information need to understand the ramifications of these partnerships and the terms and conditions that come with them.
Additionally, we need to understand what is being provided by Google and the limitations of what is being provided in order for our public institutions to create and adjust their policies, priorities and actions.

I just finished reading Jonathan Zittrain’s Future of the Internet and I think that his book does an amazing job of parsing out a lot of these issues. He uses the term “generative” to refer to the degree of openness in being able to innovate, experiment and hack on content and technology in order to create new things. Services like Google Books allows us to search across vast amounts of content for free and it provides citation, snippet and some full text access for free. A powerful and highly useful service. However, users are quite limited in how they can hack up and riff on this mass of data. It is open to search, open to read (to various degrees according to copyright), but not completely open to do R&D and other experimenting. Google is providing an API platform for others to develop and play with the data, but as Zittrain points out, this openness is contingent. Services and tools built on open APIs can be withdrawn at anytime by the owner of the API (in this case Google).

I think a good perspective on all of this is that it is not an either-or debate. Not completely open or completely locked down. There are and will continue to be degrees of openness, but it is important for libraries, archives and museums to make decisions about how they share content based on an understanding of the networked digital environment and the degrees and types of access.
07/28/08 @ 12:42

Leave a comment


Your email address will not be revealed on this site.

Your URL will be displayed.
(Line breaks become <br />)
(Name, email & website)
(Allow users to contact you through a message form (your email will not be revealed.)
7 + 3= ?
antispam test
This is the personal blog of Peter Brantley, and the opinions expressed here are his own and are not reflective of any of his employers in the continuum of history, or the University of California, which provides support for this blog.

Join EFF today

Recent Posts

Search

Subscribe

  • RSS
  • Bloglines
  • MyYahoo!
  • MyMSN
  • Newsgator
  • Google Feeds
How to subscribe
powered by b2evolution

Server manager: contact