| « DLF Fall 2007 Forum Survey | A More Open Congress » |
Tim O'Reilly has blogged a couple of times about Google transitioning to a strategy of "trading for its own account" - in other words, moving to position where they acquire or actively license content, which is then incorporated and presented alongside content harvested on the external web. This leads to a number of potential conflicts in the ranking and presentation of content. In short, there is no guarantee that Google will not prejudice its own content over others, and in fact, every reason that it might, despite assurances they may provide to the contrary.
Tim, in his latest entry, cites a recent blog post by Anil Dash, Google and Theory of Mind, which discusses Google's effort to pull attention from Wikipedia toward its new UGC-based product, Knol. Anil expresses his concerns:
... Knol shares with Google Book Search the problem of being both indexed by Google and hosted by Google. This presents inherent conflicts in the ranking of content, as well as disincentives for content creators to control the environment in which their content is published. This necessarily disadvantages competing search engines, but more importantly eliminates the ability for content creators to innovate in the area of content presentation or enhancement. Anything that is written in Knol cannot be presented any better than the best thing in Knol.
The presentation of other people's content (belonging to publishers, authors, and the public) within Google Book Search [GBS] becomes an experience owned by Google, and engineered in a manner opaque to others, out of others' control, with complex attendant issues in how Google intermingles GBS results with externally harvested material. Most importantly, from the perspective of publishers and authors, there is very little they can do to control the discovery and presentation experience. Even ascertaining whether the results are flat across providers is beyond the realm of assured knowledge for any one content owner; Random vs. Harper vs. public domain -- no one can have definite knowledge of what is ranked, and in what manner, thus incapacitating innovation.
This has all been very much in my mind, as the director of DLF. When I was in Washington D.C. last December for the CNI Task Force meeting, I spoke with a couple of IP lawyers who are in the business of supporting non-profits and libraries. Both had interesting things to say relating to Google and its purported negotiations with publishers. (Neither have any engagement in the AAP/AG litigation with Google.) One of the most troubling comments related to an aspect of the possible settlement terms that I hadn't really thought through before.
As the New Yorker's Google's Moon Shot article relates, the most likely settlement agreement would involve a voluntary collective license in which revenue from viewing or sharing texts is shared among publishers, authors, and Google. "Well, I love deals like that," one lawyer friend opined sarcastically. "Everyone gets something for nothing, except maybe the libraries. People get money for things they don't deserve." I didn't understand, and he explained, "For orphan works, where there are undetermined rights holders, there would undoubtedly be revenue sharing between Google and the publishers, and the authors, if they are all treated collectively. So these parties will still be making money that does not really belong to them, except by virtue of any agreement that they might come to terms on." In other words, who ever owns the rights -- because no one knows for sure in the case of orphans, whether it be publisher or author or public -- the money will almost inevitably flow to commercial parties that have no claim to it; in the case where the work is rightfully public domain, but has not been exposed as such, then publishers and authors would make money at the public's expense.
Obviously, libraries would have to be the source of many of these out of print books in such an arrangement; otherwise the publishers would have possession of them, and likely provide them through the Google publisher's partner program. I wondered: would the libraries be able to direct any income from the utilization of orphan works that such an agreement might secure? Could libraries be counted upon to speak not just for themselves, but rather for the overall community, to secure the general good? No answers are forthcoming.
These are mere hypothetical musings; nothing save wild thoughts. Except that recent events keep bringing me back, like the incessant tug of a moontide, to Tim O'Reilly's writings on Google "trading on their own account."
At the very end of 2007, I was happy to announce that the University of Michigan, the first Google Book Search library partner, was willing to host U.S. congressional documents being digitized by the Internet Archive, Public.Resource.Org, and the Boston Public Library; this might eventually be the largest single mass of publicly available U.S. congressional documents on the web, and it is funded in part by a $250,000 grant from the Omidyar Network.
It struck me as interesting that no large research library, public or private, was willing or able to be the initial protagonist; nonetheless, I am heartened that the University of Michigan will act with its characteristic boldness to provide a second home and access point for the content; fwiw, Michigan has also released congressional hearings digitized through their partnership with Google, whereas Google has not.
Research libraries have been agonizingly conservative over these issues, afraid of rights entanglements, particularly. Congressional hearings sometimes include submitted material of varying rights status, but once published, all government documents are public domain by definition. Nonetheless, the possible presence of non-government generated material has kept Google from fully releasing these materials for viewing, fearful of providing another target for rights-related litigation. Spurning this concern as misplaced and ill-informed of the law, Michigan must be commended for their stance.
In consideration of these issues, a colleague at another large research university said of our community [n.b.: minor edits for comprehension]:
There was an ARL Directors meeting in Phoenix where they deliberated on a SPARC-related proposal to digitize government documents, complete with a business model. All of the brightest directors and our best friends had plenty of reasons why we should not do such a thing, and it went down in flames. It was a no-brainer, obviously, but pre-Google Book Search.
Several of those folks have now changed their tune. More importantly, I think Brewster Kahle has gone more vigorously after things like the government documents in part because of Google, and people take him more seriously because of Google's digitization. We might actually get someplace.
Perhaps my friend is right.
And pointedly, that (oblique) optimism is in the context of suspecting that the terms of any settlement between Google, publishers, and authors, should it be consummated, will be good for no one except those entities, on the basis of its own terms, particularly considered that such an agreement could conceivably advantage the public to a far greater extent than is likely to be the case.
Fundamentally, as Tim O'Reilly and Anil Dash have suggested, one has no choice but to suspect all that lies beneath GBS, which will, inevitably, serve its creators before and ahead of its peer book repositories elsewhere. Book search will not work like web search. And I worry -- as the precedent of government documents sadly argues -- that libraries may not have the strength or sense of purpose to stand up for any other than their own individual interests, neglecting the broader communities they serve.
In the beginning, at least. But in an off-handed way, like my friend, I sing praise to Google. For if our aims are in the near term defeated, if we fail to stand for ourselves, then through Google may finally rest an important redemption.
We must learn to trade for our own account - not the account of Google, Elsevier, the AAP, or the Authors' Guild. We must acquire, and build, a shared universe of information, freely available to all, on our terms. We must stand together for all we profess, against all danger -- stand for what no other organization in this world can: the fundamental right of access to information, and the compulsion to preserve it for future generations. This is not an economic imperative; nor will it ever be the goal of an internet advertising company. It is a mission that defines libraries.
We must not fail to shy from the fleetingly seductive "opportunity of a lifetime" motivations presented by Google and others who can only, ultimately, cultivate their fields in the service of an economy that prices their shares like so many toy balloons, with no bounds to their inflation, directed in a willy-nilly fashion, and exposed to an easy collapse after joyful hands of rough play have their say. Those hands are not ours. Those hands trade for their own account.
With our hands: let us trade for our own.
Peter Brantley, Executive Director, Digital Library Federation
2008:01:01
This post has 2 feedbacks awaiting moderation...
Server manager: contact