« The BookstoresOn scholarly communication and university presses »

Monetizing libraries


Monetizing libraries


"Proverbs for Paranoids, 1: You may never get to touch the Master, but you can tickle his creatures."

- Thomas Pynchon, Gravity’s Rainbow, 1973


A colleague in Europe recently forwarded to me the Google agreement with the CIC libraries. Even though I had been told this new agreement had some very different language from that in prior contracts, it was still eye-opening reading.

Simply put, the CIC libraries are contributing in-copyright material to Google for scanning, but for the first time (known to me), they will not get a copy back. (Michigan and Wisconsin have previously negotiated arrangements with Google, and the terms of the CIC agreement hold only to the extent that it permits the two universities to participate in a CIC-wide project; the contract terms of Michigan and Wisconsin are not superseded.)

Let's go to the contract. Clause 4.7(b) states:

Escrow Deposit. As Google "successfully processes" the works contained in the Selected Content, Google will place the University Digital Copy of such Selected Content in escrow on a secure server maintained by Google at Google’s cost and expense.


(The University Digital Copy being a copy of the material digitized by Google.)

Clause 4.9(a) then stipulates that "Works in dispute" may be withheld from Escrow deposit, and 4.9(b) permits a CIC institution to assert that a work thought by Google to not be public domain, is actually in such a condition, and as long as the CIC University partner indemnifies Google and provides assurances that any claims can be addressed, the institution can receive a copy of it. (This is a high bar, so don't anyone wait in line for it to be invoked unless there is fair certainty; that said, I know the University of Michigan has encountered situations [podcast] where they feel Google treads extremely conservatively, and where a reading of the law is clear enough to grant the University assurance to act.)

It is in "4.10 University Digital Copy of In-Copyright Works" where the story starts getting interesting:

In General. As noted in Section 4.1 above, Google may, in ways consistent with applicable Copyright Law, select and Digitize In-Copyright Works contained in the Selected Content. Such works will be part of the University Digital Copy and, as such, the Digitized files will be maintained in escrow as set forth in Section 4.7 above and released to the Source CIC University as set forth in Section 4.11. Until such time as these In-Copyright Works are released, Google agrees to provide CIC Universities with searchable access to such In-Copyright Works as described in Section 4.6 above.


(4.6 provides for a hosted solution by Google for the CIC libraries.)

Then come the constraints:

4.11 Release of In-Copyright Works Held in Escrow. Subject to the terms of this Section 4 Google agrees to enable download capability from the escrow to the CIC Administrative Offices or the applicable Source CIC University for one copy of the digital file for any In-Copyright Work(s) held in escrow in the event that any of the following release conditions (each, a "Release Condition") occurs:

(a) the In-Copyright Work becomes in the public domain;

(b) a Party has obtained permission through contractual agreements with copyright holders that includes the right to make a copy of the In-Copyright Work and to provide it to the CIC or Source CIC University;

(c) well established case law exists that In-Copyright Works can be copied and held by the CIC Administrative Offices and/or the Source CIC University without infringing on the rights of a copyright holder;

(d) if at any time Google is in material breach of its obligations under Section 4.3(b) or 4.6(a) and Google does not remedy any such failure within ninety (90) days after its occurrence (or, in the event such failure is caused by technical problems or causes similar to those described in Section 12.5, within such longer period as Google, working diligently, reasonably requires to remedy such problems); or

(e) the CIC Administrative Offices or the Source CIC University and Google agree in writing that the release of a particular In-Copyright Work or Works is legally supported and appropriate under the terms of this Agreement.


In other words - pretty much, unless Google ceases business operations, or there is a legal ruling or agreement with publishers that expressly permits these institutions (excepting Michigan and Wisconsin which have contracts of precedence) to receive digitized copies of In-Copyright material, it will be held in escrow until such time as it becomes public domain.

That could be a long wait.

Why was this done this way? What, after all, are the chances that residents of these U.S. Midwestern States might actually receive a digitized copy of the works that their tax dollars made a contribution towards purchasing? (N.B.: Not all CIC members are public institutions, but the overwhelming majority of them are.)

The answer to this question may well depend on the outcome of the larger contest.

In an article early this year in The New Yorker, "Google's Moon Shot," Jeffrey Toobin discusses possible outcomes of the antagonism this project has generated between Google and publishers.

Paramount among them, in his mind, is a settlement:

Google's endeavor is encountering opposition. A federal court in New York is considering two challenges to the project, one brought by several writers and the Authors Guild, the other by a group of publishers, who are also, curiously, partners in Google Book Search. Both sets of plaintiffs claim that the library component of the project violates copyright law. Like most federal lawsuits, these cases appear likely to be settled before they go to trial, and the terms of any such deal will shape the future of digital books.


Toobin then goes on to delineate what such a deal might actually look like:

The terms of such a deal aren't hard to imagine. The Authors Guild is concerned that pirated copies of the books on Google's site could leak to the public, and so the organization would insist on security measures. [...] As for distribution of the proceeds from the site, Google might agree to share revenue with publishers, in the way that radio stations pay for the music they play; publishers could receive a fee based on a statistical analysis of how often their books are viewed. Google could pay in cash or in kind, with advertising.


That's an eye-opening observation, and from my perspective as the Director of the Digital Library Federation, one that has not received enough speculation.

Obviously, any settlement would not cover the In-Copyright materials that are already part of Google's Partners' Program, in which publishers submit their material directly to Google for digitization (if necessary) and inclusion in Google Book Search. And, I think by any likely definition, a settlement would not include public domain material, which is differentially encumbered in these agreements. Therefore, a settlement must quite manifestly concern itself with works that are believed to be In-Copyright but where no publisher has stepped forward with an explicit opt-out, or where there is suspicion that they might be In-Copyright, but it is not known for certain (i.e., they are "orphan" works).

If this is what a settlement might cover, publishers will race to establish their historical rights portfolios with a zeal that will be astounding to watch.

More importantly, as Toobin intimates, remuneration must be involved, and it is at least open to suspicion that libraries will have to license access to the material covered in a settlement. (Perhaps paranoid on my part, but I have been long concerned about the new liabilities associated with moving into a realm of digital monograph licensing). The publishers and authors are not suing over tiddlywinks; there is real money at stake for them, far into the future, and the legal bills alone are already breathtaking. I find it hard to wrap my head around how a voluntary collective licensing arrangement such as the one Toobin describes might work in practice, but if the opportunity arose, I suspect that Google is clever enough to come up with something.

(I should note that a settlement would have several beneficial secondary consequences: someone, for example, will have to create a registry to record the known rights status of orphan works, a goal that everybody who is sentient, reasonably sane, and not drugged out of their minds desires. A derivative concern is whether or not such a registry would be public, or more specifically, what parties would have access, through what means, and under what conditions.)

Let us briefly return to the CIC deal. If a settlement along the lines that Toobin suggests occurs, then it is seriously in question whether the CIC institutions (again, excepting Wisconsin and Michigan, which have prior digitization agreements) would be able to obtain access to their In-Copyright works until such time as they fall into the public domain; I find it hard (not impossible, but hard) to imagine why publishers, as a community, would permit the CIC to obtain such copies; the "library copy" is something that has deeply irritated them since the Google Book Search program started.

I think the CIC agreement is a significant enough departure from the prior public contracts that we must take notice of its suggestions that the relationship between Google and publishers is maturing, and that Google is more cautious of the distribution of In-Copyright material than they ever have been before.

That may well be the harbinger of something broader; something that libraries and the larger public might not find much cause to celebrate. Observing, "[A] settlement that serves the parties' interests does not necessarily benefit the public," Toobin invites Larry Lessig to comment:

"It's clearly in both sides' interest to settle," Lawrence Lessig, a professor at Stanford Law School, said. "Businesses in Internet time can't wait around for years for lawsuits to be resolved. [...] For the publishers, if Google gives them anything at all, it creates a practical precedent, if not a legal precedent, that no one has the right to scan this material without their consent. That's a win for them. The problem is that even though a settlement would be good for Google and good for the publishers, it would be bad for everyone else."


Being neither Google nor a publisher, I'm in the part of the global community encompassed within Lessig's "everyone else."

Lessig continues:

"If Google says to the publishers, 'We'll pay,' that means that everyone else who wants to get into this business will have to say, 'We'll pay,' " Lessig said. "The publishers will get more than the law entitles them to, because Google needs to get this case behind it. And the settlement will create a huge barrier for any new entrants in this field."


A settlement between Google and publishers would create a barrier to entry in part because the current litigation would not be resolved through court decision; any new entrant would be faced with the unresolved legal issues and required to re-enter the settlement process on their own terms. That, beyond the costs of mass digitization itself, is likely to deter almost any other actor in the market.

And that to me is potentially the saddest loss, should such an arrangement come to be realized. Because in real terms, across this vitally important collection of humanity’s literature and thought, of all the ways of thinking about books and working with ideas on the Web, we might be left with only one way.

 

- Peter Brantley, Digital Library Federation, 2007:06:13

Jun 13, 2007 | Categories: DigLibs, Libraries, BookRights | pbrantley

9 comments

Comment from: Adam Hodgkin [Visitor] Email
A very interesting post. I am not sure that I buy your pessimistic and worried conclusion. Why should a settlement between Google and the Publishers lead to there being only one way to access books digitally? The Publishers, Authors and Agents obviously dont want this to happen. These author-side parties cannot possibly give Google any exclusivity (or they would attract a shower of lawsuits from their own side) and its not in their interest to prevent others signing up to whatever latitude is given to Google. There are plenty of big players who are already investing in digital media and can tackle this from a slightly different angle in an important way (obviously Amazon, MSFT and Apple). Why should a Google-legitimated approach close the door to other distribution possibilities.

The existence of a strong and growing public domain is crucial to there always being the possibility of an alternative distribution and aggregation strategy (3 cheers for the Gutenberg project, and another 3 cheers for Brewster Kahle). I dont see why Google's massive investment and slightly cack-handed negotiating approach (cack handed and boot in the door) should really stop other approaches. There will always be another way and Google is leaving plenty of gaps.
06/13/07 @ 23:59
Comment from: Jerome McDonough [Visitor] Email
While I'm in agreement with your analysis on the likely outcomes, I agree with Adam that all will not be lost if this comes to pass, but for slightly different reasons. Copyright law is just that: a set of laws. And they can be changed. So, I'm sticking with my position on this one. The outcome we're seeing unfold is unfortunate, but predictable (and in my not particularly trustworthy opinion, legally correct) under the current copyright regime. If we want to change the outcome, we work to change the copyright laws. That will, of course, put the library community at war with publishers of all kinds, but probably not with Google and the rest of the search companies. If we want the freedom to do what we want to do with this material, the law is going to have to change. End of story.
06/14/07 @ 08:48
Comment from: Michael W. Perry [Visitor] Email · http://www.InklingBooks.com/
I'm delighted by the news that Google is showing a willingness to be flexible and negotiate a deal with publishers. Google has never had a legal leg to stand on. Digitized, searchable databases of books are derivatives in the same sense that abridgments and movies are. Any suggestion that they can be done without the copyright holder's permission is absurd. Quite a few movie derivatives show less of a book's plot (and far more added creativity) than Google does and yet they still require permission.

The real potential losers in what Google has been doing are authors of helpful 'how to' and recipe books with limited sales. If anyone online can scan and post up to 20% of those books, they make buying them pointless, and those authors don't have the money to sue. Also, authors who'd like to escape an unhelpful publisher and bring a book back into print elsewhere may find themselves locked to a publisher who's happy to pick up pennies from Google's digital copies for doing nothing. That issue needs to be addressed.

The real answer probably doesn't lie in the courts, which as noted can take years, or in private agreements, which leave everything in legal limbo. The real answer lies in comprehensive legal reform. Strong copyright protection needs to be linked to registration with constantly updated contact information required that sharply reduces the cost of tracking down the copyright holder. (Those who don't register get only limited rights defined by law.) And with copyright terms in the US now absurdly long, passing on literary rights needs to be part of the boilerplate of every will probated in this country. I'm planning to publish a book in the near future that'll lean on the slender reed of having all the children of the author orally agree to the publication. No rights to a literary estate were passed on. What if one disagrees or changes his mind?

Unfortunately, today's Congress doesn't seem to be up to the task of hearing all parties and coming up with a fair balance of interests. With control of Congress on a razor edge, both parties seemed locked into winning a majority by dealing out pork to special interests. The Republicans were bad. The Democrats now in power seem to be vying to be still worse.

But seeming miracles do happen. Just yesterday Bowker sent me an email informing me that their Books in Print database has been radically improved. I checked, and they are right. What I've been telling people was "the worst online database on the Internet" is now quite easy to use.

If Google really wants to put access to information online, information I use quite extensively in my editing, they should consider offering to create powerful rights and texts databases and giving them gratis to the world--databases that don't tilt the results toward any one source or distribution model. A good example of how that could be done in another area is the OCLC Worldcat database at:

http://www.worldcat.org/

It's a marvelous tool for locating books in libraries. We need a similar one stop source for locating rights holders and for locating published books and etexts, either commercial or free, online or elsewhere. No pure business seems able to pull that off. Amazon, for instance, no longer sells ebooks in any format but their woefully inadequate proprietary one. All the pdf books I had there--historical texts that can't be adapted to their quirky format--are now gone, ebooks that I'd created as an alternative to our more pricey printed versions for students and researchers in a hurry or a tight budget. There should be a place where both the printed and ebook versions are displayed side by side.

Haste makes waste. Google's hasty attempt to put copyrighted books online when non-copyrighted texts are still in limited supply has wasted a lot of 'Internet time.' They need to listen more to their critics, pontificate less, and be a bit more creative and helpful, not equating what's good for Google with what's good for everybody. These recent moves are a step in the right direction.

--Michael W. Perry, Inkling Books, Seattle
Author of Untangling Tolkien


06/14/07 @ 09:06
Comment from: K.G. Schneider [Visitor] Email · http://freerangelibrarian.com
Thank you for posting these important links and your analysis of the agreement. I was asked at my NASIG keynote why libraries buy into Google's agreements if they do not benefit their users in the long run, and I was at a loss for words. I can't really believe librarians don't read what they're signing, but as one of "everyone else," I too am worried.
06/14/07 @ 09:17
Comment from: Adam Corson-Finnerty [Visitor] Email · http://www.library.upenn.edu
I am not sure quite what to make of your informative post. It sounds like Google partner libraries are wasting a lot of time in cooperating with in-copyright scanning. Just taking the books off the shelves, transporting to the scanning site, and then putting the books back involves many hours of staff time. And in return the libraries don't even get a digital copy until all the legal kinks are worked out?

Your second concern, about paying for online monograph access--well, that's going to happen anyway. Publishers like HarperCollins are moving their own stuff online, in addition to whatever Google gets its hands upon.

We can already purchase digital downloads of books, through SONY Reader, and this traffic will increase as the readers get better, notes can be taken, passages highlighted, etc.

We can purchase POD copies of scanned books, in and out of copyright. We can even scan our own out-of-copyright material and upload it for printing, or print in the library.

So what exactly is the problem? Is it that we are going from an ownership model to a license model? From pay once to pay per drink? That movement is already far along.

06/14/07 @ 12:50
Comment from: pbrantley [Member] Email
Since this post was written, I have heard renewed speculation that the escrow clauses were actually placed by CIC counsel to mitigate the risk of litigation. Although I personally find this speculation odd, as CIC is still contributing In-Copyright material, and only IMHO a stilted, conservative, and deeply risk-averse, read of the current copyright regime failing any vestige of normative insight might lead them to this averment, I never cease to be amazed at the actions of universities and libraries (that I love, oh with so much hurt). Still, I withhold judgment in the hope that a semblance of rectitude still persists in the American Midwest.

2007:06:20. Confirmed.
A sense of rectitude still persists in the Midwest.
06/14/07 @ 17:50
Comment from: Peter Hirtle [Visitor] · http://blog.librarylaw.com
I am one of those who was surprised by your assumption that it was Google, and not the CIC, that came up with the escrow option. I am willing to bet that a lawyer at one of the CIC institutions drew the not unreasonable conclusion that receiving a copy of a scanned in-copyright work pushes the boundary of fair use too far and might place the institution at some risk. A reasonable alternative is presented by the CIC contract - that Google must provide the public free continuing access to the scanned works (see 4.3.B), but that ownership of the digital files will only pass to the CIC schools when it is clearly legal. Since most Google partners don't seem to know what they are going to do with the scanned works, and since it is very unclear what, if anything, a school could legally do with a scan of an in-copyright work, this doesn't seem like a major problem.

The important thing is that the public is going to get improved full-text access to probably 8 million volumes (if we assume 80% of all works are protected by copyright). This seems like a great thing, and the CIC should be applauded.
06/15/07 @ 14:31
Comment from: Rebecca [Visitor] Email · http://blog.oup.com
In light of this dicussion I was curious to see what your thoughts were on this post, http://blog.oup.com/2007/06/google_library/. Evan talks about the best way to punk Google.
06/19/07 @ 09:12
Comment from: pbrantley [Member] Email
I am at a conference, so this is very brief. I think the OUP post is rather drastic. I fully support the vision of GBS. I think the dispute with libraries demonstrates the stunning inadequacies of current copyright regimes and CR expectations in the U.S. and overseas. There is a huge mass of book material that publishers cannot speak for, because they do not even know who owns it. Registries, contractual agreements, and shifts in copyright legislation are all part of the solution. I thought the OUP post too one-sided.
06/19/07 @ 15:39

Leave a comment


Your email address will not be revealed on this site.

Your URL will be displayed.
(Line breaks become <br />)
(Name, email & website)
(Allow users to contact you through a message form (your email will not be revealed.)
7 + 3= ?
antispam test
This is the personal blog of Peter Brantley, and the opinions expressed here are his own and are not reflective of any of his employers in the continuum of history, or the University of California, which provides support for this blog.

Join EFF today

Recent Posts

Search

Subscribe

  • RSS
  • Bloglines
  • MyYahoo!
  • MyMSN
  • Newsgator
  • Google Feeds
How to subscribe
free open source blog

Server manager: contact