Peter Friedman
Associate Professor, Legal Analysis & Writing
Case Western Reserve University School of Law
Ruling Imagination: Law and Creativity
Let’s get straight the historically profound benefits of making information available online — Scribd this time.
Two days ago I wrote about the court decision holding that the video hosting service Veoh is protected by the ”safe harbor provisions” of the Digital Protection Millennium Act from liability if any of the service’s users upload videos that infringe existing copyrights. One of the reasons Veoh is entitled to those protections is that it uses adequate technological safeguards to police the content its users upload.
So I don’t expect there is much of a chance that a new lawsuit against Scribd, a web site that hosts documents uploaded by its users, will will succeed or even survive a motion to dismiss for failure to state a claim, a procedural device that ends the lawsuit at its very beginning by means of a court determination that even if everything the plaintiff alleges is true she is not entitled to legal relief. As Geek.com reports, the lawsuit alleges copyright infringement by Scribd not because it hosts copyrighted materials but because the software it uses to detect copyrighted materials before they are published on the site allegedly uses copyrighted materials:
A children’s author in Texas has leveled a strange lawsuit against the company, claiming that the company infringes copyright, but not by hosting infringing works on its service.
No, her claim is even weirder: she maintains that Scribd prevents copyrighted material from being placed on the site by copying the text of copyrighted books and other publications into its copyright infringement detection software, which therefore infringes copyright itself!
The claim may not be as weird as Geek.com believes, though it is likely not to survive long. The original legal challenge to the Google Books Project by the Authors Guild and individual authors holding who were identified was premised largely on the contention not that Google was going to make those authors’ copyrighted works available. It wasn’t. It was only going to make those works searchable so that snippets could be brought up by researchers who could thereby identify and by library loan or purchase obtain relevant works they never otherwise would have found without traveling from Palo Alto, California to Ann Arbor, Michigan to Oxford, England. So what was the problem? The authors alleged that the fact Google was copying their works in their entirety to create the database that would yield the snippets constituted copyright infringement.
And in A.V. v. iParadigms, LLC, 544 F. Supp. 2d 473 (E.D. Va. 2008), aff’d in part and remanded, F.3d 630(4th Cir. 2009), plaintiffs were students who alleged that iParadigm’s Turnitin plagiarism detection system — used by schools throughout the country to detect plagiarism committed by students — constituted copyright infringement. Schools that use Turnitin require each student turning in a paper to submit it through Turnitin. Turnitin then compares the paper to its database and prepares a report that rates the similarities of the paper to papers in its database. In addition, Turnitin adds the paper it is rating to the database, thereby constantly growing and increasing the effectiveness of that database.
The students alleged that they owned the copyright in their papers and that IParadigms was infringing those copyrights by copying those papers and using them as part of the Turnitin database. But last March the federal court hearing the lawsuit dismissed it.
There are several interesting points to make about the decision. First, I read the trial court decision (that was later affirmed by the U.S. Court of Appeals for the 4th Circuit) on Scribd (here). Scribd is a tremendous resource for me — a lot of legal documents are not available online, and a lot of valuable ones that are available online are behind paywalls even though they cannot be copyrighted (including a lot of court decisions). Scribd is a solution to this problem, providing a central clearinghouse where lawyers can upload legal documents to make them available to the general public.
The value of resources like Scribd is one of the reasons I find criticisms like that Chris Castle directed at the decision in the Veoh case so maddeningly unhelpful. If one looks at sites like Veoh and Scribd as doing nothing but making available for free works that people would otherwise pay for, then it is much easier to rant and rave that those sites are nothing but distributors of stolen merchandise and to rationalize a stubborn refusal to admit that copyright must be balanced against strong competing interests in free speech and the exchange of ideas. But if you see these sites as profoundly gratifying resources that make the internet the greatest innovation in the history of information technology, the fact that media companies (and even independent writers, artists, and musicians) can readily identify infringing uses that do slip through detection programs does not seem so profoundly troubling. Those copyright owners can quickly employ the DMCA’s notice-and-takedown procedures, which many criticize as too friendly to the copyright holders.
Why would you use copyright to stifle marvelous new innovations? Copyright exists to encourage, not stifle, invention.
So a legal attack on Scribd, even if it is not as “weird” as it might seem on first blush, is something I will scrutinize carefully.
Second, it seems odd that Scribd would be attacked for committing copyright infringement resulting from a mechanism it is employing to minimize copyright infringement by its users and for which it is rewarded by the immunity conferred by the DMCA safe harbor provisions.
Third, a spokesperson for Scribd, as Wired reports, explains that Scribd does not copy works in their entirety as part of its copyright detection system; rather, it “creates a digital fingerprint, or a ‘hash,’ to identify infringing copies.”
Most importantly, even if Scribd did copy the entirety of the copyrighted works only to use those copies to prevent users from uploading and making available to readers those copyrighted works, the decision holding that Turnitin’s similar use of copies copyrighted materials to detect plagiarism is illuminating. The trial court, affirmed in this reasoning by the 4th Circuit Court of Appeals, explained that “iParadigms, through Turnitin, uses the papers for an entirely different purpose [than those the plaintiff did or could], namely, to prevent plagiarism and protect the students’ written works from plagiarism . . . by archiving the students’ works as digital code.” Thus, while the court recognized that iParadigms profits from its use of the student works, the court found that iParadigms’ use of plaintiffs’ works was “highly transformative” because it adds a “further purpose or different character” to the copyrighted works and “provides a substantial public benefit through the network of educational institutions using Turnitin.” Slip op. at 14.
In affirming the trial court’s decision, the 4th Circuit added to this reasoning and described as “clearly misguided” the argument that Turnitin’s use of the plaintiff’s copyrighted papers cannot be considered transformative “because the archiving process does not add anything to the work — Turnitin merely stores the work unaltered and in its entirety”:
The use of a copyrighted work need not alter or augment the work to be transformative in nature. Rather, it can be transformative in function or purpose without altering or actually adding to the original work. See, e.g., Perfect 10, Inc. v. Amazon.com, Inc., 508 F.3d 1146, 1165 (9th Cir. 2007) (concluding that Google’s use of copyrighted images in thumbnail search index was “highly transformative” even though the images themselves were not altered, in that the use served a different function than the images served). [Turnitin's] use of plaintiffs’ works had an entirely different function and purpose than the original works; the fact that there was no substantive alteration to the works does not preclude the use from being transformative in nature.
562 F3d at 639.
So let’s get it straight: what Scribd is doing is of tremendous value to society as a whole. It’s use of copyrighted works to minimize the availability on its site of copyrighted works is entirely different than and in no way diminishes the value of the copyrighted works to the owners of the copyrights. A copyright is not ownership of property like title to a car is — it does not give the owner control over any use of that car the owner doesn’t approve. There are a lot of good reasons for these differences. First, if someone else uses your car, you can’t. If someone else uses your copyrighted work, you can still use it too. If they use it for a use you never would have, what’s your problem? And if that other person’s use is doing a lot of good, why should the law confer on you a power to stop it? (Even your ownership of physical property is limited by restrictions imposed for the social good.) Finally, copyrighted works are works of expression, and we have a constitutional right to free expression. The limitation on copyright imposed by fair use is precisely a means of balancing the copyright holder’s interests against this profound social interest in free expression.
It’s an amazing world. In the name of legal rights that exist to promote progress and innovation, people everywhere are trying to stop revolutionary innovations they plainly don’t realize the value of. One of these days I’ll have to talk about the Google Book Project settlement and the fights raging in connection with it. Some are more legitimate than others. But let’s be clear: Google is trying to make available online for research purposes (not in ways that would displace the markets for the works themselves) the contents of major research libraries from around the world. Doesn’t everyone realize what an amazing and unprecedented advance this is for the life of the mind, for anyone anywhere who ever has had an interest in doing research?
Google’s Library of Babel and its opponents.
Steven Shankland has written a good piece on the proposed settlement of the lawsuits over the Google Library Project; the proposed settlement is “now under review by Judge Denny Chin of the U.S. District Court for the Southern District of New York.”
Under the proposed settlement, the owners of copyrights in books would need to opt out of the project to prevent Google from including those books in its Library database, which is being compiled by scanning the libraries of several major insitutions around the world. As Shankland points out, “that means essentially that Google would be permitted to show content from in-copyright, out-of-print books and sell online copies of those books even without an explicit agreement with the books’ rightsholders.” Copyrighted, out-of-print books constitute approximately 70 percent of the books in the library collections Google is scanning, and that 70 percent includes the vast majority of “orphan works” in those libraries. Orphan works are works whose copyright holders cannot be identified, a common problem because there is no registry of copyrights and the authors of the books are not necessarily the copyright holders. Rather, the copyright holders might include unidentifiable heirs or even corporate entities that have gone through mergers, dissolutions, or other forms of corporate reorganization that make it difficult or impossible to identify the entity that currently owns the copyright.
Nevertheless, some authors continue to oppose the Google Library Project:
“Under the actual law, it is Google’s burden and not yours to ask you for permission and then fairly negotiate terms of contract acceptable to you personally, not jam some monstrosity down your throat,” said Lynn Chu, a literary agent with Writers’ Reps who also called the proposed settlement a “ripoff for authors” in a Wall Street Journal opinion piece.
As a business matter, I don’t understand the view Chu expresses, as I’ve previously written. Why would someone whose work is out-of-print not want that work accessible to the general public? And if that someone wants to keep his work in the obscurity resulting from being out-of-print and available only at some far off insitution’s library, he can always opt out. Chu says that the “actual law” requires Google to ask permission first, not for the copyright holder to deny permission, but the wonderful thing about contracts (and a settlement is a contract) is that they can be a means parties have of altering the rules that govern their relationships in the absence of agreement.
I’ve been a fan of the Google Library Project since it was announced in 2003. It promises to make available for search the collections of many of the greatest libraries in the world. Google will only be able to display brief snippets of works that are in print and under copyright, but even that access will make known to researchers the availability of sources they never otherwise would have been able to find. The Project is one of those endeavors that make the internet and the digitization of information truly revolutionary and magical. It would be a shame if copyright law founded on old technologies and the unfounded knee-jerk reactions of copyright holders (it’s mine, and that means you can’t do anything with it without my permission!) were to end up preventing the realization of revolutionary magic.
Finally, Shankland points out that there is concern over the settlement because it would give Google an advantage over competitors: “Microsoft, Amazon, or the Internet Archive . . . –without their own handy class-action settlement [--] would be have to try to seek such permission in advance from each rightsholder or risk copyright infringement litigation.” But if copyright holders and their representatives are willing to reach this settlement with Google there’s no reason to suppose they wouldn’t with Microsoft, Amazon, or the Internet Archive. Google’s competitive advantage is the result of its initiative and daring in starting the Project in the first place and developing technology (including new scanning technology) to make it truly possible. Advantages gained by daring and initiative should be rewarded by the law, not stymied.
Stop those dangerous . . . er, player pianos!
Copyright legislation throughout history has primarily consisted of congressional efforts to preserve financial interests threatened by new technologies. We are, of course, living through a technological revolution right now, so we are living through copyright wars.
But who knew the 1909 Copyright Act (in effect until the current one was enacted and signed into law in 1976) was a response to the threat posed by . . . yes, PLAYER PIANOS!
Music publishers, who had secured their rights in sheet music, were freaked out at the thought there might be mechanical reproductions of their music they wouldn’t be paid for. As Mike Masnick explains it at Techdirt
The big innovation of the 1909 copyright [Act] was compulsory licensing on mechanical rights. This was put into place for one reason: fear about player pianos and how they would dominate the market and destroy the need for musicians. Within a matter of decades, the player piano market was effectively gone… and yet, these massive changes designed solely to deal with the player piano have stuck around ever since. Now apply that same story to basically every other technological innovation, and that gets you copyright law.
You don’t have to look far to find a current example that proves Mike’s point. Amazon’s Kindle2 ebook hit the market with the capacity to read the electronic texts loaded into it aloud in a computer-generated voice. As afterdawn reports, “the Author’s Guild saw this feature as a“performance” when used and pressured Amazon to allow publishers to decide on an eBook-by-eBook basis whether to enable the feature or not.” Whether this new technology represents a genuine threat to the existing financial interests of publishers and/or authors is pure speculation, but the Author’s Guild is adamant:
We will not . . . surrender our members’ economic rights to Amazon or anyone else. The leap to digital has been brutal for print media generally, and the economics of the transition from print to e-books do not look as promising as many assume. Authors can’t afford to start this transition to digital by abandoning rights.”
Of course, the Authors Guild was the lead plaintiff in the lawsuit directed at shutting down a much vaster and more revolutionary technological advance, the Google Library Project. As I have written, I never understood what good they possibly have been doing themselves if they’d stopped that project. Nor can I understand their efforts to stifle the transition we plainly are going through into electronic books.
But now I know: you see a machine that can reproduce your “property,” and all you can think is you’ve got to stop that machine. Even if it is just a player piano.
How do we promote creativity?
One common theme that runs through my views regarding intellectual property is that there is way too much treatment of intellectual property as the equivalent of real property (that is, land). I can fence off my land and keep everyone off of it. Therefore, too many feel, I can fence off my intellectual property and prevent anyone from doing anything with it that I don’t give them permission to do. One commenter on my post last week regarding Shepard Fairey’s Obama campaign poster manifested this confusion about the differences between real property and intellectual property. I think the authors who didn’t want their books to be accessible for word searches via the Google Library Project did as well.
My greatest knowledge about intellectual property concerns copyright. The first thing to know is that copyright is a relatively recent legal creation and that tall U.S. copyright law exists by virtue of and within the limits of 27 words in Article 1. Section 8 of the U.S. Constitution:
The Congress shall have Power . . . To promote the Progress of Science and useful Arts, by securing for limited Times to Authors and Inventors the exclusive Right to their respective Writings and Discoveries.
In other words, copyright law exists to promote invention and creativity, and to the extent it discourages invention and creativity it is unconstitutional. Works that are genuinely creative in their own right but appropriate copyrighted works (Girl Talk and Shepard Fairey, among many others) therefore have a very strong claim to legitimacy as long as they do not exploit the market created by the original work. Indeed, that’s exactly what the fair use doctrine is intended to allow and is beginning to reflect.
My views are shaped to a considerable degree by my belief that all creativity is grounded in previous work, and that the more leeway the law gives to appropriation the more creativity we will have. Of course there are limits. You cannot entirely rob the artist of the financial profits of his work. But using that first artist’s work in an altered way that creates something people want for reasons entirely different than the reasons they wanted the original work does not rob the first artist of the fruits of his labor. Rather, it allows someone else to sprout new fruit.
Apparently, IBM shares this attitude with respect to inventions it could patent. As Securing Innovation reports:
IBM used the occasion of the recent announcement of its 2008 patent record to introduce plans to help stimulate innovation and economic growth. The company plans to increase by 50% — to more than 3,000 — the number of technical inventions it publishes annually instead of seeking patent protection.
Why? According to IBM’s press release:
Publication of technological information is one means to “promote the Progress of Science and useful Arts” the phrase in the U.S. Constitution giving the Congress the power to enact patent laws. Publication protects inventors from allegations of infringement by placing the intellectual property into the body of prior art. Publications also improve patent quality, since they can be cited by patent offices in limiting the scope of patent applications. Publication also helps spur follow-on innovation that ensures dynamic business growth.
While IBM will continue to seek patents and will protect its intellectual property, its planned increase in publishing inventions will focus on those technology areas that will increase the build out of a new, smarter infrastructure. The evolution of IBM’s policy builds on prior efforts to stimulate innovation by pledging not to assert certain patent rights in the area of open source software, health care, education, the environment, and software interoperability.
Why did I call Google a (former?) “white knight”?
I wrote yesterday that some fear Google’s decision to settle the lawsuit over the Google library project heralds a new era, one in which Google will not be the rich uncle fighting the fights over copyright that others, who cannot afford being engaged in protracted lawsuits, will be unable to fight. The point deserves some further observations.
First, those who represent wealthy corporate interests typically decry the fact that in the U.S. both the party who brings the lawsuit (the plaintiff) and the party who is sued (the defendant) bears its own litigation costs, win or lose. This rule (the American Rule) is in contrast to the English Rule. In the U.K., the party who loses a lawsuit pays the costs of the winner’s lawyers. As a result, there are fewer cases brought by plaintiffs without resources. Corporate interests that advocate for “tort reform” don’t want individuals suing corporations as often as they currently do for things like personal injury and employment discrimination. If plaintiffs had to pay the costs of the defendants’ attorneys in those lawsuits when the plaintiffs lost, far fewer plaintiffs would in fact sue.
But in the copyright arena in these internet days, it generally is wealthy corporate interests that are threatening to sue or suing individual defendants. Because many of these defendants cannot afford to pay for lawyers to fight these threats, they back down. As a result, there is what is called “copyright overclaiming” – that is, copyright holders claim rights they don’t have, threaten legal action (or send DMCA takedown notices), and get what they want even though they are not entitled to it. Sometimes copyright holders can get help (from, for example, the Electronic Frontier Foundation), but in 999 cases out of 1000 they’re left to their own devices, and it usually makes far more sense to back down then to fight. Why pay for a lawyer to fight an expensive lawsuit when, if you lose, you’ll have to pay the far more expensive legal costs incurred by your adversary too?
That’s why, if Fred von Lohman is right and Google is no longer going to fight copyright battles it thinks should be fought on the merits (and not only for short term business advantage), it would be a real loss; it is, in short, why I called Google a “white knight” in the title of yesterday’s post.
Has the Copy-Left lost its white knight?
Google has been a very interesting company to anyone concerned with copyright law. Google has taken on lawsuits raising issues others don’t have the resources to fight over, and Google has been very effective in making good arguments in those cases. Fred von Lohman now wonders if those days are gone:
Late last month, Google announced a settlement in its lawsuit with book publishers and authors over its Google Book Search offering. . . .
The Book Search case is just one of a series of high-stakes lawsuits that Google has taken up in the name of the disruptive innovation that fuels the Internet economy. . . .
Google, assisted by its expensive, top-drawer legal team, has a track record of winning these precedent-setting Internet cases. And by winning, Google sets a precedent that other innovators can rely on, as well. In essence, Google’s legal investments have paid dividends for the entire Internet innovation economy.
Until now. By settling rather than taking the case all the way (many copyright experts thought Google had a good chance of winning), Google has solved its own copyright problem – but not anyone else’s. Without a legal precedent about the copyright status of book scanning, future innovators are left to defend their own copyright lawsuits. In essence, Google has left its former copyright adversaries to maul any competitors that want to follow its lead.