Zachary L. Catanzaro*
Download a PDF version of this article here.
Generative artificial intelligence (AI) systems disrupt longstanding assumptions about creativity, originality, and copyright law. Traditionally, copyright law is premised on an incentive theory—that monopolistic profits motivate human creators to be creative. The theory predicates that, without such protections, human authors would be disincentivized from creating new works in the face of potential free riders upon their creative labors. This framing presumes creativity arises from a human agent with intrinsic intentionality and experiences. Generative AI posits a challenge to copyright’s anthropocentric premises. Generative AI systems autonomously generate novel artifacts devoid of human intentionality, lived context, or desire for artistic fulfillment.
As this technology evolves, the marginal cost of automated production trends toward zero. The resulting oversupply of automated content becomes a perfect substitute for human art in the marketplace. Thus, copyright’s premise of incentivizing artists through profit motives becomes less relevant. Evaluating machine and human works primarily on substitutability or copyright eligibility ignores ontological differences in how creativity arises. If copyright is to continue to presume that humans are exceptional in the realm of creativity, then it follows that moral rights should become the focus of copyright law. Rather than dilute copyright theory to encompass the automated production of new works, we should reinforce protections for intrinsically human virtues—moral rights, like attribution, integrity, and consent. This preserves copyright’s anthropocentrism amidst proliferating machine content. As technology challenges long-held theoretical premises, copyright policy should shift its focus from a sparse and increasingly irrelevant incentive theory to upholding humanistic values against non-human creativity.
Introduction
In June 2023, the music video “At War with the Matrix” featuring “Kanye West” debuted on YouTube.1 However, Kanye did not participate in creating the song or video.2 The Kanye featured is an AI-generated facsimile, with deepfake technology simulating his likeness and a synthesizer mimicking his voice and style.3 Hip-hop artist Big Baby Gandhi and filmmaker Laila Rao created the video using the generative AI tool Midjourney to produce surreal depictions and armies of Kanye clones.4 The video culminates in a scene from The Matrix in which Kanye’s face is superimposed onto Laurence Fishburne’s Morpheus, with AI-dubbed dialogue explaining reality to Keanu Reeves’ Neo.5 This unauthorized AI-generated video provocatively blurs the lines between human creativity and machine artistry.6
The music video took seven days to make and cost$30 dollars to produce.4 In an interview with Yahoo! Finance, Big Baby Gandhi said, “[w]e’re heading towards an arms race of content, where the stakes for attention are escalating, and the content will get more extreme. . . . The economic incentives upholding the media industry will fall apart.”7 He goes on to claim that, “[i]t’s simple supply and demand: when supply goes up, price goes down. AI exponentially increases the supply of high-quality content. Many media professionals will become redundant and lose their jobs. That’s the story of AI in every industry.”7
A few weeks after the release of the AI-generated Kanye West music video, the Screen Actors Guild-American Federation of Television and Radio Artists (“SAG”) announced a general labor strike prompted by concerns over allegedly exploitative working conditions as well as apprehensions about potential displacement of human actors by artificial intelligence technologies.8 The strike was driven in part by concerns that movie studios were seeking irrevocable and permanent assignments of rights of publicity that would allow the use of actors’ images, likenesses, and performances in conjunction with generative AI systems.7 In calling for strike action, SAG aimed to secure enhanced protections for human performers in light of emerging technologies capable of digitally de-aging actors, reshooting performances, or even resurrecting deceased stars.7 At the time of this writing, negotiations between SAG and the studios were still ongoing, with the union underscoring the need to preserve safeguards for “human-created works,” including modifications to an actor’s “voice, likeness or performance,” by means of artificial intelligence.7
The AI Kanye video and SAG strike raise questions about copyright’s incentive theory. Copyright law grants limited monopolies to incentivize human creativity, assuming output and production would decline without them. But AI challenges this by enabling creative works without human authorship. As AI advances, is copyright’s incentive structure still relevant? Can it balance incentives for AI developers and human creators at risk of displacement? Does AI authorship threaten to displace human authorship? Or are we witnessing the emergence of a new medium of expression altogether? After all, “if you can’t tell, does it matter?”9
I. What is Generative AI Art?
Early AI Systems
One of the first known public displays of computer-generated art dates to the 1965 exhibition “Generative Computergraphik,” which showcased the work of German mathematician Georg Nees.10 The field developed relatively slowly until 1973, when Professor Harold Cohen programmed AARON, a set of computer systems designed to produce AI art.11 AARON was intended to evolve into a system that would eventually become “human-like” and capable of the similar cognitive capabilities similar to those used by us to make, understand, and compose images. Its early outputs, however, offered little distinction between characters and the ground or closed and open forms, with simple manipulation of image structures based on programmed syntax rule sets.7 Despite Professor Cohen’s early aspirations for stochastic computational realism, the first AARON versions produced very rough abstract images.7
As Cohen described in his seminal 1973 essay, “concepts are formed on the basis of prior concepts, decisions are made on the basis of feedback from the environment and from the results of previous decisions.”12 Cohen thus believed that the syntactic relationship between physical characteristics of art (form, line, perspective, composition, and so on) could be reduced to a series of deterministic program code.13 For this reason, “the probability is that, if one could identify the starting point for an artist’s whole life’s work, one would find a set of concepts completely formulated if not completely digested, given to him and not initiated by him.”7
Cohen demonstrated this when he programmed AARON with nearly 4,000 rules for the drawing of a realistic human head, in which he defined a series of symbolic relationship between concepts, such as where a nose should be in relationship to a figure’s face.7 In 1980, Professor Cohen made a breakthrough in his study of young children’s drawing behaviors. Applying his earlier developed theory, he realized that he could code redundancies into AARON using a set of pre-existing “core figure[s],” which would assist the system in learning simple strategies for pattern tracing and repetitive composition. This resulted in a marked jump in the “thing-likeness” of AARON’s outputs and an artistic consistency (or style) for AARON.
By 1985, AARON had produced a representation of the Statue of Liberty with enough detail that Professor Cohen successfully submitted the work for an exhibition on the history of the Statue. By constructing objects or concepts and defining them in their relationship to one another, the AARON system began producing expressive works that invoked more of a “human-like” or realistic representation of reality in their style and aesthetic. And by 1992, AARON produced a remarkable portrait of Professor Cohen himself.
Early rules-based art systems like AARON demonstrated that computer programs could autonomously generate original artistic works. However, these early systems were constrained by their reliance on human-coded rules and datasets. While exhibiting the appearance of some creative capacity, their outputs ultimately reflect their programmers’ originality.17 Much like Searle’s Chinese Room thought experiment, these algorithms produced artistic representations without any deeper comprehension of the meaning or significance of their creations.18 While superficially resembling human artistry, the programs themselves lacked true artistic agency or purpose, remaining limited tools for carrying out the creative visions of their developers. It would require an evolutionary leap in AI for computer systems to advance beyond merely executing preset deterministic programming and instead exhibit more flexible, generalizable, and human-like creative abilities.
Machine Learning and Generative AI
In contrast with Professor Harold’s deterministically programmed AARON system, modern generative AI systems use stochastic programming.19 Modern generative models like DALL-E 2, Stable Diffusion, MidJourney, GPT-3, and others employ stochastic machine learning techniques, like neural networks and large language models, trained on massive datasets to find associations and statistical correlations between data points.20 By looking for correlative patterns, these systems create new outputs that reflect the statistical regularities and averages in their training data sets.21 After training on vast datasets, these models synthesize novel outputs like images, audio, and text. While influenced by their training data, the most advanced generative models may exhibit emergent creativity in recombining aggregated representations of syntactic concepts.22 This has profound implications for copyright law.
Machine learning is a multi-step process that starts with the harvesting of large data sets.23 In a typical machine learning model, data is prepared as a training set, with larger data sets producing better results.24 From there, a computer programmer chooses a machine learning model to apply to the data set, and instructs that model to train itself to find syntactic patterns to make predictions using stochastic logic.7 For generative AI models, these data sets are constructed of digital pictures, sounds, movie clips, or text. As it processes information within the data set, the algorithm begins to observe statistical relationships between those points of data.7 This means that the robustness and accuracy of the initial data—the seed set—can have a tremendous influence on the outputs generated.7 Additionally, the systems themselves may have limitations imposed upon them by their programmer or owner. In 2022, Stability AI, for example, made changes to Stable Diffusion Version 2 to prevent the generation of “nude and pornographic output, photorealistic pictures of celebrities, and images that mimic the artwork of specific artists.”25
Users interacting with generative AI art systems start by providing text prompts.26 Using MidJourney, as an example, the system starts by reducing these prompts into discrete “tokens.” These tokens are not parsed for grammar, sentence structure, or semantic meaning.27 MidJourney does not understand what a “dragon” is or who “John Oliver” may be.28 Rather, the system starts “with a field of visual noise, like television static, [used] as a starting point to generate the initial image grids.”26 It then uses algorithms to reduce this noise into an averaged version of human-recognizable images.7 The process lacks any human input beyond the prompt stage; a human user has no superintendence or knowledge of what outputs the system will create at the time of prompting.7 Additional prompt engineering can refine or rework the image, but the user still lacks direct superintendence over the produced outputs.7
A generative AI art system neither reproduces nor transforms existing works, but rather creates aggregated statistical averages of conceptual representations like “dragon” based on its comparison of all representations of these tokenized concepts. Images are then generated using hashed token representations of other images containing “dragon”-like qualities. A large enough data set creates a statistically representative average “dragon” picture, such that the system has “learned” how to make its own representative “dragon.” All without understanding what a “dragon” is. For example, consider DALL-E, which generates images based on text prompts. If a user inputs “John Oliver eating a bag of popcorn with a dragon on his shoulder in the style of Andy Warhol,” DALL-E will output images that likely do not exist in its training data (or anywhere, for that matter). And because each generative AI system uses different data sets, their outputs on a particular prompt will vary.
While influenced by its training, the system recombines disparate concepts into new works of expression, without any human creativity. So these systems exhibit some degree of autonomous creativity in combining novel data points aggregated from the metadata within a particular large data set to independently create a new work of expression. In this way, generative AI art systems produce works not contained within their initial training data. Unlike rules-based AI confined to recombining predefined elements, these models can extrapolate new visions from “learning.” This directly implicates copyright’s incentive theory. If AI can autonomously generate original works without monetary motivations or human involvement, the traditional copyright justification may not apply.
At the prompt refinement stage, a user may reassert superintendence over the generative AI system’s outputs. Consider the previous “John Oliver” “Dragon” “eating popcorn” example. DALL-E’s output better matched Warhol’s style but missed on John Oliver’s likeness. Stability better matched John Oliver’s likeness but missed on Warhol’s style. Using prompt refinement techniques, I can overcome some of the restrictions implemented into each system, such as the bar on producing a work in a particular individual’s style. This allows me to refine the works closer to my intent. Other tools, like Photoshop, allow me to turn the output into raw materials for a new work, no different than how a photographer can heavily edit a picture depicting facts from the real work into a work of individualized expression.
But then a definitional problem arises. If I use over 600 prompts in my prompt engineering, at what threshold does the output become mine, if at all? Why is this different from capturing a photograph and editing it with Photoshop tools, themselves being a different form of algorithmic editing? Where is the line between spell check on the one hand and a service like Grammarly and Chat-GPT on the other? If the Copyright Office finds these boundaries difficult to ascertain, the consuming public will likely find it impossible. This raises several challenges to conducting standard infringement analyses in copyright disputes.31
Therefore, it is increasingly unclear where the boundary between human and algorithmically generated art is. Developments in technology, starting with photography, have and will continue to blur the line between human and machine authorship. As generative AI systems continue to evolve and the boundary between human and machine erodes, copyright theory must evolve to account for algorithmic rather than incentivized human creation. If the end consumer ultimately cannot discern whether a machine created a work or not, then market substitution is all but assured and market displacement risk grows. Examining this market change is critical as copyright adapts to an age of automated, costless machine creation.
II. Copyright in Algorithmically Generated Art
Copyright Basics
The Constitution’s Progress Clause states that “Congress shall have Power … To promote the Progress of Science and useful Arts, by securing for limited Times to Authors and Inventors the exclusive Right to their respective Writings and Discoveries.”32 Pursuant to this Power, Congress has devised a scheme of copyright protection, currently codified pursuant to the Copyright Act of 1976.33 The Copyright Act protects “original works of authorship fixed in any tangible medium of expression,”34 granting to qualifying authors a limited durational monopoly35 to exploit a bundle of exclusive rights in the copyrighted work.36
The Progress Clause is “both a grant of power and a limitation” and Congress “may not overreach the restraints imposed by the stated constitutional purpose.”37 The Supreme Court has said that “[t]he sine qua non of copyright is originality.”38 Originality is a constitutional requirement.39 Originality means that a work is the product of the human mind.40 Artistic originality is not analogous to copyright originality,41 nor does originality require novelty.42 Rather, originality “means only that the work was independently created by the author (as opposed to copied from other works), and that it possesses at least some minimal degree of creativity”43 or “a modicum of creativity.”44 Importantly, the fact that expressive aspects of the work are independently protected does not extend the copyright privilege to the entire work.45 Rather, “copyright protection may extend only to those components of a work that are original to the author.”7 This is reflected under longstanding doctrines like the idea/expression dichotomy, which prohibits the monopolization of ideas under the copyright privilege.46
The Supreme Court has stated that “[t]he economic philosophy behind the clause . . . is the conviction that encouragement of individual effort by personal gain is the best way to advance public welfare.”47 The author’s benefit, however, is clearly a “secondary” consideration.48 “[T]he ultimate aim is, by this incentive, to stimulate artistic creativity for the general public good.”49 As Justice Breyer explained in his dissent in Eldred
The [Progress] Clause authorizes a “tax on readers for the purpose of giving a bounty to writers.” Why? What constitutional purposes does the “bounty” serve? The Constitution itself describes the basic Clause objective as one of “promot[ing] the Progress of Science,” i.e., knowledge and learning. The Clause exists not to “provide a special private benefit,” but “to stimulate artistic creativity for the general public good.” It does so by “motivat[ing] the creative activity of authors” through “the provision of a special reward.” The “reward” is a means, not an end. And that is why the copyright term is limited. It is limited so that its beneficiaries—the public—“will not be permanently deprived of the fruits of an artist’s labors.”50
Thus, under the U.S. Constitution, “the primary objective of copyright law is not to reward the author, but rather to secure for the public the benefits derived from the authors’ labors.”51 And the public benefits twice from copyright: once when the work is created, and later when the work falls into the public domain.
Creating art can be expensive. Before the internet, distributing art was too. Historically, the high costs associated with production and distribution of creative works posed challenges for artists seeking to profit from their labors. In response, copyright law emerged as a means of promoting artistic innovation and dissemination by providing creators with certain exclusive rights over their works for a limited term. Modern Copyright Incentive Theory posits a straightforward exchange between artists and society: in return for producing original works that enrich the cultural landscape, authors are temporally granted bounded monopolies enabling them to profit from their creations. Absent such protections, the theory suggests that artists would lack adequate economic incentive to create, as uncompensated third parties could freely copy and distribute their works. Thus, copyright law aims to remedy market failures stemming from the non-rivalrous nature of artistic goods.
But copyright law never foresaw a post-scarcity marketplace for art. Generative AI art and digital distribution now let machines create, copy, and distribute art for de minimis cost. Processing power and storage are the only limits before a deluge of infinite content. How should Congress respond to the coming artistic singularity? Early cases, Congress, and the Copyright Office focus on art’s human aspects. If originality requires intention in the mind of a human, then it follows that denying machines protection accomplishes the goals of copyright law. But denying copyright to AI art is aesthetic discrimination masquerading as human exceptionalism. Art does not stop being art just because machines make it. Strictly applying Copyright Incentive Theory to AI art is tricky. Congress wants to reward artists when they create, but generative AI enables free creation. Simply ending our inquiry at machine authorship is insufficient to address these larger public policy concerns, yet it is nonetheless a necessary starting point.
Generative AI and Authorship
The Copyright Act does not expressly identify computer programs nor computer-generated works as works of authorship.52 Rather, human authored computer programs are treated as literary works,53 and computer-generated works are treated as audiovisual works.54 Copyright law draws no meaningful distinction between physical and digital copies of a work.55 Digital information is stored on physical hard drives, such that the act of arranging the bits—the ones and zeros comprising that information—is treated no differently than carving a sculpture or painting a painting. All three acts involve the rearrangement of atoms on a physical medium, and copyright law does not require that any rearrangement of physical matter be perceived by a consumer without the aid of a machine.56
The question of whether computers can be authors has received extensive discussion in the literature.57 The outputs of generative AI models are not the product of human authorship.58 So while generative AI systems are capable of producing new (and perhaps novel) works of expression, they do not qualify for protection under the Copyright Act, as they lack human authorship and human originality.59 As a result, any outputs of generative AI tools immediately fall into the public domain.60 It was not until fairly recently that anyone claimed that a computer system was capable of authorship.
The first registration application of a generative-AI-authored work occurred on November 3, 2018, when Stephen Thaler filed a copyright registration application for the image “A Recent Entrance to Paradise,” which was generated by his AI system, “the Creativity Engine.”61 Thaler did not edit or otherwise contribute to the creation of the picture beyond programming the algorithm. The Copyright Office refused to issue a registration on the grounds that the work lacked a human author. The Copyright Office noted in its letter that “copyright law only protects ‘the fruits of intellectual labor’ that ‘are founded in the creative powers of the [human] mind’” and that it would not register works “produced by a machine or mere mechanical process” that operate “without any creative input or intervention from a human author” because, under the statute, “a work must be created by a human being.”62 On August 18, 2023, the District Court for the District of Columbia affirmed the Copyright Office’s determination, holding that human creativity remains “the sine qua non at the core of copyrightability.”63
Contrast Thaler’s claims with those of Damien Riehl and Noah Rubin in 2020, when the duo developed a brute-force algorithm to generate every 8-note, 12-beat melody combination found in Western music theory.64 “Under copyright law, numbers are fact, and under copyright law, facts either have thin copyright, almost no copyright, or no copyright at all,” said Riehl in his TED Talk. “So maybe if these numbers have existed since the beginning of time and we’re just plucking them out, maybe melodies are just math, which is just facts, which is not copyrightable.” To “fix” the algorithm’s resulting work (a requirement of the Copyright Act), the system creates midi-files and saves them onto a hard drive at about 300,000 melodies per second. While their claim has not been tested in Court or at the Copyright Office, the pair hopes to “illustrate that there are a finite number of ways to combine notes to create pop melodies, and these combinations existed before any songwriter actually put them to paper.”65
Their argument is reductive of the abstraction-filtration-comparison test found in the Second Circuit’s decision in Altai.66 Applying the test to a claim of infringement in computer code, the Court held that:
In ascertaining substantial similarity under this approach, a court would first break down the allegedly infringed program into its constituent structural parts. Then, by examining each of these parts for such things as incorporated ideas, expression that is necessarily incidental to those ideas, and elements that are taken from the public domain, a court would then be able to sift out all non-protectable material. Left with a kernel, or possible kernels, of creative expression after following this process of elimination, the court’s last step would be to compare this material with the structure of an allegedly infringing program. The result of this comparison will determine whether the protectable elements of the programs at issue are substantially similar so as to warrant a finding of infringement.7
The pair extend this approach to copyright analysis into its end conclusion. All expression fixed into a material object can be reduced and abstracted to its compositional arrangement of atoms. At what point does random stochastic noise cross the threshold into originality? And if there are only so many arrangements of matter in which a work can be produced from, is anything truly original? Or are we merely discovering facts about the physical world that already exist? If so, that would violate the idea/expression dichotomy.
According to the abstractions-filtration-comparison test, there is some definable point in which random noise crosses into originality. But the boundary is unclear, and, as suggested by the Second Circuit, factually intensive. So, it is more accurate to say that creativity requires intentionality in its constitutive act. Consider Tupper’s self-referential formula:
This formula is mathematically unique because it plots itself. Between a certain k and k+17 on the y-axis and between 0 and 106 on the x-axis, the plot of the formula is itself.67 For k=48584506361897134235820959624942020445814005879832445494830930850619347047088099284506447698655243648499972470249151191104116057391774078569975432657185544205721044573588368182982375413963433822519945219165128434833290513119319995350241375876523926487461339490687013056229581321948111368533953556529085002387509285689269455597428154638651073004910672305893358605254409666435126534936364395712556569593681518433485760526694016125126695142155053955451915378545752575659074054015792900176596796548006442782913148854825991472124850635268663047630, if you plot the equation and look at it between a height of k and k + 17 up the y-axis, it gives the plot of the equation:7
Copyright law would say that both the formula and the expression of the formula itself contained within the graph of the formula are ideas, unprotectable under the idea/expression dichotomy.
But Tupper’s self-referential formula not only plots itself, it plots every 106 × 17 grid of white and black pixels. For example, when k=14452024897089758284794253733719456748127778221515070247971881396854908735682987348888251320905766438178883231976923440016667764749242125128995265907053708020473915320841631792025549005418004768657201699730466383394901601374319715520996181145249781945019068359500510657804325640801197867556863142280259694206254096081665642417367403946384170774537427319606443899923010379398938675025786929455234476319291860957618345432248004921728033349419816206749854472038193939738513848960476759782673313437697051994580681869819330446336774047268864, the plot is:68This image of Pac-Man is not the product of a human mind or human intentionality, but application of mathematical law derived from Tupper’s self-referential formula. Applying the abstraction-filtration-comparison test, it is unclear whether these shapes, as derived, would survive a substantial similarity analysis.
On September 15, 2022, Kristina Kashtanova submitted an application for “Zarya of the Dawn,” a comic book they created with the assistance of generative AI tools.26 Rather than rely solely on the generative AI system to create the final work as Thaler did, Kashtanova used MidJourney to create the raw materials for their work.69 By treating the output as raw materials that they recombined into a new work, Kashtanova was granted a copyright in their comic book as a compilation.70 The Copyright Office has taken the position that, where the output of a generative AI system is used as raw material into new expression and a human author superintends that work into a new compilation of materials, Copyright law permits the human author to claim authorship over the final work.71 So with proper disclosure, a human author, in theory, can claim at least superintendence over the machine’s creativity if they contribute sufficient expressive contributions to that first output.
On September 21, 2022, the Copyright Office received an application for digital artist Jason Allen’s award-winning, two-dimensional work, “Théâtre D’opéra Spatial.”72 Unlike Thaler, who denied any superintendence over the creation of “A Recent Entrance to Paradise,” Allen stated that he used Midjourney and “input numerous revisions and text prompts at least 624 times to arrive at the initial version of the image.”73 Once the initial version was produced, Allen then refined the image using Adobe Photoshop and upscaled it using Gigapixel AI. The Copyright Office initially refused registration on the grounds that Allen’s work “inextricably merged, inseparable contributions” from both Allen and Midjourney.7
In its January 24, 2023 reconsideration letter, the Office again concluded that the work could not be registered without limiting it to Allen’s contributions to the work.7 The Office agreed that the editing in Photoshop showed some creativity on Allen’s part, but that the outputs from Midjourney and Gigapixel AI did not.7 Because Allen sought to register the entire work and not his contributions, the Copyright Office denied registration.7 Allen filed a second request for reconsideration on July 12, 2023.7 Allen argued that the Copyright Office failed to account for his “‘creative input’ into Midjourney, which included ‘enter[ing] a series of prompts, adjust[ing] the scene, select[ing] portions to focus on, and dictat[ing] the tone of the image,’ is ‘on par with that expressed by other types of artists and capable of copyright protection.’”7 Allen also contended that his use of the AI generative output as raw material should be sufficient for him to claim authorship in the final product.7 And he asserted that requiring human applicants to disclose every AI tool used in the creative process would be an unreasonable burden on applicants.7
The Copyright Office denied Allen’s second request for consideration on September 5, 2023.7 Applying the human authorship standard and existing guidance rules, the Copyright Office found Allen’s work “contain[ed] more than a de minimis amount of AI-generated content, which must be disclaimed in an application for registration.”7 The Copyright Office explained:
If all of a work’s “traditional elements of authorship” were produced by a machine, the work lacks human authorship, and the Office will not register it. If, however, a work containing AI-generated material also contains sufficient human authorship to support a claim to copyright, then the Office will register the human’s contributions. In such cases, the applicant must disclose AI-generated content that is “more than de minimis.” Applicants may disclose and exclude such material by placing a brief description of the AI-generated content in the “Limitation of Claim” section on the registration application. The description may be as brief and generic as “[description of content] generated by artificial intelligence.” Applicants may provide additional information in the “Note to CO” field in the online application. Applicants are not required to list the AI tools used in the creation of the work.74
Yet the Copyright Office’s second denial of Allen’s work offers inadequate solutions to the wrong problem.75 The letter’s narrow construction of the Copyright Act encourages inaccurate disclosure to the Copyright Office, as authors may fail to disclose AI use to avoid registration refusal. In the case of failing to disclose public domain materials as grounds for cancellation, the proof was straightforward, as a work’s fixation or publication serve as the evidence of creation. Disclosure of generative AI systems asks for negative proof: that a machine did not in fact author a work or a portion of a work at any time. Proving this negative can be difficult, and AI detection tools offer no solution.76
It also forces difficult line-drawing around the extent of an AI system’s contributions. Does spell check offer more than a de minimis contribution? What about Grammarly’s editorial function? Why allow Photoshop to delete objects using AI detection tools while denying Gigapixel’s upscaling capabilities? The inconsistencies reveal the difficulty in delineating creative collaboration from infringement when AI is involved.
But ultimately, the question of whether a computer can be an author is irrelevant to the purpose of the Copyright Act.77 The goal of American copyright law is to incentivize the creation of works that benefit the public, not reward authors. It does so by “motivating the creative activity of authors” through “the provision of a special reward.”50 But with the boundary between human and machine authorship blurring, we must re-evaluate this means-end goal of copyright policy. If AI systems can produce marketable works, then the “reward” should follow the benefit, regardless of human authorship. Rather than starting and ending the inquiry with authorship, copyright doctrine must evolve to maximize public access to creativity, regardless of who or what is acting as an author. Congress has traditionally viewed authorship as a policy question, not a metaphysical or teleological one. And that policy question historically had been about Congress using copyright as a tool of competition policy.
Generative AI and Fair Use
When considered from a competition policy perspective, the dangers of market substitution come to predominate Getty Images’ copyright infringement lawsuit against Stability AI.78 There, Getty sued over Stability AI’s use of 12 million of Getty’s copyrighted images and associated metadata from its database to train the Stability generative AI system.79 At the heart of Getty’s complaint are concerns about the risk of market substitution for its licensing deals for its compilation of digital pictures.80 In Getty’s copyright infringement complaint, Getty alleges that this compilation of images took “great expense, over the course of nearly three decades” to assemble.81 But copyright law does not reward an author’s “sweat of the brow.”82 The fact that Getty expended significant time, labor, and expense to compile its database of copyrighted images does not render it per se protectable under the Copyright Act.7
Getty’s complaint misstates the computational nature of generative AI and the metadata contained within its own database.83 One of Getty’s allegations point to the wholesale copying of metadata as evidence of direct infringement of its copyrights.84 But metadata is just uncopyrightable facts about the image file.85 As explained in the previous section, modern computer systems algorithmically parse and construct many aspects of a file’s associated metadata. A digital photograph, for example, may automatically generate information about the image creator, keywords specific to the image, captions, titles, comments, or other information.86 These types of metadata typically concern the file structure of the digital file in which the image is stored.87
Data scraping this information and applying machine learning techniques to it enables generative AI to make entirely new, non-derivative versions of existing works. Generative AI tools are not simply copying the works in the database but making statistical observations about the syntax of the metadata itself. It does not learn meaning (semantics) from its use of the database, but structure. In theory, generative AI tools utilize vast datasets of existing works to identify patterns and correlations in the metadata—information about the works such as keywords, captions, titles, style tags, etc. The tools employ statistical methods to discern averages and tendencies about how these metadata elements relate to one another across the dataset. In this way, the tools are not directly copying or deriving from any one specific work, but rather discovering symbolic rules about how the metadata can be recombined in novel ways according to the overall statistics of the dataset.
The tools then apply these rules to generate new metadata pairings and compositions. While the output contains symbolic structures reminiscent of the original dataset, the semantic meaning is emergent rather than copied. The key principles are that the tools do not learn the meaning, only the relationships between structural patterns, as they leverage statistics across the dataset rather than deriving from any singular work. Through this process of discerning and applying abstract rules about metadata composition, the generative AI tools can produce original works that do not directly copy or infringe upon any one existing work.88
So, if Getty cannot claim copyright in its metadata as raw facts, it cannot object to the stochastic averaging of these facts into something new, any more than Rural Telephone Service in Feist could complain about the copying of raw facts contained in its phonebook.89 Like those phonebook listings, metadata provides the raw materials for new expressions. Art builds upon the works of predecessors. Students study masters, learning associations and developments from centuries of practice, eventually recombining discrete data points into unique interpretive styles. Similarly, generative AI systems do not merely capture moments like photographs. They “learn” as does a student who mimics their masters.
Copyright is structured to disseminate this metadata about art to future creatives. Without the reproduction of material objects, the incentive theory argues that insufficient production of copies of expression will prevent future artists from developing artistic study and creating new works of expression. The end goal is not encouraging expressive labors but a specific marketplace for expression, one in which transaction costs are kept low in a marketplace that encourages the free exchange of ideas. For this reason, strictly requiring human authorship is an aesthetic judgment that reads the “sweat of the brow” theory back into conversations about the extent to which machines can possess or exhibit creativity. The deeper concern is, and should be, that costless AI art will displace economic incentives for human creativity, which can in turn harm the very incentives that drive humans to create art.
Within its copyright infringement complaint, Getty offered the following photograph as evidence of direct copying:
Getty points to similarities in the composition and the appearance of a distorted watermark as evidence of the generative AI system’s direct copyright infringement.91 But this ignores that the associated metadata is itself factual in nature and that the Getty Images watermark is not a copyrightable work of expression.92 As noted above, Stability did not slavishly copy the demonstrative photograph. It amalgamated metadata about the indexed photographs contained in the Getty Images database then independently created its own new work based on the calculated averages of what a tokenized representation of a photograph of a soccer game should contain.93 If the picture depicts two players in a contrasting dark and light soccer jersey, it’s because the average photograph of soccer players does.94 Getty’s arguments simply rehash those that failed to overcome Google’s fair use defense in the Google Book Indexing Case, Authors Guild v. HathiTrust.95 Those arguments about substantial similarity lead to the same conclusion—there are only so many ways to depict a soccer game. To the extent that Getty relies on these narrow facts within its complaint, the Court should discount those as scenes a faire or under the merger doctrine.96
Getty’s argument represents an impermissible aesthetic claim regarding AI mimicking a photographic style for soccer players. Copyright law protects specific original works, not general styles or aesthetics.97 Just as Warhol’s estate cannot prohibit others from adopting Warhol’s signature styles, Getty cannot monopolize the marketplace for photographs depicting a soccer game. Generative AI prompts questions around human creativity’s scope, though style itself falls squarely in the public domain. AI may independently reproduce works evoking a given aesthetic (to the extent they have not been limited by their programmers), but this market substitution does not implicate copyright absent actual infringement. Utilitarian copyright assumptions that incentives spur human creation are still challenged. However, copyright was never intended to monopolize stylistic concepts, only particular expressions.98 While the implications of AI art merit examination, copyrightability thresholds remain grounded in specific original works of authorship. Unless AI replicates protected elements rather than uncopyrightable style, copyright law maintains vital boundaries limiting monopolies to discrete creations. This upholds its purpose of promoting creative progress, not rewarding authors. AI expands expression at copyright’s frontiers, but stylistic claims exceed its legal limits.
An objection to this argument is found in my prior example of John Oliver depicted in the style of Andy Warhol. On first appearance, the associated metadata of “John Oliver” does not neatly fit within my prior argument. There are many ways to express a picture of a dragon, there are fewer to depict the British comedian. But the likeness of John Oliver, in a copyright sense, is simply a fact about the physical world—that is, an unprotectable fact.99 However, while the celebrity’s likeness itself may be unprotectable, this example exposes subtle complexities around AI mimicking distinctive stylistic flourishes. While copyright protects specific original works, not general style or aesthetic, the line blurs when an AI model is explicitly trained on a narrow artist’s oeuvre rather than generalized creative concepts. This raises complex questions around derivative works and transformative fair use when the input data and parameters narrow substantially, even if the output differs.100 So a nuanced re-evaluation of the Copyright Incentive Theory is needed in response to AI models mimicking, but not appropriating, highly-specific human styles and expressions.101 If economic incentives prove increasingly irrelevant in an age of machine creativity, copyright’s foundational premises warrant reevaluation. Generative AI’s implications extend beyond specific instances of infringement, calling into question the theoretical basis for copyright itself—the Copyright Incentive Theory.
Incentives and Post-Scarcity
The copyright monopoly represents a quid pro quo—rights holders temporarily control the exploitation of works in exchange for their eventual addition to the public domain.102 The copyright monopoly, limited by traditional hostility toward monopolization, aims to stimulate creativity for public benefit rather than provide private windfalls.7 The dominant American legal theory justifying this “reader’s tax” is the Copyright Incentive Theory, which remedies an identified market failure.103 Copyright makes the reward to the author a secondary consideration as a result.104 It is “intended … to allow the public access to the products of [authors’] genius after the limited period of exclusive control has expired.”105 It does not “provide a special private benefit,”106 but rather exists “to stimulate artistic creativity for the general public good”107 and to promote the creation and dissemination of information.108 Were it not, copyright would be outside the boundaries of Congress’ constitutional authority.109
The Theory offers a remedy to an identified form of market failure. It argues that when things are scarce, they become more subjectively valuable.110 So the theory holds that the scarcer property is, the more valuable it becomes to consumers.111 The scarcity of the capital and labor resources is needed to make a good inference regarding how scarce the property is. Economists call these the “factors of production”—labor, capital, land, and entrepreneurship.112 Each is required, in differing amounts, to produce tangible property.113 And each of the factors is scarce depending on the type of goods being produced.114
Intangibles are different as they lack natural scarcity.115 Expressions are intangibles that are freely copyable. My enjoyment of a book does not diminish your enjoyment or use of the same book.116 For this reason, once expressions are created, they belong to the public.117 Economists call this a “public good.”118 When intangibles become widely available to the public, one market effect is an increase in free riders.119 Why would you pay an author for creating a work of expression when you can freely copy it?120 But if you are not willing to pay, then authors will not be willing to invest their capital and labor into creating new works. The result is market failure.121 So the Copyright Incentive Theory offers artificial scarcity as a way of fixing this market failure.122 Copyright creates artificial scarcity by tying freely copyable intangible expressions to monopolies in scarce material objects.124 It does not protect anyone selling these expressions, but rather uses specific market structures to create artificial scarcity and thus incentivize creative expression.
Scarcity of a thing does not make it intrinsically valuable.125 Value is not intrinsic to things but is instead based on the consumer’s perceived subjective marginal utility.126 Economists offer the diamond-water paradox to explain why.127 Water has greater social utility, but diamonds are more expensive.128 Diamonds have aesthetic and industrial utility, but we die without water. So why are diamonds worth more? If value was determined solely by intrinsic utility, then water would be more expensive. But the value of diamonds and water is a function of the scarcity of the factors of production needed to create and consume a unit of either, not a function of their corresponding social utility. It is generally much harder to locate and mine diamonds and cut them into gems than it is to pump water from the ground or collect it from the rain.129 At the microeconomic level, what matters is not the general utility of a good in the marketplace, but the immediate consumer’s preference for it.130 A diamond commands a much different price to shoppers in New York City’s Diamond District than it does to the dehydrated person dying of thirst in Death Valley.131 That person would pay all the diamonds in the world for a glass of water. Of course, once they have had that first glass, the second and subsequent glasses become worthless—economists call this diminishing marginal utility.
Price and production share a bidirectional causal relationship, each influencing the other according to basic economic principles. In a market of perfect competition, the price of a good or service is found at the intersection of its supply and demand.132 The law of supply holds that as prices rise, producers will produce more of a good.133 Conversely, the law of demand states the opposite for consumers with rising prices of a good or service translating into less consumers.134 Producers find that the invisible hand of the market constrains them.135 But for the consumer and their willingness to come to market, producers would produce into the infinite.136 Yet, the law of marginal utility further constrains them.137 Because intrinsic utility does not define value, a consumer’s subjective marginal preference toward a unit of a thing means that producers will never be able to sell as much product as they would like.138
Perfect competition requires complete, symmetrical information between transacting parties—a rare scenario. Real-world markets are dynamic, with firms deciding optimal output levels. A perfectly competitive firm has but one decision: what output of a good to produce. Both parties to the transaction benefit from exchange in this market, but the consumer is robbed of the creative destructive forces of the market.139 Economists define the benefit to the consumer as the consumer surplus; the benefit to the producer as the producer surplus.140 These together create the total economic benefit to the public realized through free trade and competition. Highly competitive marketplaces are characterized by high amounts of consumer surplus, while oligopolistic and monopolistic markets have low amounts of consumer surplus.141
When a marketplace is in an oligopolistic or monopolistic state, the producer in that marketplace reaps additional producer surplus at the expense of consumers.7 Copyright’s lawful monopoly creates deadweight loss, granting producers temporary monopolies over works to remedy disincentives to create freely-copyable expressions. Copyright policy limitations aim to balance this loss against market failures from lack of incentives. Historically, high production and distribution costs, barriers to entry, and government regulations reinforced copyright’s artificial scarcity.
Economies of scale, and the physical nature of books and other works have historically reinforced copyright’s artificial scarcity.142 For centuries, the high costs of production and distribution needed to scale production accordingly, served as a barrier to entry that helped limit piracy. Governmental regulation, licensure, censorship, and copyright further legitimized the status quo. Yet the Founders saw “monopoly as a two-edged sword”; a necessary evil to increase the public good.143 Because while copyright encourages production of new works, it restricts the dissemination of works once produced.7 The absence of competition translates into higher consumer prices and transaction costs.7 It can be difficult for potential users of copyrighted works to locate owners and strike a bargain.7 And monopolists invariably reap more consumer surplus than necessary.
Copyright’s artificial scarcity is said to drive two separate but complementary incentives: the production of new forms of creative original expression and the cultivation of a marketplace for material objects containing those expressions.7 To the extent this shaping of information market structures reduces the direct and search cost of information dissemination, the copyright regime reinforces the patent system, furthering the goals of the Progress Clause.144 However, much of the economic literature in copyrights makes a fatal error: copyright does not incentivize production of one homogeneous product.145 Each of the Section 106 rights individually and together result in several differing product offerings, sometimes in direct competition with an author’s other existing rights, and sometimes in completely different marketplaces or channels of distribution.7 Copyright does not directly reward creativity or encourage creativity, nor does it reward authors for their labors of creativity.146
While copyright’s stated goal is incentivization, the outcome is the commodification of art.147 The American copyright system developed to solve a specific market failure—the lack of incentives to invest in the expensive capital and labor structures needed to disseminate and distribute information throughout the country. Ultimately, “[t]he possibility of eliciting new production is, and always has been, an essential precondition for American copyright protection.”148 Copyright exists to create a specific monopolistic or oligopolistic market structure with specific congressionally defined boundaries. It seeks to create a market of a few firms engaged in the mass production of works.149
As technology reduces entrenched firm’s marginal costs, Congress has repeatedly rebalanced copyright’s incentives.150 Amendments to the copyright term, scope, licensing, renewals, termination and works for hire doctrine have adjusted the scale toward additional producer surpluses.151 Substantive doctrines like fair use, the idea-expression dichotomy, merger, and others—in theory but perhaps not in practice—have all been held to further unify the Copyright Act and the First Amendment’s dual mandate of advancing American culture and freedom.152 These disparate rules each attempt to limit producer overreach and prevent producers from capturing too much of the consumer surplus while also ensuring that the consumer surplus is generated in the first place.153
Technological obsolescence of historical market barriers routinely threatens this equilibrium. For example, in the late 1960s, Congress became increasingly concerned that reproduction technologies were eliminating the cost disincentives to pirate copyrighted works.154 Concerned that the internet and digital copying technology were driving these costs down (and purportedly to harmonize American and international copyright law), Congress enacted a number of modern reforms to copyright law, under the belief that producers would stop making new works if any consumer could easily and cheaply make a copy of a work.155 The Judiciary Committee report that accompanies the 1976 amendments to the Copyright Act explains:
The history of copyright law has been one of gradual expansion in the types of works accorded protection. . . . [S]cientific discoveries and technological developments have made possible new forms of creative expression that never existed before. In some of these cases the new expressive forms—electronic music, filmstrips, and computer programs, for example—could be regarded as an extension of copyrightable subject matter Congress had already intended to protect, and were thus considered copyrightable from the outset without the need of new legislation.156 94-1476, at 51 (1976); S. Rep. No. 94-473, at 50 (1975).[/efn_note]
The conversations surrounding generative AI do not occur in a vacuum. Many of these issues are not novel from a policy perspective. Painters thought photography was the end of the fine arts. Video killed the radio star. File-sharing was to be the death of the music industry. Congress has amended the Copyright Act on several occasions in response to these and other advancements in technology and the invention of new mediums of expression.157 Digital art itself has existed since the 1960s, and the threat of the unauthorized distribution of copyrighted works has dominated most of the public policy debates since.158 A study of the historical development of the American copyright system and the changing perspectives toward the Incentive Theory show that the Theory is really a post-hoc rationale for copyright. As I discuss in the next section, copyright was, is, and continues to be, a means for advancing specific industrial policies in a manner that is perceived to advance the sciences and useful arts.
III. Beyond Incentives: Copyright as Competition Policy
Mechanical Reproduction in Europe
Expression predates copyright, and copyright was unknown before the printing press.159 According to modern theory, copyright should not serve competition-related goals under the Progress Clause, but the Copyright Incentive Theory in application has always served as a post hoc rationale for copyright.160 The historical development of the American copyright system shows why. Every major technological leap has, in some way, raised the same fundamental structural questions that the development of generative AI tools have. Historically, copyright is a competition policy tool first and foremost, one used by Congress to orient the marketplace toward specific economic goals. The idea of incentivizing authors to create works, while germane, is a wholly modern conceptualization of copyright.
The starting place of our inquiry is found in the information market structures that existed around the advent of mechanical printing. The manual reproduction of books was economically labor and capital intensive before mechanical forms of reproduction.161 Following the European Monastic tradition, monks crafted a book’s raw materials, including the parchment, ink, binding materials, and linen.162 They painstakingly hand-copied each work, a laborious and time-consuming activity.7 This expensive process and economically inefficient exploitation of the factors of production made supply scarce, restricting the dissemination of knowledge throughout Europe.7 In this period, due to high production costs, books were expensive luxury goods, reserved in their limited supply to the religious and political castes.7
This changed with the advent of mechanical printing. The mechanical reproduction of books came to Europe in 1450, when Gutenberg founded his printing press.163 Printing, along with other socio-economic and political pressures, led to a gradual collapse of Europe’s then existing monopolistic information market structures.164 After 1436, the price of a book fell from a week’s wages to less than a day’s, with the average cost falling at 2.4% per annum for nearly a hundred years in the period following.165 By the first decade of the 16th century, it is estimated that some 2 million books were printed in Europe.166 Production grew exponentially, and the continent saw 20 million printings by 1550, and 150 million printings by 1590.7
Advances in production techniques also reduced the overall size and weight of books, making them easier to transport and distribute throughout Europe.69 Falling distribution costs, coupled with larger supplies of works drove literacy rates up, which increased demand for further printings.69 As supply increased, price fell, and literacy rates rose correspondingly in a cyclical feed-back loop.167
The impact of this new technology was an economy of European information marketplaces. The old hand reproduction method of creating reproductions of books created a highly elastic demand curve. Books were a luxury good, reserved for the aristocracy and religious castes of Europe. This began to change with the advent of book printing. As the technology pushed reproduction and distribution costs down, the demand for works trended toward an elastic demand curve. But the benefits of the technology were not perceived as unlimited, and regulation and monopoly soon followed.
Though initially characterized by free market competition, the book publishing industry gradually developed oligopolistic structuring due to inefficient transportation networks and prohibitive barriers to entry.168 Paper-based books were heavy and cumbersome in bulk, constraining dissemination outside local distribution channels.169 Copies spread via transportation links as far-flung printers produced their own print runs of works. High fixed costs and long recoupment horizons compounded barriers to entry.170
Consequently, market division agreements and governmental privileges led to increased cartelization.171 Incumbent printers artificially restricted production runs to maximize monopolistic profits. Technological and infrastructure limitations enabled rent-seeking behaviors that transformed book publishing into a concentrated, anti-competitive industry.172 Political and religious censorship soon followed.173 Between 1469 and 1517, the Venetian Republic granted a series of increasingly draconian monopoly privileges over printing.174 Similar legislation followed in France in 1475 and Germany in 1531.175 In 1476, Caxton founded his printing press at Westminster.176 One-hundred fifty years of English censorship followed.177
The first English copyright privilege was granted to the King’s Printer, Richard Pynson, in 1518, with a series of royal privileges following.178 In the aftermath of the War of the Roses, Henry VIII issued a series of Proclamations consolidating the political and religious power of England within the Crown.179 The first list of prohibited books appeared in 1529, followed by the grant of licenses the year after.180 Henry VIII granted the Stationer’s Company a royal charter, giving them a monopoly over the printing of books from 1557 until 1710, in an effort to further suppress foreign competition and the importation of foreign manufactured books.181
The Stationer’s Company was empowered to inspect printing operations and seize and destroy offending equipment and publications, in service of the Crown’s prerogative (or its own monopoly).7 It further constrained book production to extract additional monopoly rents.182
When the Star Chamber was abolished in 1641, the cap on London printing houses was removed, and by 1660, there were nearly 60 operating within the city.183 However, the Stationer’s Company’s complex licensing systems and territorial arrangements caused prices to rise steadily by as much as 40% in 1635, making books unaffordable for most day laborers.184
After several centuries of abuse, Parliament enacted the Statute of Anne, the first modern copyright statute, in 1709 as a means of reforming perceived market failure and redressing public outcry over the Company’s monopoly abuses.185 In 1735, the booksellers proposed an amendment that would have extended their copyright monopoly to 1756.186 The amendment was defeated on the grounds that it would create a perpetual monopoly.7 Prices began to decline, such that by the late 18th century, the fledgling American printing industry could not compete on price.187 So it was the political desire to restrain printing, not a need to incentivize art, that characterized the early formation of copyright protection at common law. But as printing came to colonial America, a different concern came to predominate copyright theory.
Mechanical Reproduction in the Colonies
The first printing press came to colonial Massachusetts in 1638, when Reverend Jose Glover was expelled from the Church of England as a heretic.188 Glover afterward contracted with a craftsman, Steven Daye, to bring his press to support the newly founded Harvard University.189 Reverend Glover died on the Atlantic crossing, but with his widow Elizabeth’s help, his machine eventually passed into the hands of Harvard, becoming the first printing press operated in colonial America.190
Lawful monopolization followed. The reforms of the Statute of Anne were not extended to the colonies, leaving the colonial government free to restrain the book trade on political and religious grounds.191 Shortly after printing presses were erected in New York and Philadelphia, the General Court of the Bay Colony (the colonial government authority) permitted no other presses to operate for a period of nearly 40 years.192 Any hope of a nascent book industry was quashed, with no major investments or changes to the American book trade for the following century.
As a result, the colonial production and distribution of printed works was difficult and proved prohibitively expensive.193 Insufficient transportation infrastructure and an agrarian-mercantilist economy limited the sale of printed works to major coastal cities.194 The combination of the General Court’s monopoly privileges and high production and labor costs resulted in most domestically consumed books being imported from England and Europe through transatlantic trade.195 The Stamp Act of 1765 and Revenue Act of 1767 further increased the cost of parchment and vellum, which both created a barrier to entry in the publication business and brought increased pressures on the domestic manufacturing capacity of books.69 The Acts also restrained internal domestic commerce amongst the colonists.69 These high labor and capital costs factors directly influenced what works were printed and how they were disseminated to the public.196
In lieu of a free marketplace for literature, a nascent sharing economy for literature and ideas emerged.197 While bookstores existed, most relied on the sale of other goods for income.198 Early novels were expensive, and rural readership primarily relied on subscription and direct salesman to bring works to them.199 This encouraged the development of robust public and private library systems to distribute works.200 Small towns often established at least one library collection, making books accessible and affordable to an emergent class of readers.201 Demand for works outpaced supply, such that an alternative economic system was necessary to satisfy consumer demand.
The colonial distribution system for books that emerged was chiefly decentralized. It consisted of informal networks of friends, laborers, agents, peddlers, bookstore owners whose income came only partially from books, and a few institutions and private individuals who imported European books.202 The most rapid tool for reproduction in this period was localized reprints. Thomas Paine’s 1776 pamphlet Common Sense, for example, was in such high demand that bookstore owners could not keep it in stock.7 Many local printers turned to producing their own unauthorized copies as the pamphlet circulated through the colonies.7 And while logic would presume that the Revolution should have severed the English-American transatlantic book trade and quashed the book industry in this period, some historical research suggests the possibility of an increase in the exchange of political literature during open hostilities.203
After the war, disparities in specialization of labor between England and the United States meant that English producers easily outcompeted efforts at large-scale domestic printing operations.204 Books printed in England were cheaper to purchase in the fledgling United States than domestic works. Lord Sheffield noted that “all school and common books can be sent cheaper from Britain than they can be printed in America, or sent from Ireland.”7 The 1796 Present State of Printing and Bookselling in America noted “[t]he people of North America manufacture their own paper, and in sufficient quantities for home consumption, but the price of labour is still so extremely high, that it seldom answers to print any work there: at least, they have hitherto seldom ventured beyond their own laws, temporary pamphlets, and newspapers, which every State now prints in abundance.”7 Priced out of competition, the nascent domestic American book trade floundered.
In the years preceding the Constitutional Convention, a coalition of domestic publishers, led by lexicographer Noah Webster, for somewhat self-serving reasons, began lobbying state legislatures for copyright laws allegedly to bolster domestic production. All but one state enacted reform prior to the Constitutional Convention, with most modeled after the Statute of Anne. In 1783, the Constitutional Convention concluded “that nothing is more properly a man’s own than the fruit of his study, and that the protection and security of literary property would greatly tend [sic] to encourage genius.”205
But the question of copyright protection was not a major priority at the Convention, and no committee meeting notes survive its debate.206 At least some delegates, including Thomas Jefferson, opposed the idea as furthering monopolies and the aristocracy.7 Yet, in Federalist Paper No. 43, James Madison made his case for a federal copyright scheme, arguing that the public interest in encouraging the spread of knowledge was in harmony with the private property rights of authors.207
The Constitution’s Progress Clause was approved with no recorded debate.208 It grants Congress the power to “promote the Progress of Science … by securing for limited Times to Authors … the exclusive Right to their respective Writings.”7 But in the period between the Constitution’s ratification in 1783 and 1790, it was unclear how authors acquired a copyright under the Progress Clause. In this period, Congress was petitioned by several authors who believed that Congress had to directly grant a copyright monopoly, as was the prerogative of the English monarchs.
Recognizing these issues, George Washington’s State of the Union Address of 1790 called for a federal copyright scheme as a means of advancing education and for improvements in transportation infrastructure to assist in the distribution of knowledge and information throughout the United States.209 Many of the Founding Fathers saw the spread of information and knowledge as an important democratic bulwark against tyranny.210 The United States at this time was predominantly an agrarian economy, so the problem of the day was solving the difficulty in distribution of educational works throughout the United States. The roads, transportation system, labor conditions, and slow communications of the late 18th century created an identified form of market failure, one left unresolved as book publishers were either economically unwilling or unable to sufficiently scale production factors pre-industrialization and compete with the English printers.
The Copyright Act of 1790 attempted to address this perceived market failure, with a grant protecting “maps, charts, books . . . and manuscripts”—the tools of education and science.211 The 1790 Act did not define the term “book,” leading to anything not fitting within the scope being classified as a book in applications.7 This predictably led to non-educational literary works being registered, including hotel registers, form books, circulars, syndicate articles, and compilations of information.212 Further, the resulting confusion and overly punitive formalities found in the 1790 Act led to most authors seeking protection under state rather than federal law.213 The 1790 Act was seen as a failure in this regard.
Washington and Alexander Hamilton saw reliance on English production as a threat to the economic development and recently won independence of the United States. Without the appropriate economic incentives for domestic manufacture, they were greatly concerned with English and European producers pricing fledgling American industrial production out of the domestic and global marketplace. The 1790 Act echoed the Federalists’ platform of economic protectionism, with its scope of protection limited to domestically manufactured works produced by U.S. residents and citizens.214 To further protect domestic manufacture of books, Congress increased book tariffs several times between 1794 and 1800, raising the tariff from 5% to 12.5%.215
The Copyright Act maintained this domestic protectionism in the 1831, 1909, and 1976 revisions.216 Copyright’s domestic protectionism would last until the United States acceded to the Berne Convention in 1983.217 As part of its accession, in 1986, Congress repealed the domestic production requirements from Section 601 of the Copyright Act.218 So, for the better part of the existence of federal copyright protection, the driving concern was protecting domestic reproductions of works, not incentivizing authors to create new art. Congress made the public policy determination that suffering some of the deadweight losses caused by monopolistic market structure was the best means by which to foster and develop the domestic manufacturing of works.
Industrialization
The early 19th century brought new challenges to American information market structures. Prior to American Industrialization, a trip from New York to Chicago in this period could take 6 weeks’ time. Books remained expensive, and the fledgling United States was in significant debt to France for financing its Revolution. Many school teachers had come to rely on libraries and sharing systems to access literature and other education aids. Without sufficient capital reserves or the tax structure to finance education, the federal government saw the copyright monopoly as a means of spurring private investment into the information marketplace.219 Copyright was seen as a solution to this problem.
During the Antebellum period, two technological developments began pushing down the costs of mechanical reproduction and distribution of works. First, the rapid industrialization of the United States introduced commercial mass (re)production. Second, the railroad and the telegraph drastically reduced the cost of information sharing and the distribution of goods. The advent of the telegraph in the 1830s meant that information could be disseminated between states at rapid speed. The building of rail lines caused the journey time between New York and Chicago to fall from 3 weeks in 1830 to 2 days by 1860.220 By 1850, nearly 9,000 miles of railway had been laid.7 Faster information dissemination and distribution of raw materials and consumer goods fed further innovation and market demand, causing the United States to undergo a period of significant increases in productive capacities and economic growth in the pre-war period.7 And most importantly, several new technologies born of the late industrial period challenged existing copyright norms.
Several copyright reforms followed in the period of industrialization, including the protection of music compositions as a category separate from books and simplification of the deposit requirements. Starting with the Copyright Act of 1831, deposit copies were required in the district court that had jurisdiction over the author’s residence, further facilitating the dissemination of knowledge capital on a decentralized basis.221 This had the immediate effect of lowering the public’s search costs in locating information which in turn further cut distribution costs. By tying the deposit to the grant, Congress had created an incentive for authors of works to make them available to the public at known locations. Later, in 1846, the Smithsonian and the Library of Congress were added as depositees.7 It would then later shift, temporarily, to the Patent Office.7
The Supreme Court’s first copyright decision came in the 1834 case Wheaton v. Peters, wherein the Court confronted the reproduction of its own decisions.222 Concerns of a “proper” market structure permeate the case. Wheaton involved a claim of copyright infringement of the Court reporter’s compilations of decisions and accompanying annotations.223 The third reporter, Wheaton, had supplemented his income with these reporters.69 The Wheaton reporters were notoriously expensive, pricing most lawyers (and the public) out of accessing them.69 His successor, Peters, took the cases in the Wheaton reporters and republished them in an abridged volume.69 Wheaton sued.69 In ruling against Wheaton, and out of concern for public harm caused by the monopolization of information, the Court remarked that it “is unanimously of opinion that no reporter has or can have any copyright in the written opinions delivered by this Court, and that the judges thereof cannot confer on any reporter any such right.”224
Photography brought similar market structure concerns. Invented as early as 1816 and popularized around the late 1830s, the technology both spurred consumer demand and military applications during the Civil War.225 During the Civil War, Union and Confederate demand for family portraits and battle scenes resulted in a consumer photography boom.7 Military leaders recognized the strategic implications of the technology on information warfare, with both sides employing photographers to record enemy emplacements, roads, bridges, and railroads. Photography underwent rapid technological innovation in this period, resulting in lowered production costs for cameras and photographs. The invention of the tintype, which was a metal image, and the ambrotype, printed on glass, allowed for mass production of small photographs for consumers.226 The massive popularity of photographs during the war, and to some extent, the works of war-photographer Mathew Brady, led Congress to add protection for photographic works to the Copyright Act in 1865.227 The camera democratized art.228 The ease of labor in capturing reality meant that anyone with the right equipment could do so. But the question of whether a copyright privilege in a photograph served the underlying goals of the Progress Clause would go unanswered for several decades.
The Copyright Act of 1870 was an attempt to modernize the Act in response to these changes in the domestic marketplace. First, it brought needed reform to the deposit requirements, with the Library of Congress again serving as the sole depositee.229 This served the dual purpose of streamlining the registration process and encouraging further development of a centralized storehouse of American culture.230 The formalities were likewise made less punitive to encourage more authors to register and deposit works with the fledgling Copyright Office. This, coupled with another expansion in the protection of works to include paintings, drawings, chromolithographs, statues and statuaries, and models or designs, reflected the growing industrialization and new consumer marketplace within the United States. As industrial capital accumulated, massive social changes to American life occurred. As printing did several centuries before with literature, the industrial commodification of art and advent of new production technologies drove consumer demand which in turn fermented further technological innovation. The “democratization” of art had aligned the industrial capacity of the nation toward these new consumption preferences.
In 1884, the Supreme Court finally took up the question as to whether photography was copyrightable subject-matter in Burrow-Giles v. Sarony.231 In 1882, photographer Sarony had captured a portrait of Oscar Wilde in his magnum opus photo Oscar Wilde No. 18.232 Burrow-Giles Lithographic Company later made unauthorized reproduction lithographs of the work.7 Sarony sued.7 On appeal to the Supreme Court, the court rejected Burrow-Giles’s arguments that a camera results only in a mere mechanical reproduction of nature.233 Rather, the Court held that it was the expressive contributions of the human author to the depiction contained in the photograph that vested authorship in the work.7 The Court recognized, for the first time, that the addition of human originality, in choosing and arranging the composition, lighting, subject-matter, and technical skills in capturing an image, to the capturing of raw data about the world, was sufficient under the Copyright Act.234
The seminal 1908 Supreme Court White-Smith Music Publishing Co. considered whether piano-roll players that performed perforated music copies of copyrighted musical composition infringed on the copyright in music composition.235 The Supreme Court ruled that they did not, as the Copyright Act requires that the work be capable of human perception.236 Congress disagreed, leading to the addition of the compulsory license for “mechanical” embodiments of musical works in the 1909 Amendment,237 and the dispensing of the human perception requirement in the 1976 Amendment.238 In both instances, Congress acted not to protect economic incentives to authors, but to facilitate specific market structures within the industry, which here amounted to concern over the growing cartel of music publishers and their control over the recording of music.239
Digital Media and Beyond
A full account of the impact of radio, motion pictures, and phonorecords is beyond the scope of this paper, but it is sufficient to note that they each prompted Congress to, after industry lobbying, rewrite the rules of the marketplace to favor entrenched market participants to further specific industry-related structural goals. By way of example, consumer access to audiotape recorders and record piracy led to the Sound Recording Amendment of 1971240 and the later Audio Home Recording Act of 1992.241 Photocopying technologies played a part in several exemptions defined in the 1976 amendments.242 The danger of market disruption caused by digital filing sharing was known to Congress in the late 1960s, leading to decades worth of investigation that culminated in the enactment of the DMCA in 2001.243 Perhaps tellingly, the word “incentive” does not appear until 130 pages into the Copyright Office’s seminal 2001 DMCA report; also perhaps telling is that it arises solely in the context of the Copyright Office rejecting arguments that a digital first sale doctrine would further the Copyright Incentive Theory.244
While the Copyright Incentive Theory presumes that financial incentives for authors are the primary aim of copyright law, an alternative perspective is that encouraging the commodification and commercialization of artistic works is aligned with national economic policy goals. From this critical view, copyright is less about rewarding individual creators and more about facilitating the development of arts and culture as an industry that can be monetized, marketed, and controlled. Copyright law establishes mechanisms for art and creativity to be traded as commodities and granted commercial value rather than existing as freely-available public goods. Therefore, the political-economic function of copyright may be the commodification of culture over and above providing incentives to creators.
Generative AI tools, with their costless productive capabilities and ability of their outputs to serve as perfect market substitutes, require a different contextualization. If human authorship is to remain the focus of American copyright law, it can no longer do so based solely on previous conceptions of originality and the strength of the incentive rationale. Rather, the current conversation should shift toward strengthening the moral rights of human authors, which establish protections based on the personal dignity and creative identity embodied in their works, independent of the economic incentives involved. Moving forward, moral rights may provide a legal framework better suited to preserving the value of human creativity itself in the age of generative AI. Congress should respond with stronger protections for integrity and attribution beyond the limited ones currently in place.245
Conclusion
The issue posed by generative AI is not really whether machines can be authors, but rather, whether the existing system of incentives make sense considering this new technology. Congress has repeatedly addressed this question as new forms of creation have disrupted the marketplace. As generative AI systems produce a near-infinite amount of works in a near-post-scarcity marketplace, abundance, market disruption, and the displacement of human-authored works is all but assured. But the Copyright Incentive Theory has always been a post hoc rationalization justifying monopolization, rather than the true basis for copyright law, leaving it ill-equipped to pave a working path going forward. The real public debate over generative AI art centers on moral rights in human authored works and whether data scraping those works to generate new art constitutes a mutilation or distortion under the European tradition. There are non-economic interests around attribution, integrity, and human expression that could justify maintaining the existing copyright structure. As costs vanish, a copyright system based on moral rights and intrinsic motivation may better serve the public interest than a system preoccupied with monopolistic incentives and form an area ripe for additional consideration and scholarship.
Rather than engaging in anthropomorphic debates about computer authorship, copyright policy should focus on strengthening moral rights for human authors. If Congress believes that human art deserves more protection than machine outputs, that there is something unique about human intentionality in creating new expression, then it should say so. This requires rethinking standards around derivative works, transformative fair use, and moral rights. If we accept that copyright still serves the public’s interest in this new marketplace, then the coming artistic singularity calls for an evolved copyright doctrine that incentivizes generative AI’s immense expressive potential for public enrichment, while preserving the integrity of the human works it builds upon. Stronger moral rights protections from generative AI data-scraping tools, requiring better attribution and better protection from algorithmic mutilation, is a start. The other, more radical abolitionist alternative is moving beyond copyright and its incentives as we approach the artistic singularity.