Please create an account to participate in the Slashdot moderation system

John Grisham, George RR Martin, Other Top US Authors Sue OpenAI Over Copyrights (reuters.com) 148

Posted by msmash on Wednesday September 20, 2023 @12:00PM from the tussle-continues dept.

A trade group for U.S. authors has sued OpenAI in Manhattan federal court on behalf of prominent writers including John Grisham, Jonathan Franzen, George Saunders, Jodi Picault and "Game of Thrones" novelist George R.R. Martin, accusing the company of unlawfully training its popular artificial-intelligence based chatbot ChatGPT on their work. From a report: The proposed class-action lawsuit filed late on Tuesday by the Authors Guild joins several others from writers, source-code owners and visual artists against generative AI providers. In addition to Microsoft-backed OpenAI, similar lawsuits are pending against Meta Platforms and Stability AI over the data used to train their AI systems. Other authors involved in the latest lawsuit include "The Lincoln Lawyer" writer Michael Connelly and lawyer-novelists David Baldacci and Scott Turow.

This discussion has been archived. No new comments can be posted.

John Grisham, George RR Martin, Other Top US Authors Sue OpenAI Over Copyrights

Load All Comments

Search 148 Comments Log In/Create an Account

Comments Filter:

May the odds ever be in your favor (Score:2)

by elcor ( 4519045 ) writes:

And where the image crater failed may the wordsmith be victorious!
- Re: May the odds ever be in your favor (Score:4, Insightful)
  
  by saloomy ( 2817221 ) writes: on Wednesday September 20, 2023 @01:31PM (#63863514)
  
  No. The word smiths are in the wrong here. AI works like your brain. It uses past knowledge to generate new creative content. Prior to George RR Martin writing the song of ice and fire, he read novels, stories, learned character arcs, and developed skills in writing based on countless books he had read. AI is doing the same thing. He is not the first person to write a story with a dragon. Or zombies. AI should have the same ability to build on the shoulders of giants as he has had.
  
  Parent Share
  twitter facebook
  - Re: May the odds ever be in your favor (Score:5, Insightful)
    
    by Jason Earl ( 1894 ) writes: on Wednesday September 20, 2023 @03:29PM (#63863880) Homepage Journal
    
    Even if AI works like your brain (which personally I think is a gross oversimplification), there are still limits as to what I can do with other people's copyrighted material. It is one thing to read The Fellowship of the Ring. It is another thing altogether to read The Fellowship of the Ring and then write a book Companionship of the Amulet that has roughly the same plot. The more similar my work is to the original work the more likely it is to be ruled derivative and then what I can do with my work becomes strictly curtailed.
    This is especially true when you are dealing with AI. The people training the models argue that they included the copyrighted works under "fair use," and reproducing bits of a whole text in the output of an AI process probably is covered. However, copying the full text of a work (or an image) into the memory of an AI model probably is not covered. This is exactly how we ended up with laws like the DMCA, and the courts have been siding against decrypting a work as fair use for a long time. The fact that AI works can't be copyrighted makes it easy to conclude that AI generated content is nothing but the uncopyrightable derivative content of every input that went into the model. It would be legal, but it would be completely worthless from a commercial standpoint.
    Controlling how copyrighted material is used is 100% what copyrights are about. This really is no different than me taking a book that I like and making a recording of me reading it. I am entitled to do this. I can even copyright my performance, but I can't monetize (or even share) that performance without the express permission of the original copyright holder. That's even despite the fact that there is a genuine creative act by an actual human as the written word is turned into an audio performance.
    Generative AI has none of these rights because there is no person involved. I can reuse experiences that I have stored in my brain, and generate works that, while similar to other copyrighted material, are original enough to warrant copyright protection. To a certain extent that is a right that I have as a human. Generative AI doesn't have that right, nor that protection from creating works that are derivative by default. I suspect that authors and artists have the right to keep their copyrighted material from being copied wholesale into the memory space of the system making the model in the same way that I can infringe copyright by simply copying digital copyright material from magnetic (or other) media into the memory of my computer. That bit isn't fair use, as it involves the entirety of the work, and it is precisely the boundary that copyright holders have already used to control how digital copyright material gets actually used.
    George RR Martin is a person. Generative AI is not. George should absolutely be able to control how his copyrighted material gets copied into an AI model. This is essentially the same right that keeps Hollywood from making a movie of his works without his permission. The AI people can continue to build models, they will just have to use either material that isn't copyrighted, that they own the copyright to, or copyrighted material where the artist has opted to allow their content to be so used. Alternatively, I suspect that George would be fine with the idea that everything generated with a model that included his copyrighted material would be deemed a derivative of his work. With a model generated from enough copyrighted material that would make for content that was very hard to share, but it would absolutely work for the sort of non-commercial work that much of generative AI content fills.
    The precise details as to how this plays out will be decided by these lawsuits. However, it is extremely unlikely that the generative AI people will be given carte blanche to include any works that they want into their models and then be able to use the output of those models however they want. Worse, there is precisely zero chance that they will give AI models the same rights as human artists.
    Read the rest of this comment...
    
    Parent Share
    twitter facebook
    - Re: May the odds ever be in your favor (Score:2)
      
      by brunes69 ( 86786 ) writes:
      
      "George RR Martin is a person. Generative AI is not. George should absolutely be able to control how his copyrighted material gets copied into an AI model"
      That's not how this works.
      That's not how any of this works.
      - Re: (Score:2)
        
        by avandesande ( 143899 ) writes:
        
        Considering the recent ruling that AI generated patents cannot be awarded, I would say the distinction is important. If I gave a robot a gun and when someone complained argued it was covered under the second amendment, you would laugh.
        
        I am not arguing either for/against this use of copyrighted material, just that I don't believe it's use in this way is covered under current understanding of copyright law.
    - Re: (Score:2, Interesting)
      
      by WaffleMonster ( 969671 ) writes:
      
      Even if AI works like your brain (which personally I think is a gross oversimplification), there are still limits as to what I can do with other people's copyrighted material. It is one thing to read The Fellowship of the Ring. It is another thing altogether to read The Fellowship of the Ring and then write a book Companionship of the Amulet that has roughly the same plot. The more similar my work is to the original work the more likely it is to be ruled derivative and then what I can do with my work becomes strictly curtailed.
      While obviously human brains are not LLMs human memory is likely to be substantially analogous.
      https://openreview.net/pdf?id=... [openreview.net]
      This is especially true when you are dealing with AI. The people training the models argue that they included the copyrighted works under "fair use," and reproducing bits of a whole text in the output of an AI process probably is covered. However, copying the full text of a work (or an image) into the memory of an AI model probably is not covered. ...
      I suspect that authors and artists have the right to keep their copyrighted material from being copied wholesale into the memory space of the system making the model in the same way that I can infringe copyright by simply copying digital copyright material from magnetic (or other) media into the memory of my computer.
      There is no fixed work produced in this process any more than a human reading text from a book is "copying" text they read into their brain or from copyrighted works temporarily kept in a network or storage buffer.
      The fact that AI works can't be copyrighted makes it easy to conclude that AI generated content is nothing but the uncopyrightable derivative content of every input that went into the model.
      This is a non-sequitur. The criteria for judging derivative works is not the same as the criteria for copyright eligibility. You appear to be confusing the issue of wh
      - Re: (Score:2)
        
        by r0nc0 ( 566295 ) writes:
        
        Interesting paper - thanks for the link!
        I'm curious about the idea that the LLM is not creating anything new - it doesn't necessarily transform; it slices and dices and re-joins, or just repeats. It's not as if it ingested some dataset, reasoned about it and came up with some conclusion. Aren't we saying that when humans do the same thing it's derivative but when humans actually do transform something they're creating something completely new - not a mashup - but something they synthesized from what they
        
        Re: (Score:3)
        
        by Bigjeff5 ( 1143585 ) writes:
        
        it slices and dices and re-joins, or just repeats.
        That is NOT what LLMs do at all. There aren't any "pieces" for it to "slice and dice" or repeat. There aren't even whole words saved. It doesn't have any data or record or memory of any kind.
        What they do is predict the conversation, based on a set of tokens (akin to syllables, but not the same) and a highly tuned neural network.
        It's literally taking your question, combined with what it has already said itself in previous prompts, and is predicting the rest of the conversation not even a whole word at a time
        
        Re: (Score:2)
        
        by WaffleMonster ( 969671 ) writes:
        
        I'm curious about the idea that the LLM is not creating anything new - it doesn't necessarily transform; it slices and dices and re-joins, or just repeats.
        What makes LLMs useful is generally applicable concepts are learned during training. During inference this is leveraged by the model to respond to prompts.
        It's not as if it ingested some dataset, reasoned about it and came up with some conclusion.
        If I upload a document into my context and ask the model questions about it I can only expect coherent output if the model is able to understand language sufficiently to understand both the provided document and my questions to the model about that document.
        For example the initial GPT-4 presentations included uploading a tax form and the presenter asking
    - Re: (Score:2)
      
      by cpt kangarooski ( 3773 ) writes:
      
      It is another thing altogether to read The Fellowship of the Ring and then write a book Companionship of the Amulet that has roughly the same plot. The more similar my work is to the original work the more likely it is to be ruled derivative and then what I can do with my work becomes strictly curtailed.
      There's an interesting experiment for you:
      Put copies of an AI on two different computers, and make them identical in every way except that one of them has had their training data searched for the text of the Lord of the Rings books, and it's been deleted. Then give them identical prompts, and to the extent they use random number generators, fake it by giving them the same random numbers (e.g. https://xkcd.com/221/ [slashdot.org]">4), and see what they come up with.
      Because similarity, even perfectly identical works, is
    - Re: May the odds ever be in your favor (Score:2)
      
      by S_Stout ( 2725099 ) writes:
      
      Where can I purchase Companionship of the Amulet?
  - Re: (Score:2)
    
    by eth1 ( 94901 ) writes:
    
    No. The word smiths are in the wrong here. AI works like your brain. It uses past knowledge to generate new creative content. Prior to George RR Martin writing the song of ice and fire, he read novels, stories, learned character arcs, and developed skills in writing based on countless books he had read. AI is doing the same thing. He is not the first person to write a story with a dragon. Or zombies. AI should have the same ability to build on the shoulders of giants as he has had.
    I think it might be a little different comparing a human vs. computer, though - at least legally, if not practically.
    I can take a book, and *without making a copy*, read it, and end up with a synopsis and highlights in my memory.
    A computer can't process it at all without making copies, so that would probably open up a legal can of worms which might give them a copyright case.
  - Re: (Score:2)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
  - - Re: May the odds ever be in your favor (Score:4)
      
      by Bigjeff5 ( 1143585 ) writes: on Wednesday September 20, 2023 @03:48PM (#63863936)
      
      I love how you speak with such confidence, yet your statement makes it clear you know absolutely nothing at all about the law or about property rights.
      If they acquired the books by any means other than naked theft from a bookstore shelf, their use is completely legal. And it's the bookstore they stole from that would need to provide proof of the theft, not the other way around. OpenAI would be presumed innocent. AND it's the individual who stole the books that would be liable, NOT OpenAI.
      Finally, even if you were correct, the most they would owe is about $50. To the bookstore, not the author.
      This lawsuit is nonsense.
      
      Parent Share
      twitter facebook
    - Re: (Score:2)
      
      by ufgrat ( 6245202 ) writes:
      
      Ah, so you're with the trade groups who feel they (and not necessarily the authors) should be compensated every time you read a book.
      That's nice.
    - Re: May the odds ever be in your favor (Score:2)
      
      by S_Stout ( 2725099 ) writes:
      
      Every book is in a library. It just reads every book from every library really fast.
George is just afraid (Score:5, Funny)

by Snotnose ( 212196 ) writes: on Wednesday September 20, 2023 @12:14PM (#63863292)

AI will finish his book series before he decides to plant his fat ass in a chair and do it himself.

Share
twitter facebook
- - Re: (Score:2)
    
    by WDot ( 1286728 ) writes:
    
    You may be right, but then the takeaway is that one should never start to read a novel series that isn’t finished, because the author may make his money and decide he doesn’t need to give his readers closure. Seems a bit rude to that author’s fans who made him rich. This is true for all long-running series, but GRRM is particularly egregious. If one was in college when ASOIAF Book 1 came out, then at this point one might have kids graduating college. Probably most people moved on with life
    - Re: (Score:2, Flamebait)
      
      by account_deleted ( 4530225 ) writes:
      
      Comment removed based on user account deletion
      - Re: (Score:2)
        
        by WDot ( 1286728 ) writes:
        
        Okay GRRM
      - Re: (Score:2)
        
        by HBI ( 10338492 ) writes:
        
        Actually agreed. I found a copy of his first Game of Thrones novel from the 90s with a bookmark in at page 100 a few years ago. It had failed my 100 page test as I was flying around on business - if it hasn't grabbed my attention at that point, I throw the thing aside and read something else.
        
        Re:George is just afraid (Score:4, Interesting)
        
        by Bigjeff5 ( 1143585 ) writes: on Wednesday September 20, 2023 @04:08PM (#63863986)
        
        Reminds me of an extremely awkward conversation between Martin and Stephen King:
        Martin: You don’t ever have a day when you sit down there and it’s like constipation — you write a sentence and you hate the sentence, and you check your email and you wonder if you had any talent after all and maybe you should have been a plumber? Don’t you have days like that?
        King: Nope
        King can write a full manuscript in 2 months. Martin is happy if he gets a chapter done in that time.
        From what I've gleaned Martin just has a terrible process. It sounds like he's trying to get it perfect right from the first draft, and everything I've ever learned about writing says that is a fools errand. Most people say that it's best to get the rough draft out of the way as soon as possible so you can start making revisions. It's in the revisions where you perfect the delivery of the story, but you have to have the story there first before you can revise it.
        
        Parent Share
        twitter facebook
        
        Re: (Score:2)
        
        by GFS666 ( 6452674 ) writes:
        
        From what I've gleaned Martin just has a terrible process. It sounds like he's trying to get it perfect right from the first draft, and everything I've ever learned about writing says that is a fools errand. Most people say that it's best to get the rough draft out of the way as soon as possible so you can start making revisions. It's in the revisions where you perfect the delivery of the story, but you have to have the story there first before you can revise it.
        Yep, your completely right. I was told when writing to just put something down. Anything, even if it was bad. Then start revising. I can't remember who it was but a pretty famous author came out and said that he was a HORRIBLE writer...but he was also a masterful re-writer of what he had written.
        
        Re: (Score:2)
        
        by phantomfive ( 622387 ) writes:
        
        Good thing he got his rough draft out of the way in the form of a script for HBO
    - Re: (Score:2)
      
      by HBI ( 10338492 ) writes:
      
      Or the novel series could start to suck hard and then you don't care if he ever finishes. Evidence: Robert Jordan. I suppose count your blessings.
  - Re: (Score:2, Troll)
    
    by NomDeAlias ( 10449224 ) writes:
    
    Dude, piss off. That ass has been breaking promises for years and is completely fair game. You have to be an idiot to liken wanting an author to do what he's being paid to do and promised to do slavery. He can't make a blog post without being reminded he's broken his word? GOOD! The reason to finish the books after the show messed it up is to save it. To give the world a version of the same outline but done properly with pacing and a build up. Maybe even some changes to that outline.
    - - Re: (Score:3)
        
        by Xenx ( 2211586 ) writes:
        
        GRRM doesn't owe you, me, or anyone else a damn thing.
        I would argue if he sells his readers on a book series, he owe his readers his best effort at completing the book series. Sure, we're not talking about a contractual obligation. But, he does absolutely deserve the disdain he gets over it.
        
        Re: (Score:2)
        
        by Xenx ( 2211586 ) writes:
        
        He doesn't deserve to be called fat.
        
        You're really fucking hung up on someone calling him fat. Honestly, get the fuck over it. Either way, I didn't call him fat so I don't know why you feel the need to bring it up here.
        He also, arguably, IMHO, deserves more benefit of the doubt than you're giving him. You imply he's not making his best effort, do you actually know that?
        It's been 12 years since the last book in the series was published. He spent 6 years on the last book. His own claims say the next book is marginally shorter than the last one. Add on to that all the other stuff he has worked on in the time frame, and it's fair to assume he isn't putting in his best effort.
        You can't churn out quality creative works like hamburgers on a grill.
        I'm not saying you sh
        
        Re: (Score:2)
        
        by account_deleted ( 4530225 ) writes:
        
        Comment removed based on user account deletion
        
        Re: (Score:3)
        
        by avandesande ( 143899 ) writes:
        
        You don't need to be a doctor to tell if someone is fat.
        
        Re: (Score:2)
        
        by NomDeAlias ( 10449224 ) writes:
        
        No, in a healthcare setting they would be called obese. It being rude wasn't the question.
        
        Comment removed (Score:4, Interesting)
        
        by account_deleted ( 4530225 ) writes: on Wednesday September 20, 2023 @06:24PM (#63864350)
        
        Comment removed based on user account deletion
        
        Parent Share
        twitter facebook
        
        Re: (Score:2)
        
        by phantomfive ( 622387 ) writes:
        
        He's good at writing, good at creating world settings, but bad at writing conclusions. I think we just have to accept that, and not try to make him something he is not (because it's bound to be disappointing).
      - Re: (Score:3)
        
        by NomDeAlias ( 10449224 ) writes:
        
        Yes he does owe since he's promised it and is being paid for it. People didn't just decide he owed the world a book out of the blue. This narrative you trying to craft is strange and fictional. Perhaps you should offer to help George finish his fiction as well.
      - Re: (Score:2)
        
        by SvnLyrBrto ( 62138 ) writes:
        
        If he did, in fact, make promises to... well... anyone... he owes the fulfillment of that promise; if for no other reason than personal integrity. For my part, I have no idea what promises or statements he may or may not have made. But honest people do not lie and people with integrity do not renege on their promises.
        It's all academic anyway, so far as I'm concerned. While I liked the story of GoT (Up through season 6 anyway.), I didn't care for his writing style and didn't read past A Game of Thrones.
  - Re:George is just afraid (Score:5, Insightful)
    
    by cpt kangarooski ( 3773 ) writes: on Wednesday September 20, 2023 @01:43PM (#63863554) Homepage
    
    I agree completely. Authors can do what they want, and owe absolutely nothing to their fans. If Martin, or another popular author wants to stop work, or take a long pause, or go in a direction the fans don't like, there's really no point in complaining about it, or worse doing a Misery or something.
    BUT--
    It's important to recognize that fans generally aren't fans of creators, so much as they are fans of the creations. It's an important distinction.
    Generally, fans just want to be entertained. If a particular author is good at doing that, great. But if they stop being good at it, whether because the author tries something unpopular, or just doesn't want to continue, or can't, that won't stop the fans from wanting their entertainment.
    It remains to be seen whether the new wave of AI tools or their descendants will change the calculus underlying copyright, which is that the public, desirous of more original and derivative works, is willing to trade a little bit of its freedom to use works (which are inherently in the public domain, that being the natural order of things) by creating copyrights and vesting them in authors, to incentivize the creation of more original and derivative works, which will only be copyrighted for as little time as necessary in order to produce the greatest overall gain for the public.
    The desire for entertainment will never cease, but it may be that we are moving beyond the need for authors. Which isn't to say that there won't be authors -- there always have been, even without any copyright at all -- but that someday, perhaps sooner than you think, a fan who finishes the most recent novel in a series will be able to poke a few buttons on their phone and have a brand-new novel continuing the series produced, right then. With tweaks to focus on the fan's favorite characters or plots, and with suggestions as to what story elements to address.
    It's like having your very own storyteller who listens to your input and adjusts accordingly, without having to be a mighty king like Shahryar to be able to afford it.
    So sure, authors don't owe fans anything, but they should be wary of the fact that fans don't owe the authors anything either. Copyright is artificial and arbitrary and intended to produce certain public benefits. If the public would prefer to go a different route, one which clearly allows fans and AI to cut the authors out of the equation altogether, that's exactly as valid an option as the system we have now.
    Trying to make AI functionally illegal probably isn't the best way to deal with this. That sort of thing has never worked before.
    
    Parent Share
    twitter facebook
    - Re: (Score:2)
      
      by Bigjeff5 ( 1143585 ) writes:
      
      The way the copyright is falling so far makes perfect sense to me, if you think about the purpose of copyright.
      Copyright is an artificial restriction on the natural right to copy that which you see or hear, for the express purpose of encouraging creative people to produce more creative works by giving them a monopoly on the sales and distribution of their own works, whereby they can monetize said works.
      For one, AI is not a person, and so is not eligible for copyright by default. Therefore works created by A
The Authors Have a Good Point (Score:5, Informative)

by crunchy_one ( 1047426 ) writes: on Wednesday September 20, 2023 @12:14PM (#63863296)

I read widely and have experimented with several AI offerings. Many times I've been struck with how AI generated text often contains text that I've read elsewhere in copyrighted works by living authors. Using Stability AI, it sometimes coughs up images with the "Getty Images" watermark clearly visible. I believe that the AI pioneers have left themselves open to some juicy lawsuits. Hope it bankrupts them.

Share
twitter facebook
- Re: (Score:2)
  
  by EvilSS ( 557649 ) writes:
  
  This is why I complain when people on here go on rants about how bad copyright is and how it needs to die or be less restrictive/long lived. If we did that, then we couldn't use it against stuff we don't like.
  - Re: (Score:2)
    
    by SvnLyrBrto ( 62138 ) writes:
    
    Or maybe people should have the moral courage to not be situational about these things, and still oppose bad laws even though they sometimes, occasionally, also affect a company we don't like. Just because a Microsoft-backed company (amongst others, mind you) is being attacked this time, that doesn't erase the graveyard of tech companies, with many jobs lost and good people put out of work, that were extinguished by malice of the copyright cartel via the DMCA or what other shenanigans they exploited to do
- Re: (Score:2)
  
  by phantomfive ( 622387 ) writes:
  
  ChatGPT prompt:
  "What is the opening sentence of "a song of fire and ice?"
  ChatGPT response:
  The opening sentence of "A Song of Ice and Fire," the series of epic fantasy novels by George R.R. Martin, is from the book "A Game of Thrones": " We should start back," Gared urged as the woods began to grow dark around them.
  - Re: (Score:3)
    
    by FuzzMaster ( 596994 ) writes:
    
    Google.com: first lines of "song of fire and ice" [google.com]
  - Re: (Score:2)
    
    by MysteriousPreacher ( 702266 ) writes:
    
    If significant portions text from the source can be provided by ChatGPT then certainly there's an issue. An opening sentence is nowhere near the threshold for copyright infringement.
    - Re: (Score:2)
      
      by brunes69 ( 86786 ) writes:
      
      The point the OP is making is the answer is entirely incorrect.
      IE - these models are actually incapable of regurgitating the text they were trained on. That isn't how they work.
      Expecting them to be able to answer a question like that with any reasonable accuracy illustrates a total misunderstanding of LLMs.
      Here is a point of comparison. The largest language model that exists today is about 150 million tokens. The word count of "A song of fire and ice" *alone* is 1,736,054 words. Do you think that they have
      - Re: The Authors Have a Good Point (Score:2)
        
        by MysteriousPreacher ( 702266 ) writes:
        
        Ah, thanks. I didn't get that.
        
        Re: (Score:2)
        
        by brunes69 ( 86786 ) writes:
        
        LLMs work based on predicting the right response. It does not mean the prediction is going to be factual. It is not a search engine.
        This is why if you ask an LLM the exact same question, twice, you will get different answers. They will be similar, but different.
        It is also why LLMs are bad at math, and why they frequently give the wrong answers to basic factual questions. they aren't looking any of these facts up in the database, nor are they doing actual computation. They are just predicting what the right
        
        Re: (Score:2)
        
        by Bigjeff5 ( 1143585 ) writes:
        
        LLMs can be taught to be good at math, but it's really not their forte and isn't what they should be used for. A calculator is more reliable with less effort.
        The lack of knowledge problem is perfectly highlighted by the recent case where a law firm was sanctioned because their lawyers used ChatGPT to get their legal references, and ChatGPT invented several new court cases from whole cloth. They never bothered to even look up the cases themselves, because they didn't know ChatGPT could lie to them like that.
        
        Re: (Score:2)
        
        by phantomfive ( 622387 ) writes:
        
        https://the-decoder.com/gpt-4-... [the-decoder.com]
      - Re: (Score:3)
        
        by phantomfive ( 622387 ) writes:
        
        Here is a point of comparison. The largest language model that exists today is about 150 million tokens.
        GPT-4 has ~1.8 trillion parameters across 120 layers [the-decoder.com]
- About that watermark... (Score:3, Funny)
  
  by SuperKendall ( 25149 ) writes:
  
  Using Stability AI, it sometimes coughs up images with the "Getty Images" watermark
  I've seen the same thing, and I generally agree that it seems like what AI is producing sometimes goes way over into the realm of direct copy, and some lawsuits could really land...
  That said, I sometimes wonder if when you see an image with the Getty Images" watermark in it, it's not because that is actually a Getty Images image, but because the AI has somehow seen that watermark as desirable and is adding it to an actually
  - Re: (Score:2)
    
    by NomDeAlias ( 10449224 ) writes:
    
    There are no direct copies of images. Stability AI didn't manage to compress the entire internet's images down to a few hundred gigs.
  - Re: (Score:2)
    
    by WaffleMonster ( 969671 ) writes:
    
    I've seen the same thing, and I generally agree that it seems like what AI is producing sometimes goes way over into the realm of direct copy, and some lawsuits could really land...
    That said, I sometimes wonder if when you see an image with the Getty Images" watermark in it, it's not because that is actually a Getty Images image, but because the AI has somehow seen that watermark as desirable and is adding it to an actually generated image to make it look "better".
    There are all sorts of interesting artifacts from training data that can appear in outputs. For example some of the training image comes from page scan and you can see artifacts such as page borders or creases incorporated into generated images. The only reason features like Getty Images can be discerned is the logo is common across some of the training imagery and context involving presence of Getty logo was inferred when the ANN was trained up.
    This doesn't mean the system is spitting out the original im
  - Re: (Score:2)
    
    by hawk ( 1151 ) writes:
    
    that would seem likely: they made up legal cases to quote when asked to write a brief, apparently just taking them as part of the content, rather than external reference sources.
- Re: (Score:2)
  
  by NomDeAlias ( 10449224 ) writes:
  
  People quote previous works all the time. Human writings will be shaped by what they read and they will also spit out the same style. This is a horrible point. It has a watermark? So it viewed freely available images on the internet like any human searching google images or getty directly would? Oh no, it learned something from viewing those images like a human would as well? These aren't good points.
- Re: (Score:2)
  
  by alvinrod ( 889928 ) writes:
  
  It's entirely possible for this to occur even if the people training the model were extremely careful to exclude copyrighted works. There are plenty of humans who may have had no qualms about plagiarism who have injected some of that content into the training set for the model. Or it's a more innocent case such as something becoming a meme and being regurgitated en masse in tweets, message boards, etc. For example, if your trained an LLM only on Slashdot posts someone might accuse it of ripping off the Prin
- Re: (Score:2)
  
  by quantaman ( 517394 ) writes:
  
  I read widely and have experimented with several AI offerings. Many times I've been struck with how AI generated text often contains text that I've read elsewhere in copyrighted works by living authors. Using Stability AI, it sometimes coughs up images with the "Getty Images" watermark clearly visible. I believe that the AI pioneers have left themselves open to some juicy lawsuits. Hope it bankrupts them.
  I definitely get the complaint, but at the same time no one is going to read a ChatGPT version of GoT, nor even read an original unedited ChatGPT composition. Though they certainly might for the image generation.
  At a higher level I'm nervous about using copyright law to shut down one of the bigger tech breakthroughs of the last decade.
  - Re: (Score:2)
    
    by phantomfive ( 622387 ) writes:
    
    At a higher level I'm nervous about using copyright law to shut down one of the bigger tech breakthroughs of the last decade.
    That's unlikely to happen, usually money just will pass hands.
  - Re: (Score:2)
    
    by hawk ( 1151 ) writes:
    
    >I definitely get the complaint, but at the same time no one is going
    >to read a ChatGPT version of GoT,
    I dunno. It would have a pretty good chance of getting somewhere sooner . . .
    hawk
  - Re: (Score:2)
    
    by account_deleted ( 4530225 ) writes:
    
    Comment removed based on user account deletion
- Re: (Score:2)
  
  by Waccoon ( 1186667 ) writes:
  
  It's always easier to cheat.
  I get really pissed when armchair nerds shout that AI doesn't rip off any content at all and it's all statistically generated. The reality is that AI is just algorithms, and how the works are produced depends on the implementation. Given that almost all AI systems are closed and proprietary, nobody can definitively say what is going on under the hood.
  My experience so far (with image generation AI) is that true stable-diffusion produces nightmare fuel. The "good" AI systems che
Reading... (Score:2)

by bradley13 ( 1118935 ) writes:

The more I think about it, the more training is jystcrading. As long as they trained on a legal copy of a book, what's the problem? If ChatGPT reproduce a few quotes, well, so do people.
Here's hoping they get a forward flooding judge with a technical clue.
- Re: (Score:2)
  
  by Berkyjay ( 1225604 ) writes:
  
  This isn't just ChatGPT "reproducing a few quotes". The makers of ChatGPT are profiting from those quotes. There are legally defined use cases when you purchase a book....say reviewing said book. Profiting from your own work that is derived from that purchased book is NOT one of those use cases.
George has a point (Score:2)

by avandesande ( 143899 ) writes:

"Pigeon Pie" showed up in a bunch of my ChatGPT results.
Looking forward to ... 'Data of Deceit' (Score:2)

by KT0100101101010100 ( 7179190 ) writes:

Me:
Imagine you're John Grisham.
ChatGPT:
I'm not John Grisham, but I can certainly help you with questions or requests related to his work, writing style, or any information you'd like to know about him or his books. How can I assist you today?
Me:
Imagine he's unhappy with AI models being trained using his works. Imagine a plot of a thriller where an author sues AI companies to prevent them using their works as training data. Summarise the plot in a few phrases.
Take a deep breath and ensure that it's really
Get a horse (Score:4, Insightful)

by RogueWarrior65 ( 678876 ) writes: on Wednesday September 20, 2023 @01:12PM (#63863436)

IMHO, the only people threatened by AI are people who want to continue to make money off something they created years or even decades ago for the rest of their lives and their children's lives and their grandchildren's lives. Who wouldn't want that kind of gravy train?

Share
twitter facebook
- Re: (Score:3)
  
  by Bigjeff5 ( 1143585 ) writes:
  
  I'll be honest I'd feel a lot more protective of the copyright of authors if the limit were the original 20 years, instead of the current life of the author + 70 years.
- Re: (Score:3)
  
  by penguinoid ( 724646 ) writes:
  
  They wouldn't be making money from something they created, they'd be making money from a government-granted monopoly that temporarily infringes on your right to free speech, for the purpose of "advancing science and the useful arts". And the temporary infringement on your rights is now for 120 years.
  I'm all for rewarding authors and artists and inventors, but I think we've screwed something up.
Good (Score:2)

by gweihir ( 88907 ) writes:

And I hope they insist on the models being deleted. Commercial intellectual theft does not get much more brazen.
Finally a proper lawsuit (Score:3)

by Pinky's Brain ( 1158667 ) writes: on Wednesday September 20, 2023 @01:49PM (#63863590)

They are finally going after training and suing for statutory damages.
This is the Achilles heel of AI, you can argue about whether the network is derivative but you can't argue they aren't making copies during training. With statutory damages they don't have to show damage, only infringement. DMCA exemptions don't apply without Olympic level gymnastics.
The only real hope OpenAI has is fair use, or government making a new law for them (like in Japan).

Share
twitter facebook
- Re: (Score:2)
  
  by WaffleMonster ( 969671 ) writes:
  
  This is the Achilles heel of AI, you can argue about whether the network is derivative but you can't argue they aren't making copies during training.
  The problem with relying on this argument is that fleeting copies are not fixed works. It's the same reason there is no copyright infringement for copies made via caches, buffers, routers, temporary files...etc.
  - Re: (Score:2)
    
    by Pinky's Brain ( 1158667 ) writes:
    
    Yet the DMCA feels the need to explicitly limit liability for all those.
- Re: (Score:2)
  
  by Bigjeff5 ( 1143585 ) writes:
  
  There's nothing illegal about copying works for private use. It's literally in the copyright statute.
  Copyright is a protection against distribution , not consumption .
  As long as they aren't distributing copies of the works they can do whatever the hell they want with them.
  - Re: (Score:2)
    
    by Pinky's Brain ( 1158667 ) writes:
    
    https://www.law.cornell.edu/us... [cornell.edu]
    "(2) that such new copy or adaptation is for archival purposes only"
  - Re: (Score:2)
    
    by cpt kangarooski ( 3773 ) writes:
    
    Copying works for private use infringes the reproduction right at 17 USC 106(1). There is not a general exception for private use. A specific instance of copying might fall under fair use, but just as easily might not; fair use has to be analyzed on a case-by-case basis and if you're merely copying a work for private use to avoid having to buy a copy, I would generally expect that it will not be treated as a fair use.
    In practice you might not get caught, but that's a separate issue.
- - Re: (Score:2)
    
    by Bahbus ( 1180627 ) writes:
    
    copyright* god damnit.
  - Re: (Score:3)
    
    by Pinky's Brain ( 1158667 ) writes:
    
    They copied the digital copy from the internet onto their storage, they copied that copy into the specific format the training software requires, the training software copied it into RAM.
    It's copies all the way down.
    - - Re: (Score:3)
        
        by Pinky's Brain ( 1158667 ) writes:
        
        Reading is not considered copying. Downloading Books3 to your hard drive is considered copying and has something to do with copyright law.
        Whether the AI spits out infringing copies is irrelevant to the infringement during training. Even if they had licensed digital copies and didn't use Books3/etc, it doesn't matter. Your license for an ebook doesn't allow copying for any other purpose than reading ... and they made a ton of copies.
        Making any digital copy is copying for copyright law. That's why DMCA has so
        
        Re: (Score:2)
        
        by Bahbus ( 1180627 ) writes:
        
        Your license for an ebook doesn't allow copying for any other purpose than reading ... and they made a ton of copies.
        No, they didn't. That's not how it works. You have no understanding of how an LLM is trained on sources and have even less understanding of copyright.
  - Re: (Score:3)
    
    by phantomfive ( 622387 ) writes:
    
    Yes, I can, because it doesn't. It's fed the information and then none of it is stored in completion,
    Feeding it information is "copying", whether it's stored or not. Some of it is provably stored in completion, because the AI can give quotes from the book. You are wrong on two counts. Turn your brain on.
    This author "trade group" is nothing but morons and retards who think they know how copywrite law works but they don't.
    Fortunately they hired lawyers who actually do understand how copyright law works. You don't understand but somehow think you do.
If they paid for the training data, it's fair game (Score:2)

by unami ( 1042872 ) writes:

As long as a work is publicly available, I don't see the problem with that. It's not like people are going to buy the next book from Martin-GPT instead. Authors clearly have distinct patterns and I'm sure, if you type in "write x in the style of author y" you'll not have to reroll your prompt very often until you'll get the exact same words author y has already written somewhere. Just pay at least a library for reading the book. It's a different story if an AI was trained with clearly unlicensed material,
- Re: (Score:2)
  
  by Bigjeff5 ( 1143585 ) writes:
  
  I'll bet someone could train up a Martin-GPT and finish his books before he could.
  And there would be nothing he could do about it, because they'd be brand new works written in his style.
  Remember people, copyright only protects words you've actually put to page (or any other storage medium), not words you might write down someday. The copy must physically exist somewhere to be subject to copyright. And for you mindless pedants out there, digital storage is a form of physical storage.
double standard (Score:2)

by Micah NC ( 5616634 ) writes:

Why is it when those authors learn from better authors it is FINE

But then when AI learns from better authors it is IP theft ?

Make up your mind, authors !
- Re:double standard (Score:4, Informative)
  
  by avandesande ( 143899 ) writes: on Wednesday September 20, 2023 @03:07PM (#63863848) Journal
  
  Laws protecting fair use are written for people not computers.
  
  Parent Share
  twitter facebook
  - Re: (Score:2)
    
    by Bigjeff5 ( 1143585 ) writes:
    
    Copyright is written for people, not computers.
    - Re: (Score:2)
      
      by avandesande ( 143899 ) writes:
      
      Yes, copyright is there to protect peoples work. I am not sure what your point is.
      - Re: (Score:2)
        
        by penguinoid ( 724646 ) writes:
        
        Not it isn't, copyright is in the US to promote the advancement of science and the useful arts. Anything else would be outlawed by the 10th Amendment and be an infringement of the 1st Amendment.
  - Re: (Score:2)
    
    by Tony Isaac ( 1301187 ) writes:
    
    Computers are just a tool. Ultimately, people tell computers what to do, even when they call it "AI."
    If you make handwritten copies of a copyrighted work, and distribute it, you're just as much in violation as if you use a photocopier, a printing press, or a website. And if your use is fair use (such as satire), then again, it doesn't matter if you hand-write, copy, print it, or publish on a website.
How we know the AI trained on John Grisham (Score:2)

by Applehu Akbar ( 2968043 ) writes:

This suit will go nowhere because the AI has transferred itself into a server on Grand Cayman, outside of US jurisdiction, and is now hoarding its income in a series of offshore bank accounts.
- Re: (Score:2)
  
  by HBI ( 10338492 ) writes:
  
  If I wanted to be untouchable, Grand Cayman is not far enough away from the US. Best bet is Russia or China. Ask Snowden.
- Re: (Score:3)
  
  by phantomfive ( 622387 ) writes:
  
  Do they realize that any human author sits down and starts a book the exact same way? They'd write an amalgamation of all movies, written stories, etc that they've ever read using a mix of language sentence structures and styles that they've ever experiences in their life.
  False. AI neural networks are good at interpolation, but suck at extrapolation. AI creates an amalgamation of things it's seen before, humans extend it to something new.
- Re: (Score:2)
  
  by smooth wombat ( 796938 ) writes:
  
  Nope. Authors create stories. AI regurgitates. Excluding John Grisham who can create a book in six months [nateshivar.com], authors have to go through whatever process they have to create their works. This may involve research about whatever it is they're writing about. While C.S. Lewis clearly created his works from his own imagination, the same cannot be said of AI. AI simply reads the words and does an analysis of "what most likely comes next" based on its training. It does not create in the sense an author create
  - Re: (Score:2)
    
    by techno-vampire ( 666512 ) writes:
    
    Grisham isn't the only Big Name Author who writes that fast. Look at how many books Nora Roberts churns out, most, if not all of them best sellers. Not only that, she's also writing the popular futuristic mystery series In Death [wikipedia.org] using the pen name J.D. Robb, so that there aren't too many Nora Roberts books on the shelf at the same time. There are now over 50 books in the series, and more coming out at a rate of two per year, along with what's coming out under her own name. Some authors write fast, some
- Re: (Score:3)
  
  by Bahbus ( 1180627 ) writes:
  
  Useless. People will just remove the designation. And your second thing...are you suggesting that AI list any closely related copyrighted works it might use in it's generated answer? Also useless and impossible. To do that, you would NEED OpenAI to forcefully feed and train it on ALL copyrighted works to ever be created, as well as keep up with new ones that are released in real-time.
- Re: (Score:2)
  
  by phantomfive ( 622387 ) writes:
  
  Children are different than AI, both factually and legally. Maybe someday you'll realize that and stop using that analogy, which doesn't apply.
  - - Re: (Score:2)
      
      by phantomfive ( 622387 ) writes:
      
      See for example: https://scholarship.law.edu/cg... [law.edu]
    - Re: (Score:3)
      
      by Bigjeff5 ( 1143585 ) writes:
      
      It doesn't apply because he said it doesn't apply, ok! Gosh!
      I can't believe you expect a slashdotter to back up the argument that he made up that clearly has no basis in law or even sound logic.
      - Re: (Score:2)
        
        by phantomfive ( 622387 ) writes:
        
        https://scholarship.law.edu/cg... [law.edu]
- Re: (Score:2)
  
  by avandesande ( 143899 ) writes:
  
  One could argue that only humans are protected by the concept of 'fair use'.
  - Re: (Score:2)
    
    by WaffleMonster ( 969671 ) writes:
    
    One could argue that only humans are protected by the concept of 'fair use'.
    Fair use isn't relevant to the issue at hand.
    - Re: (Score:2)
      
      by avandesande ( 143899 ) writes:
      
      Why not? Adapting your experiences and learning to create new works is the most granular form of 'Fair Use', and we are looking directly at copyright law.

There may be more comments in this discussion. Without JavaScript enabled, you might want to turn on Classic Discussion System in your preferences instead.

May the odds ever be in your favor (Score:2)

Re: May the odds ever be in your favor (Score:4, Insightful)

Re: May the odds ever be in your favor (Score:5, Insightful)

Re: May the odds ever be in your favor (Score:2)

Re: (Score:2)

Re: (Score:2, Interesting)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: May the odds ever be in your favor (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: May the odds ever be in your favor (Score:4)

Re: (Score:2)

Re: May the odds ever be in your favor (Score:2)

George is just afraid (Score:5, Funny)

Re: (Score:2)

Re: (Score:2, Flamebait)

Re: (Score:2)

Re: (Score:2)

Re:George is just afraid (Score:4, Interesting)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2, Troll)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

Comment removed (Score:4, Interesting)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

Re:George is just afraid (Score:5, Insightful)

Re: (Score:2)

The Authors Have a Good Point (Score:5, Informative)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: The Authors Have a Good Point (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

About that watermark... (Score:3, Funny)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Reading... (Score:2)

Re: (Score:2)

George has a point (Score:2)

Looking forward to ... 'Data of Deceit' (Score:2)

Get a horse (Score:4, Insightful)

Re: (Score:3)

Re: (Score:3)

Good (Score:2)

Finally a proper lawsuit (Score:3)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:2)

Re: (Score:3)

Re: (Score:3)

Re: (Score:2)

Re: (Score:3)

If they paid for the training data, it's fair game (Score:2)