Mark Zuckerberg Gave Meta's Llama Team the OK To Train On Copyright Works, Filing Claims (techcrunch.com) 30
Plaintiffs in Kadrey v. Meta allege that Meta CEO Mark Zuckerberg authorized the team behind the company's Llama AI models to use a dataset of pirated ebooks and articles for training. They further accuse the company of concealing its actions by stripping copyright information and torrenting the data. TechCrunch reports: In newly unredacted documents filed (PDF) with the U.S. District Court for the Northern District of California late Wednesday, plaintiffs in Kadrey v. Meta, who include bestselling authors Sarah Silverman and Ta-Nehisi Coates, recount Meta's testimony from late last year, during which it was revealed that Zuckerberg approved Meta's use of a data set called LibGen for Llama-related training. LibGen, which describes itself as a "links aggregator," provides access to copyrighted works from publishers including Cengage Learning, Macmillan Learning, McGraw Hill, and Pearson Education. LibGen has been sued a number of times, ordered to shut down, and fined tens of millions of dollars for copyright infringement.
According to Meta's testimony, as relayed by plaintiffs' counsel, Zuckerberg cleared the use of LibGen to train at least one of Meta's Llama models despite concerns within Meta's AI exec team and others at the company. The filing quotes Meta employees as referring to LibGen as a "data set we know to be pirated," and flagging that its use "may undermine [Meta's] negotiating position with regulators." The filing also cites a memo to Meta AI decision-makers noting that after "escalation to MZ," Meta's AI team "[was] approved to use LibGen." (MZ, here, is rather obvious shorthand for "Mark Zuckerberg.")
The details seemingly line up with reporting from The New York Times last April, which suggested that Meta cut corners to gather data for its AI. At one point, Meta was hiring contractors in Africa to aggregate summaries of books and considering buying the publisher Simon & Schuster, according to the Times. But the company's execs determined that it would take too long to negotiate licenses and reasoned that fair use was a solid defense. The filing Wednesday contains new accusations, like that Meta might've tried to conceal its alleged infringement by stripping the LibGen data of attribution.
Meta[stasize] is metastasizing! (Score:2)
The cancer is deep, and spreading!
Shocked! (Score:3)
I'm shocked! Shocked, I tell you!
Well, not that shocked.
Re: (Score:2)
This is how Facebook has always behaved. It's the old principle of "it's better to beg forgiveness than ask permission", carried to ridiculous extremes. They have historically broken both laws and norms... then, when they get caught, they say "mea culpa" - but with the damage already done and not recoverable, which seems to be the intent.
So, unfortunately, your joke/meme doesn't work with Facebook-related news simply because no one could possibly be shocked by their behavior after all this time.
Re: (Score:2)
Where did Meta do that?
Also, "for any purpose" is doing a lot of heavy lifting there - not least of which since you didn't include the word "publicly" in that sentence.
Also, in most cases, copyvio is not a crime. Non-fair use of copyrighted material is generally a civil offense. There are statutes for criminal copyright infringement, but they generally apply to things like bootlegging ri
Re: (Score:2, Informative)
If I downloaded a torrent of copyrighted work, and used that to make a product, then sell that product, I'd expect to be chased after for criminal copyright infringement, since I intended to financially benefit from the willful copyright infringement.
If it's intentional infringement and for large scale commercial gain, it's criminal.
Monetary damages for large corporations should be dropped in favour of the alternative, which is already an option in law, of imprisonment.
Mark Zuckerberg would gladly hand over
Re:2 sets of laws: ones for the rich and (Score:4, Informative)
First, if you download a copy of This Old House, and then you use that information to build a house, that house is NOT a violation of copyright.
Secondly, there are broad, generally accepted exemptions in copyright law for the automated processing of copyrighted information. Literally like 95% of Google's business model would be illegal if not for that.
For an extreme case, look at Google Books. Google mass-scanned books, not just without permission, but explicitly against publisher wishes. Then it put them online, made them searchable, and showed excerpts (up to whole pages at a time). Zero permission - they just went and did this, and they didn't just learn from the books, they reproduced exact content from them.
Guess what? The courts found even that to be a transformative use. Google won.
There have been cases on AI training that have reached completion - for example, the LAION case in Germany. It was upheld.
Contrary to what you may think, copyright doesn't give the holder a dictatorship. For example, you can shout from the rooftops how you absolutely ban anyone from using it for parody.... tough luck, you don't have the right to stop it; you were never granted that right. There are broad classes of exemptions upheld by the courts. The purpose of copyright law is prosocial. It is intended to strike a balance between encouraging the creation of more material, and enabling society to benefit from said material.
Copyright is also based on specific works**. Like with the house, it doesn't matter if the housebuilder learned from a given work - so long as they're not reproducing a specific copyright-protected house, to within the bounds of qualification as a derivative work, they're perfectly fine; the copyright holder of the book they learned from has absolutely no claim against them. Styles are not copyrightable.
** The one oddball carve-out is character copyrights. But it's a pretty narrowly circumscribed exception, and characters are considered to stem from specific works anyway.
Re: (Score:2)
I'd expect to be chased after for criminal copyright infringement
That's very unlikely.
You might be sued in civil court, but the police won't be involved.
since I intended to financially benefit
Your financial benefit is irrelevant. The copying is illegal, not the profit.
I doubt he'd like spending a year in prison.
Very unlikely. Even Kim Dotcom didn't go to prison.
Everywhere All At Once (Score:1)
Where did Meta do that?
They potentially do it anytime you ask their model for a result: it may at any time include portions of copyrighted works, which has been demonstrated pretty often.
That is copyrighted work being published from the website and service they built, which you are accessing.
Re: (Score:1)
They potentially do it anytime you ask their model for a result: it may at any time include portions of copyrighted works, which has been demonstrated pretty often.
Exactly.
Re: (Score:2)
Re: (Score:1)
Portions, as in snippets or relatively short excerpts? You mean just like Google Books does? Sounds like fair use to me.
If I took 100 pages of copyrighted works from 10 books and made my own book with them, that would just be 10 different copyright violations.
Re: (Score:1)
Portions, as in snippets or relatively short excerpts?
No.
I mean entire images except for slight details and a background changed.
Or multiple paragraphs, un-cited.
You mean just like Google Books does?
They acknowledge the original author and the work it came from, even when what's shown is only similar.
Re: (Score:3)
You can make a reasonable argument that AI training is fair use. After all, it's really just a mechanized version of what humans do. Where do writers get their ideas? There are all kinds of answers they'll give you -- real life observation, experience, even just the act of sitting down and writing. But one thing they never say, though they all do it, is that they get their ideas from other writers. Writers are readers first; everything they read goes into their (actual) neural net and comes back out as new stuff. Every
Re: (Score:2, Insightful)
That's an interesting question about 'fair use'. But I think we have an answer. A human is expected to give credit, and to not parrot back as his/her own work the complete copyrighted material. Selected quotes are OK, with the expectation that the human adds value/provides additional relevant content. But generative AI seems to violate both expectations. No credit, and no limits on what is extracted and presented back.
Re: (Score:2)
>> for any purpose
Well, not for *any* purpose.
"Fair use" allows people to use copyrighted works for purposes of criticism and commentary, news reporting, teaching, parody, and research, for example. I'm not qualified to determine whether facebook's use of copyrighted works here is actually fair use, but that is certainly what they are claiming.
Re:2 sets of laws: ones for the rich and (Score:4, Insightful)
"put it through a filter"
Yeah, mate, that's not how AI works.
Since the core of every generator is a detector, reverse the situation. Think of image recognition. You take a picture of your dog and feed it to an image detector. It highlights the dog in a bounding box and labels it "Dog [99.97%]".
Was it trained with that picture? Of course not.
Was it trained with any picture of your dog? Almost certainly not.
Rather, it knows what dogs are visually. It didn't memorize its training data; it used its training data to distill the essence of a dog - what sort of complex arrangement of high- and low-level features distinguishes dogs from other things.
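A minimal sketch of that idea, assuming torchvision's COCO-pretrained Faster R-CNN and a hypothetical photo "my_dog.jpg" that was never part of any training set; the 0.5 score cutoff is an arbitrary illustration:

import torch
from torchvision.io import read_image
from torchvision.models.detection import fasterrcnn_resnet50_fpn, FasterRCNN_ResNet50_FPN_Weights

# Load a detector pretrained on COCO (which almost certainly never saw your dog).
weights = FasterRCNN_ResNet50_FPN_Weights.DEFAULT
model = fasterrcnn_resnet50_fpn(weights=weights).eval()

img = read_image("my_dog.jpg")           # hypothetical photo, uint8 CHW tensor
batch = [weights.transforms()(img)]      # model-specific preprocessing

with torch.no_grad():
    det = model(batch)[0]                # dict with "boxes", "labels", "scores"

categories = weights.meta["categories"]  # COCO class names, including "dog"
for box, label, score in zip(det["boxes"], det["labels"], det["scores"]):
    if score.item() > 0.5:               # arbitrary confidence cutoff
        print(f"{categories[label.item()]} [{score.item():.2%}] at {box.tolist()}")

It prints something like "dog [99.97%]" with a bounding box not because your photo is stored in the weights, but because the model generalized what dogs look like.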
That doesn't apply just to image detectors - it applies to *all* DNNs (and all NNs, period). DNNs have the ability to generalize from data, and it is this ability that is desired. That's not to say they don't also have the ability to memorize. But they're not doing that from seeing something once, with a learning rate of 1e-5. There are also fundamental limits to how much DNNs, like everything else in the world, can physically memorize. Generalization is vastly more space-efficient than memorization. AI performance improves with both longer training and larger training data sets because AI performance is measured by how well models generalize, and both of those things improve generalization.
If you train a 10GB video generation model on Youtube's 100 petabytes of compressed video data (perhaps 10 exabytes uncompressed), I don't know how to break it to you, but that 10GB model does not contain 10 exabytes of videos (a 1e9-to-1 compression ratio). That's just not happening. Those videos aren't in there - they're gone. What is in there is what people look like, what animals look like, how people move, how animals move, how people and animals interact..... on and on and on. The generalization of the latent space that is video.
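A quick back-of-envelope version of that size argument in Python (the figures are the commenter's hypotheticals, not real measurements):

# Hypothetical figures from the comment above, not measured values.
model_bytes  = 10 * 10**9    # a 10 GB video-generation model
source_bytes = 10 * 10**18   # ~10 exabytes of uncompressed source video
print(f"required compression ratio: {source_bytes / model_bytes:.0e} to 1")  # ~1e+09 to 1

No compression scheme, lossless or recognizably lossy, gets anywhere near a billion-to-one, which is the commenter's point: the training videos themselves cannot be sitting inside the model.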
Reverse does not hold (Score:1)
Rather, it knows what dogs are visually.
You cannot just simply reverse this argument as you are doing, when there are countless examples of AI outright reproducing copyrighted artworks.
For recognizers that's fine: all the material in the models they store on their servers, which you cannot access, represents an abstract and lossy encoding of many copyrighted works.
For image generation it is not fine to use obviously similar and marginally changed works from artists in images that you are transmitting.
Re: (Score:2)
Just to make myself clear: taking copyrighted work and putting it on the internet for any purpose without permission is a crime
Nope. Copyright violations are torts, not crimes.
Re: (Score:1)
Nope. Copyright violations are torts, not crimes.
Copyright infringers can be sued civilly and in some cases prosecuted criminally for the same infringing act.
The Swamp Exposed (Score:2)
That proven orangeNoser [youtu.be] should be locked up.
Go Zuck. Fuck 95-year copyright very much. (Score:1)
Even Zuck.
Pirate on until copyright is a fit-for-purpose 5 years.
Unfortunately this may be needed (Score:2)
I have no idea what to do about the legal and moral problems. But I would greatly prefer an AI whose knowledge is not limited to non-copyrighted work.
Abolish copyright (Score:2)
At this point, the only logical thing to do is to abolish copyright entirely. If corporations don't have to follow it, why should anybody else? If anything, AI has proven that the romantic idea of a scientific/engineering/artistic genius was just an illusion, most creative work is easily automated. So why should it get special protections? Culture has existed before copyright was a thing, and will exist afterwards. Without IP reform, humanity will end up being slaves to megacorps that can ignore it and then
Abolish Copyright = Give Money to MegaCorps (Score:2)
Re: (Score:2)
Re: (Score:2)
Most people who state "we need to completely abolish copyright" aren't thinking beyond the level of "I want to be able to freely download movies and music without any possible restrictions or repercussions".
Cue the standard ... (Score:1)
plaintiffs in Kadrey v. Meta, who include bestselling authors Sarah Silverman and Ta-Nehisi Coates
And if someone figures out how to remove their stuff from the dataset, nothing of value would be lost ...
Re: (Score:1)