• Snot Flickerman@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    179
    arrow-down
    4
    ·
    edit-2
    6 months ago

    “To the extent a response is deemed required, Meta denies that its use of copyrighted works to train Llama required consent, credit, or compensation,” Meta writes.

    The authors further stated that, as far as their books appear in the Books3 database, they are referred to as “infringed works”. This prompted Meta to respond with yet another denial. “Meta denies that it infringed Plaintiffs’ alleged copyrights,” the company writes.

    When you compare the attitudes on this and compare them to how people treated The Pirate Bay, it becomes pretty fucking clear that we live in a society with an entirely different set of rules for established corporations.

    The main reason they were able to prosecute TPB admins was the claim they were making money. Arguably, they made very little, but the copyright cabal tried to prove that they were making just oodles of money off of piracy.

    Meta knew that these files were pirated. Everyone did. The page where you could download Books3 literally referenced Bibliotik, the private torrent tracker where they were all downloaded. Bibliotik also provides tools to strip DRM from ebooks, something that is a DMCA violation.

    This dataset contains all of bibliotik in plain .txt form, aka 197,000 books processed in exactly the same way as did for bookcorpusopen (a.k.a. books1)

    They knew full well the provenance of this data, and they didn’t give a flying fuck. They are making money off of what they’ve done with the data. How are we so willing to let Meta get away with this while we were literally willing to let US lawyers turn Swedish law upside-down to prosecute a bunch of fucking nerds with hardly any money? Probably because money.

    Trump wasn’t wrong, when you’re famous enough, they let you do it.

    Fuck this sick broken fucking system.

    • kibiz0r@lemmy.world
      link
      fedilink
      English
      arrow-up
      48
      ·
      6 months ago

      The main reason they were able to prosecute TPB admins was the claim they were making money.

      I think in the Darknet Diaries episode about TPB, the guy said they never even made enough off of ads to pay for the server costs.

      • Snot Flickerman@lemmy.blahaj.zone
        link
        fedilink
        English
        arrow-up
        32
        ·
        edit-2
        6 months ago

        He also said as much in their documentary TPB AFK.

        Maybe the issue was they didn’t make enough money? If they had truly been greedy bastards they could have used that money to win the court case? What a joke.

    • Dr. Moose@lemmy.world
      link
      fedilink
      English
      arrow-up
      18
      arrow-down
      11
      ·
      edit-2
      6 months ago

      They’re the same issue tho. Piracy and using books for corporate AI training both should be fine. The same people going after data freedom are pushing this AI drama too. There’s too much money in copyright holding and it’s not being held by your favorite deviantart artists.

      • kibiz0r@lemmy.world
        link
        fedilink
        English
        arrow-up
        52
        arrow-down
        5
        ·
        6 months ago

        It’s not the same issue at all.

        Piracy distributes power. It allows disenfranchised or marginalized people to access information and participate in culture, no matter where they live or how much money they have. It subverts a top-down read-only culture by enabling read-write access for anyone.

        Large-scale computing services like these so-called AIs consolidate power. They displace access to the original information and the headwaters of culture. They are for-profit services, tuned to the interests of specific American companies. They suppress read-write channels between author and audience.

        One gives power to the people. One gives power to 5 massive corporations.

        • Snot Flickerman@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          24
          arrow-down
          1
          ·
          edit-2
          6 months ago

          Extremely well-said.

          Also, it’s important to point out that the one that empowers people is the one that is consistently punished far more egregiously.

          We have governments blocking the likes of Sci-Hub, Libgen, and Annas-Archive, but nobody is blocking Meta’s LLMs for the same.

          If they were treated similarly, I would be far less upset about Meta’s arguments. However it’s clear that governments prioritize the success of business over the success of humanity.

        • Dr. Moose@lemmy.world
          link
          fedilink
          English
          arrow-up
          9
          ·
          edit-2
          6 months ago

          It’s the opposite. Closing down public resources would be regulatory capture and that would be consolidation of power.

          Who do you think can afford to pay billions in copyright to produce models? Only mega corporations and pirates. No more small AI companies. No more open source models.

        • archomrade [he/him]@midwest.social
          link
          fedilink
          English
          arrow-up
          7
          ·
          6 months ago

          I wish we could be talking about the power imbalances of corporate bodies exercised through the use of capital ownership, instead of squabbling about how that differential is manifested through a specific act of piracy.

          The reason we view acts of piracy different when they are committed by corporate bodies is because of the power of their capital, not because the act itself is any different. The issue with Meta and OpenAI using pirated data in the production of LMM’s is that they maintain ownership of the final product to be profited from, not that the LMM comes to exist in the first place (even if it is through questionable means). Had they come to create these models from data that they already owned (I need not remind you that they have already claimed their right to a truly sickening amount of it, without having paid a cent), their profiting from it wouldn’t be any less problematic - LLM’s will still undermine the security of the working class and consolidate wealth into fewer and fewer hands. If we were to apply copyright here as it’s being advocated, nothing fundamental will change in that dynamic; in fact, it will only reinforce the basis of that power imbalance (ownership over capital being the primary vehicle) and delay the inevitable (continued consolidation).

          If you’re really concerned with these corporations growing larger and their influence spreading further, then you should be directing your efforts at disrupting that vehicle of influence, not legitimizing it. I understand there’s an enraging double-standard at play here, but the solution isn’t to double down on private ownership, it should be to undermine and seize it for common ownership so that everyone benefits from the advancement.

        • Flying Squid@lemmy.world
          link
          fedilink
          English
          arrow-up
          2
          arrow-down
          1
          ·
          6 months ago

          I wonder if piracy could even benefit these corporations in the long term? Do people who pirate games and movies in their teens and twenties frequently go on to purchase such things when they’re older? I honestly don’t know, but I would love to see a study. I certainly have seen people make that claim.

          • Snot Flickerman@lemmy.blahaj.zone
            link
            fedilink
            English
            arrow-up
            2
            ·
            6 months ago

            Microsoft famously never went after pirates in Asian countries because despite piracy, it made them the default operating system.

            They wanted people to be so used to Windows that they would be willing to pirate it just to use a computer.

            It worked and their OS dominance for consumer OSes continues.

            • Flying Squid@lemmy.world
              link
              fedilink
              English
              arrow-up
              2
              ·
              6 months ago

              There you go. Piracy helps. I’m sure game companies and TV producers and so on feel the same way quite often. People who pirate are free marketing for them because they’ll tell other people about the product.

              • Snot Flickerman@lemmy.blahaj.zone
                link
                fedilink
                English
                arrow-up
                3
                ·
                edit-2
                6 months ago

                Further, piracy can be reduced or made to not impact you as much if you have the right business model.

                Louis CK (before he wrecked his career) famously made millions selling his comedy special through his website for $5 a pop with no Digital Rights Management. You were able to download a copy and keep it forever.

                With no DRM, this meant that copies of his special were able to be pirated easily. Prior to releasing this way, he had previously gone on piracy websites and made comments under his pirated specials politely asking people not to pirate, but understanding if they did it because they were too poor.

                Despite massive piracy of his special, enough people were happy to pay $5 for a DRM-free copy of his comedy special and if I recall correctly me made $5 million+ on that first special he released like that. It was a massive hit and people were encouraging each other to buy a copy since it was so cheap and respected you as a consumer.

                Gabe Newell wasn’t wrong, a big part of piracy always was a service problem.

                On December 10, 2011, C.K. released his fourth full-length special, Live at the Beacon Theater. Like Hilarious, it was produced independently and directed by C.K. However, unlike his earlier work, it was distributed digitally on his website, foregoing both physical and broadcast media. C.K. released the special for $5.00 and without DRM, hoping that these factors and the direct relationship between the artist and consumer would effectively deter illegal downloading.

      • Snot Flickerman@lemmy.blahaj.zone
        link
        fedilink
        English
        arrow-up
        15
        arrow-down
        1
        ·
        edit-2
        6 months ago

        So why are Meta, and say, Sci-Hub are treated so differently? I don’t necessarily disagree, but it’s interesting that we legally attack people who are sharing data altruistically (Sci-Hub gives research away for free so more research can be done, scientific research should be free to the world, because it benefits all of mankind), but when it comes to companies who break the same laws to just make more money, that’s fine somehow.

        It’s like trying to improve the world is punished, and being a selfish greedy fucking pig is celebrated and rewarded.

        Sci-Hub is so villified, it can be blocked at an ISP level (depending on where you live) and politicians are pushing for DNS-level blocking. Similar can be said for Libgen or Annas-Archive. Is anything like that happening to Meta? No? Huh, interesting. I wonder why Meta gets different treatment for similar behavior.

        I am willing to defend Meta’s use of this kind of data after the world has changed how they treat entities like Sci-Hub. Until that changes, all you are advocating for is for corporations to be able to break the law and for altruistic people to be punished. I agree they’re the same, but until the law treats them the same, you’re just giving freebies to giant corporations while fucking yourself in the ass.

        • SlopppyEngineer@lemmy.world
          link
          fedilink
          English
          arrow-up
          13
          ·
          6 months ago

          To me it always seems to come back to nobility. Big corpo is the new nobility and they have certain privileges not available to the common folk. In theory it shouldn’t exist but in practice it most certainly does.

          • Snot Flickerman@lemmy.blahaj.zone
            link
            fedilink
            English
            arrow-up
            15
            arrow-down
            2
            ·
            edit-2
            6 months ago

            The aristocracy never died, it just got a new name.

            I mean the US is literally built on the fact that the aristocracy in the US didn’t actually want to lose station, so they built a democracy that included many anti-democratic measures from the Senate to the Electoral College to only allowing land-owning white men to vote. The US was purpose built to serve the rich while paying lip-service to the poor.

            “Conservatives” were literally always those who wanted to conserve the monarchy and aristocracy. Those were the things they originally wanted to conserve, and plainly still fucking do.

            How people do not see this is a complete farce.

    • Flying Squid@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      arrow-down
      1
      ·
      6 months ago

      “To the extent a response is deemed required, Meta denies that its use of copyrighted works to train Llama required consent, credit, or compensation,” Meta writes.

      Cool, so I can train my AI on Facebook and Instagram posts and you’re fine if I don’t consent, credit or compensate you either, right Meta? It’s not even copyrighted in the first place, so you shouldn’t have a single complaint.

    • yesdogishere@kbin.social
      link
      fedilink
      arrow-up
      3
      ·
      6 months ago

      The only solution is vigilante justice. Bezos and all the directors and snr execs. Bring them all to justice. Exile to Mars.

    • The Hobbyist@lemmy.zip
      link
      fedilink
      English
      arrow-up
      6
      arrow-down
      3
      ·
      6 months ago

      Perhaps I’m misunderstanding, but it sounds like you’re suggesting we side with Meta to put a precedence in which pirating content is legal and allows websites like TPB to keep existing but legitimally? Or are you rather taking the opposite stand, which would further entrench the illegality of TPB activities and in the same swoop prevent meta from performing these actions?

      I don’t know if we can simultaneously oppose meta while protecting TPB, is there?

      • Tedrow@lemmy.world
        cake
        link
        fedilink
        English
        arrow-up
        15
        arrow-down
        1
        ·
        6 months ago

        I think what they are saying is that Meta is powerful enough to get away with it. You are attempting to equate two different things.

        Meta isn’t using the books for entertainment purposes. They are using another IP to develop their own product. There has to be a distinction here.

        • The Hobbyist@lemmy.zip
          link
          fedilink
          English
          arrow-up
          4
          ·
          6 months ago

          We are in agreement, but I was attempting to launch a discussion about how we want the laws to actually be applied and possibly how they should be reformulated.

      • Snot Flickerman@lemmy.blahaj.zone
        link
        fedilink
        English
        arrow-up
        9
        arrow-down
        1
        ·
        edit-2
        6 months ago

        I’m advocating that if we’re going to have copyright laws (or laws in general) that they’re applied consistently and not just siding with who has the most money.

        When it’s small artists needing their copyright to be defended? They’re crushed, ignored, and lose their copyright.

        Even when Sony was suing individuals for music piracy in the early 2000’s, artists had to sue Sony to see any money from those lawsuits. Those lawsuits were ostensibly brought by Sony for the artists, because the artists were being stolen from. Interesting that none of that money made it to artists without the artists having to sue Sony.

        Sony was also behind the rootkit disaster and has been sued many times for using unlicensed music in their films.

        It is well documented that copyright owners constantly break copyright to make money, and because they have so much fucking money, it’s easy for them to just weather the lawsuits. (“If the penalty for a crime is a fine, that law only exists for the lower classes.”)

        We literally brought US courtroom tactics to a foreign country and bought one of their judges to get The Pirate Bay case out the fucking door. It was corruption through and through.

        We prosecute people who can’t afford to defend themselves, and we just let those who have tons of money do whatever the fuck they want.

        The entire legal system is a joke of “who has the most money wins” and this is just one of many symptoms of it.

        It certainly feels like the laws don’t matter. We’re willing to put down people just trying to share information, but people trying to profit off of it insanely, nah that’s fine.

        I’m just asking for things to be applied evenly and realistically. Because right now corporations just make up their own fucking rules as they go along, stealing from the commons and claiming it was always theirs. While individuals just trying to share are treated like fucking villains.

        Look at how they treat Meta versus how they treat Sci-Hub. Sci-Hub exists only to promote and improve science by giving people access to scientific data. The entire copyright world is trying to fucking destroy them, and take them offline. But Facebook pirating to make money? Totes fucking okay! If it’s selfish, it’s fine, if it’s selfless, sue the fuck out of them!

        • The Hobbyist@lemmy.zip
          link
          fedilink
          English
          arrow-up
          2
          ·
          6 months ago

          Of course we should have consistent laws, but which way should we have it? We can either defend pirates and Meta, or none of them, so what are you saying? Unless there’s a third option I’m missing?

          • Snot Flickerman@lemmy.blahaj.zone
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            2
            ·
            edit-2
            6 months ago

            Are you really so naive that you think suddenly when Meta is let off the hook governments worldwide will change tack and let Sci-Hub/Libgen/etc off the hook as well?

            Like I said elsewhere, I’d be happy to defend Meta in a world where governments aren’t trying to kick altruistic sharing sites off the internet, while allowing selfish greedy sites to proliferate and make money off their piracy.

            However, that won’t change if Meta wins this case, it will just mean big corporations can get away with it and individuals and altruistic groups will still be prosecuted.