Data poisoning: how artists are sabotaging AI to take revenge on image generators::As AI developers indiscriminately suck up online content to train their models, artists are seeking ways to fight back.

  • gaiussabinus@lemmy.world
    link
    fedilink
    English
    arrow-up
    54
    arrow-down
    2
    ·
    10 months ago

    This system runs on the assumption that A) massive generalized scraping is still required B) You maintain the metadata of the original image C) No transformation has occurred to the poisoned picture prior to training(Stable diffusion is 512x512). Nowhere in the linked paper did they say they had conditioned the poisoned data to conform to the data set. This appears to be a case of fighting the last war.

  • Blaster M@lemmy.world
    link
    fedilink
    English
    arrow-up
    46
    arrow-down
    3
    ·
    10 months ago

    Takes image, applies antialiasing and resize

    Oh, look at that, defeated by the completely normal process of preparing the image for training

  • qooqie@lemmy.world
    link
    fedilink
    English
    arrow-up
    39
    arrow-down
    5
    ·
    10 months ago

    Unfortunately for them there’s a lot of jobs dedicated to cleaning data so I’m not sure if this would even be effective. Plus there’s an overwhelming amount of data that isn’t “poisoned” so it would just get drowned out if never caught

  • Potatos_are_not_friends@lemmy.world
    link
    fedilink
    English
    arrow-up
    27
    arrow-down
    3
    ·
    10 months ago

    Imagine if writers did the same things by writing gibberish.

    At some point, it becomes pretty easy to devalue that content and create other systems to filter it.

    • kromem@lemmy.world
      link
      fedilink
      English
      arrow-up
      17
      arrow-down
      5
      ·
      10 months ago

      Shhhhh.

      Let them keep doing the modern equivalent of “I do not consent for my MySpace profile to be used for anything” disclaimers.

      It keeps them busy on meaningless crap that isn’t actually doing anything but makes them feel better.

  • kromem@lemmy.world
    link
    fedilink
    English
    arrow-up
    15
    arrow-down
    1
    ·
    edit-2
    10 months ago

    This doesn’t actually work. It doesn’t even need ingestion to do anything special to avoid.

    Let’s say you draw cartoon pictures of cats.

    And your friend draws pointillist images of cats.

    If you and your friend don’t coordinate, it’s possible you’ll bias your cat images to look like dogs in the data but your friend will bias their images to look like horses.

    Now each of your biasing efforts become noise and not signal.

    Then you need to consider if you are also biasing ‘cartoon’ and ‘pointillism’ attributes as well, and need to coordinate with the majority of other people making cartoon or pointillist images.

    When you consider the number of different attributes that need to be biased for a given image and the compounding number of coordinations that would need to be made at scale to be effective, this is just a nonsense initiative that was an interesting research paper in lab conditions but is the equivalent of a mouse model or in vitro cancer cure being taken up by naturopaths as if it’s going to work in humans.

  • HejMedDig@feddit.dk
    link
    fedilink
    English
    arrow-up
    14
    arrow-down
    1
    ·
    10 months ago

    Let’s see how long before someone figures out how to poison, so it returns NSFW Images

    • daxnx01@lemmy.world
      link
      fedilink
      English
      arrow-up
      5
      ·
      10 months ago

      You can create NSFW ai images already though?

      Or did you mean, when poisoned data is used a NSFW image is created instead of the expected image?

    • AVincentInSpace@pawb.social
      link
      fedilink
      English
      arrow-up
      4
      ·
      edit-2
      10 months ago

      companies would stumble all over themselves to figure out how to get it to stop doing that before going live. source: they already are. see bing image generator appending “ethnically ambiguous” to every prompt it receives

      it would be a herculean if not impossible effort on the artists’ part only to watch the corpos scramble for max 2 weeks.

      when will you people learn that you cannot fight AI by trying to poison it. there is nothing you can do that horny weebs haven’t already done.

      • General_Effort@lemmy.world
        link
        fedilink
        English
        arrow-up
        3
        ·
        10 months ago

        It can only target open source, so it wouldn’t bother corpos at all. The people behind this object to not everything being owned and controlled. That’s the whole point.

      • HejMedDig@feddit.dk
        link
        fedilink
        English
        arrow-up
        1
        ·
        10 months ago

        The Nightshade poisoning attack claims that it can corrupt a Stable Diffusion in less than 100 samples. Probably not to NSFW level. How easy it is to manufacture those 100 samples is not mentioned in the abstract

        • AVincentInSpace@pawb.social
          link
          fedilink
          English
          arrow-up
          2
          ·
          edit-2
          10 months ago

          yeah the operative word in that sentence is “claims”

          I’d love nothing more than to be wrong, but after seeing how quickly Glaze got defeated (not only did it make the images nauseating for a human to look at despite claiming to be invisible, not even 48 hours after the official launch there was a neural network trained to reverse its effects automatically with like 95% accuracy), suffice to say my hopes aren’t high.

  • Uriel238 [all pronouns]@lemmy.blahaj.zone
    link
    fedilink
    English
    arrow-up
    10
    ·
    edit-2
    10 months ago

    The general term for this is adversarial input, and we’ve seen published reports about it since 2011 when ot was considered a threat if CSAM could be overlayed with secondary images so they weren’t recognized by Google image filters or CSAM image trackers. If Apple went through with their plan to scan private iCloud accounts for CSAM we may have seen this development.

    So far (AFAIK) we’ve not seen adversarial overlays on CSAM though in China the technique is used to deter trackng by facial recognition. Images on social media are overlaid by human rights activists / mischief-makers so that social media pics fail to match secirity footage.

    The thing is like an invisible watermark, these processes are easy to detect (and reverse) once users are aware they’re a thing. So if a generative AI project is aware that some images may be poisoned, it’s just a matter of adding a detection and removal process to the pathway from candidate image to training database.

    Similarly, once enough people start poisoning their social media images, the data scrapers will start scaning and removing overlays even before the database sets are sold to law enforcement and commercial interests.

  • Kedly@lemm.ee
    link
    fedilink
    English
    arrow-up
    5
    arrow-down
    4
    ·
    10 months ago

    Man, whenever I start getting tired by the amount of Tankies on Lemmy, the linux users and decent AI takes in users rejuvenates me. The rest of the internet has jumped full throttle on the AI hate train

  • Sabin10@lemmy.world
    link
    fedilink
    English
    arrow-up
    9
    arrow-down
    17
    ·
    10 months ago

    Data poisoning isn’t limited to just AI stuff and you should be doing it at every opportunity.

  • Dr. Moose@lemmy.world
    link
    fedilink
    English
    arrow-up
    22
    arrow-down
    33
    ·
    10 months ago

    Just don’t out your art to public if you don’t want someone/thing learn from it. The clinging to relevance and this pompous self importance is so cringe. So replacing blue collar work is ok but some shitty drawings somehow have higher ethical value?

    • Catoblepas@lemmy.blahaj.zone
      link
      fedilink
      English
      arrow-up
      27
      arrow-down
      9
      ·
      10 months ago

      “Just don’t make a living with your art if you aren’t okay with AI venture capitalists using it to train their plagiarism machines without getting permission from you or compensating you in any way!”

      If y’all hate artists so much then only interact with AI content and see how much you enjoy it. 🤷‍♂️

      • teichflamme@lemm.ee
        link
        fedilink
        English
        arrow-up
        2
        ·
        10 months ago

        It has literally nothing to do with plagiarism.

        Every artist has looked at other art for inspiration. It’s the most common thing in the world. Literally what you do in art school.

        • Catoblepas@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          1
          ·
          10 months ago

          It’s not an artist any more than a xerox machine is. It hasn’t gone to art school. It doesn’t have thoughts, ideas, or the ability to create. It can only take and reuse what has already been created.

          • teichflamme@lemm.ee
            link
            fedilink
            English
            arrow-up
            4
            arrow-down
            1
            ·
            10 months ago

            The ideas are what the prompts and fine tuning is for. If you think it’s literally copying an existing piece of art you just lack understanding because that’s not how it works at all.

      • Dr. Moose@lemmy.world
        link
        fedilink
        English
        arrow-up
        11
        arrow-down
        21
        ·
        10 months ago

        It has nothing to do with AI venture capitalists. Also not every profession is entitled to income, some are fine to remain as primarily hobbies.

        AI art is replacing corporate art which is not something we should be worried about. Less people working on that drivel is a net good for humanity. If can get billions of hours wasted on designing ads towards real meaningful contributions we should added billions extra hours to our actual productivity. That is good.

        • Catoblepas@lemmy.blahaj.zone
          link
          fedilink
          English
          arrow-up
          8
          ·
          10 months ago

          The ratio of using AI to replace ad art:fraud/plagiarism has to be somewhere around 1:1000.

          “Actual productivity” is a nonsense term when it comes to art. Why is this less “meaningful” than this?

          Without checking the source, can you even tell which one is art for an ad and which isn’t?

          • lad@programming.dev
            link
            fedilink
            English
            arrow-up
            3
            ·
            10 months ago

            I would assume the first to be an ad, because most of depicted people look happy

          • Dr. Moose@lemmy.world
            link
            fedilink
            English
            arrow-up
            7
            arrow-down
            14
            ·
            10 months ago

            I’m not sure what’s your point here? Majority of art is drivel. Most art is produced for marketing. Literally. If that can be automated away what are we losing here? McDonald’s logos? Not everything needs to be a career.

              • Dr. Moose@lemmy.world
                link
                fedilink
                English
                arrow-up
                4
                arrow-down
                8
                ·
                10 months ago

                Nah. In literally being proven right real time. You can set a reminder or something :)

                • TrickDacy@lemmy.world
                  link
                  fedilink
                  English
                  arrow-up
                  4
                  arrow-down
                  6
                  ·
                  10 months ago

                  Not sure how you can be “right” to generally just shit on the concept of art and think it’s better replaced by ai.

        • Flying Squid@lemmy.world
          link
          fedilink
          English
          arrow-up
          10
          arrow-down
          5
          ·
          10 months ago

          Also not every profession is entitled to income

          Yes it is. Otherwise it is not a profession. People go to school for years to become professional artists. They are absolutely entitled to income.

          But hey, you want your murals painted by robots and your wall art printed out, have fun. I’m not interested in your brave new world.

            • Flying Squid@lemmy.world
              link
              fedilink
              English
              arrow-up
              5
              arrow-down
              4
              ·
              10 months ago

              So you think you’re not entitled to income from your work? That doesn’t sound like something a professional would say. “I’m obsolete, don’t pay me.”

              • Dr. Moose@lemmy.world
                link
                fedilink
                English
                arrow-up
                5
                arrow-down
                2
                ·
                edit-2
                10 months ago

                Nah I understand what’s going on. AI is not replacing real artists. It’s replacing sweatshops. And even when it will eventually replace most of art grunt work we’ll find something more interesting to do like curate the art, mix, match, add extra meta layers and so on.

                This closed mind protectionism is just silly. Not only it’s not sustainable because you will never win it’s also incredibly desperate. No real artist would cry and whine here when given this super power.

                Also pay is not everything in life. Maybe think about that for a second when you discuss art

          • Kedly@lemm.ee
            link
            fedilink
            English
            arrow-up
            1
            arrow-down
            1
            ·
            10 months ago

            The robots are far better at giving me what I want than humans ever were, so yeah, I I ironically am stoked for robot wall art and murals

    • Red_October@lemmy.world
      link
      fedilink
      English
      arrow-up
      14
      arrow-down
      4
      ·
      10 months ago

      The idea that you would actually object to replacing labor with automation, but think replacing art with automation is fine, is genuinely baffling.

      • Dr. Moose@lemmy.world
        link
        fedilink
        English
        arrow-up
        8
        arrow-down
        16
        ·
        10 months ago

        Except the “art” ai is replacing is labor. This snobby ridiculous bullshit that some corporate drawings are somehow more important than other things is super cringe.

      • Ilovethebomb@lemm.ee
        link
        fedilink
        English
        arrow-up
        16
        arrow-down
        5
        ·
        10 months ago

        Yeah, no. There’s a difference between posting your work for someone to enjoy, and posting it to be used in a commercial enterprise with no recompense to you.

        • Dr. Moose@lemmy.world
          link
          fedilink
          English
          arrow-up
          4
          arrow-down
          7
          ·
          10 months ago

          How are you going to stop that lol it’s ridiculous. Would you stop a corporate suit from viewing your painting because they might learn how to make a similar one? It’s makes absolutely zero sense and I can’t believe delulus online are failing to comprehend such simple concept of “computers being able to learn”.

          • Cyber Yuki@lemmy.world
            link
            fedilink
            English
            arrow-up
            6
            arrow-down
            3
            ·
            10 months ago

            Ah yes, just because lockpickers can enter a house suddenly everyone’s allowed to break and enter. 🙄

          • BURN@lemmy.world
            link
            fedilink
            English
            arrow-up
            3
            arrow-down
            7
            ·
            10 months ago

            Computers can’t learn. I’m really tired of seeing this idea paraded around.

            You’re clearly showing your ignorance here. Computers do not learn, they create statistical models based on input data.

            A human seeing a piece of art and being inspired isn’t comparable to a machine reducing that to 1’s and 0’s and then adjusting weights in a table somewhere. It does not “understand” the concept, nor did it “learn” about a new piece of art.

            Enforcement is simple. Any output from a model trained on material that they don’t have copyright for is a violation of copyright against every artist who’s art was used illegally to train the model. If the copyright holders of all the training data are compensated and have opt-in agreed to be used for training then, and only then would the output of the model be able to be used.

      • Flying Squid@lemmy.world
        link
        fedilink
        English
        arrow-up
        7
        arrow-down
        2
        ·
        10 months ago

        Are you actually suggesting that if I post a drawing of a dog, Disney should be allowed to use it in a movie and not compensate me?

        • Delta_V@midwest.social
          link
          fedilink
          English
          arrow-up
          3
          arrow-down
          3
          ·
          10 months ago

          Everyone should be assumed to be able to look at it, learn from it, and add your style to their artistic toolbox. That’s an intrinsic property of all art. When you put it on display, don’t be surprised or outraged when people or AIs look at it.