without you ever knowing about it, as well as (perhaps) swapping in a cheaper-to-operate model some percentage of the time, perhaps as request loads peak, hoping you’ll just roll the dice and try again.

  • K3zi4@lemmy.world
    link
    fedilink
    English
    arrow-up
    3
    ·
    2 months ago

    I’m still unsure why anyone would pay for AI image generation purely because of the trial and error it takes. I get that not everyone has a GPU that can do it, but I use stable diffusion through automatic 1111 and I’ll likely be about 2-300 generations of text to image, image to image, some inpainting and editing, then some more image to image and upscaling before I get a representation of what’s in my head down.

    I love the process of it all, but paying for tokens would completely limit me. Is there a specific reason that people use paid models? Or is it just because a lot of people are limited by their gpu?

    • Doombot1@lemmy.one
      link
      fedilink
      arrow-up
      1
      ·
      2 months ago

      GPU limiting and a general lack of either knowledge or wanting to put in the effort to do it themselves. Even just going into Github in the first place is enough of a barrier for a lot of people, unfortunately

      • Iceblade@lemmy.world
        link
        fedilink
        arrow-up
        2
        ·
        2 months ago

        Yep, I’ve a mobile 3070 in my laptop, and whilst I feasibly could run some of the smallest models around, paying on a per-use basis gets me way better quality results for relatively cheaply.

        Besides, running it locally isn’t free either. Your hardware deprecates and depreciates over time, in addition to non-negligible power costs.

      • Doombot1@lemmy.one
        link
        fedilink
        arrow-up
        1
        ·
        2 months ago

        Easiest GitHub install I have ever done - works well on both Linux and windows per my own testing. Go onto the Automatic1111 stable-diffusion-webui github and find the git clone link, and then, from a command shell, use “cd” to get into your directory of choice and then do a “git clone <link>”. After that, go into said folder and literally just run the webui.bat/.sh file (might be called webui or might be called setup, can’t remember). That’s it - it should install all of the packages and python stuff and everything for you, and from there, honestly just fuck around with it for a while. On Linux, I had to install a few extra things/screw around a bit to get it working fully, but it wasn’t too tough. I highly recommend also installing the “dynamic prompts” extension and playing around with wildcards and such. Civitai has some great models and LoRAs you can play around with, too - Dreamshaper is a great one. If you’ve not got a 4000-series GPU, I would suggest editing the webui-user.bat/sh file and adding “—medvram” or “—lowvram” to help you get higher res images at the cost of a little bit of speed. Let me know if you’ve got any more questions! If nothing else, it’s honestly just a lot of fun to use.

        • Sakychu@lemmy.world
          link
          fedilink
          arrow-up
          1
          ·
          2 months ago

          I didn’t expect such a detailed answer: Thank you very much! I will definitely check it out! And I will definitely come back here if I have questions!