If they charge by the image, and there is any expectation that AI image-generators might produce garbage results, then AI providers are incentivized to make you pay for their experimentation

xia@lemmy.sdf.org · 2 months ago

If they charge by the image, and there is any expectation that AI image-generators might produce garbage results, then AI providers are incentivized to make you pay for their experimentation

K3zi4@lemmy.world · 2 months ago

I’m still unsure why anyone would pay for AI image generation purely because of the trial and error it takes. I get that not everyone has a GPU that can do it, but I use stable diffusion through automatic 1111 and I’ll likely be about 2-300 generations of text to image, image to image, some inpainting and editing, then some more image to image and upscaling before I get a representation of what’s in my head down.

I love the process of it all, but paying for tokens would completely limit me. Is there a specific reason that people use paid models? Or is it just because a lot of people are limited by their gpu?

Doombot1@lemmy.one · 2 months ago

GPU limiting and a general lack of either knowledge or wanting to put in the effort to do it themselves. Even just going into Github in the first place is enough of a barrier for a lot of people, unfortunately

Iceblade@lemmy.world · 2 months ago

Yep, I’ve a mobile 3070 in my laptop, and whilst I feasibly could run some of the smallest models around, paying on a per-use basis gets me way better quality results for relatively cheaply.

Besides, running it locally isn’t free either. Your hardware deprecates and depreciates over time, in addition to non-negligible power costs.

Sakychu@lemmy.world · 2 months ago

Sounds interesting! Do you maybe mind sharing some resources to get started with that?

Doombot1@lemmy.one · 2 months ago

Easiest GitHub install I have ever done - works well on both Linux and windows per my own testing. Go onto the Automatic1111 stable-diffusion-webui github and find the git clone link, and then, from a command shell, use “cd” to get into your directory of choice and then do a “git clone <link>”. After that, go into said folder and literally just run the webui.bat/.sh file (might be called webui or might be called setup, can’t remember). That’s it - it should install all of the packages and python stuff and everything for you, and from there, honestly just fuck around with it for a while. On Linux, I had to install a few extra things/screw around a bit to get it working fully, but it wasn’t too tough. I highly recommend also installing the “dynamic prompts” extension and playing around with wildcards and such. Civitai has some great models and LoRAs you can play around with, too - Dreamshaper is a great one. If you’ve not got a 4000-series GPU, I would suggest editing the webui-user.bat/sh file and adding “—medvram” or “—lowvram” to help you get higher res images at the cost of a little bit of speed. Let me know if you’ve got any more questions! If nothing else, it’s honestly just a lot of fun to use.

Sakychu@lemmy.world · 2 months ago

I didn’t expect such a detailed answer: Thank you very much! I will definitely check it out! And I will definitely come back here if I have questions!

Doombot1@lemmy.one · 2 months ago

Sure thing! I tend to over explain things anyways, lol