Stable Diffusion XL Turbo can generate AI images as fast as you can type

thehatfox@lemmy.world · 1 year ago

Stable Diffusion XL Turbo can generate AI images as fast as you can type

ubermeisters@lemmy.world · edit-2 1 year ago

Seems like a bit of a stretch to call 4 seconds per frame, on a 3060, “realtime” / “as fast as you can type”.

Domi@lemmy.secnd.me · 1 year ago

I tried it on a 6900 XT recently and generation time was well under half a second.

Results are not as good as with SDXL but for the time it needs it’s very impressive.

Paradox@lemdro.id · 1 year ago

The author can’t type very quickly

barsoap@lemm.ee · 1 year ago

I’d guess that the ‘realtime’ is a quote from StabilityAI and of course they’re running that stuff on an A100. A couple of seconds is still interactive rate as generally speaking you want to think about the changes you’re making to your conditioning.

Haven’t tried yet but if individual steps of XL Turbo take ballpark as much time as LCM steps then… well, it’s four to eight times faster. As quality generally isn’t production-ready we’re generally speaking about rough prompt prototyping, testing out an animation pipeline, such stuff, but that has the caveat that increasing step size often leads to markedly different results (complete change of composition, not just details) so the information you gain from those preview-quality images is limited.

Oh, “production ready quality”: image quality being roughly en par with 4-step LCM means that it’s nowhere near production grade. For the final render you still want to give the model more steps. OTOH I’ve found that some LCM-based merges do in 30 steps what other models need 80 steps for so improvements are always welcome. But I’m also worried about these distilled models being less flexible, pruning only slightly trodden paths that you actually might want the model to take.

simple@lemm.ee · 1 year ago

Well, it is technically as fast as you can type if you’re running a better GPU. The 3060 is pretty mid-tier at this point.

TheGrandNagus@lemmy.world · edit-2 1 year ago

Low end card.

I’ll get crucified for saying that because people will interpret that as an attack on their PC or something daft like that. It’s not.

It’s Ampere, a GPU architecture from 3.5 years ago. And even then, here’s what the desktop stack was like:

3090 Ti (GA102)
3090 (GA102)
3080 Ti (GA102)
3080 12GB (GA102)
3080 (GA102)
3070 Ti (GA102/GA104)
3070 (GA104)
3060 Ti (GA104/GA103)
3060 (GA106/GA104)
3050 (GA106/GA107)

It was almost at the bottom of Nvidia’s stack 3 years ago. It was a low end card then (because, you know, it was at the bottom end of what they were offering). It’s an even more low end card now.

People are always fooled by Nvidia’s marketing and thinking they’re getting a mid range card when in reality Nvidia’s giving people the scraps and pretending they’re giving you a great deal. People need to demand more from these companies.

Nvidia takes a low end card, slaps a $400 price tag on it, calls it mid range, and people lap it up every time.

Blackmist@feddit.uk · 1 year ago

The pricing makes it a mid range card, because the budget end is just gone these days.

TheGrandNagus@lemmy.world · edit-2 1 year ago

Nvidia conning people into paying what used to be mid range/high end pricing for a low end card does not make it a low end card.

The 3060 was always a low end card. Because it was on the low end of the product stack, both for Nvidia and against AMD.

tillary@sh.itjust.works · 1 year ago

deleted by creator