Stability.ai released a new model with a new architecture,
Stable Cascade. They claim it gives better prompt adherence and results than the SDXL base model and after fooling around with a bit it seems that's true, although it still struggles with hands and text. While I eventually got it to do simple prompts like show hands, point at the viewer, clap, and give a thumbs up I couldn't get it to do an OK sign or give a thumbs down. Text was better but still had spelling mistakes.
I was more impressed when tried out some wildcards like a woman driving a car made of bread, Gal Gadot with spoons sticking out of her ears, and Trump diving into a pool of cottage cheese.
NSFW generation is, well, bad, but that's to be expected.
Is this going to replace SDXL and become the new base model everyone will train on or will it be like 2.0 and quickly forgotten by the community? No idea honestly.