coreml stable-diffusion text-to-image not-for-all-eyes

Core ML Converted Model:

QGO-10b:

Source(s): CivitAI<br>

The biggest difference here has not been the merge, but just cutting down on negative prompt. This can have amazing results for realism, though you may run into things that are a little too real. Be warned ;)

It uses RPGv4 instead of v3. It also involves Latex to get some of the bondage model back in as well..

There may also be traces of the Middle Finger and Gun2Head Pose LORAs in the examples.

Advisable to use Hires.fix with the following (or similar) settings:

Upscaler: ESRGAN_4x (NMKD superscale can be a bit sharper, which is nice for smaller upscales)

Upscale by: 1.1~2.0 (whatever suits your purpose)

Denoising strength: 0.3 (or 0.5~0.7 if you don't mind changes from the base image)

You can of course generate low resolution versions first, and pick out the ones you enter into the upscale process (saves a lot of time). I really takes the amount of detail, especially of the face and eyes, to the next level.

This model aims for photorealism at higher resolutions, and large variation in poses, settings and genres. (And having a lot of fun filling half the prompt with wildcards).

I will still be tweaking it in the near future, to eliminate unwanted outputs and/or increase possibilities. Let me know if you run into anything that seems off.

Shout-out to the Unstable Diffusion Discord, where all the cool people share their gems in #photorealistic<br><br>

image

image

image

image