stable-diffusion stable-diffusion-diffusers diffusers text-to-image image-to-image art artistic dreambooth

spop style

This model features four different concepts: humans, outer space, forests, and landscapes in the specific style of SPOP: She-Ra and the Princesses of Power, the Dreamworks version.

This is a fine-tuned Stable Diffusion model, based on SD 1.5.

The goal of this model is to capture the style - not the individual characters featured in the series.

๐Ÿ’– Disclaimer: This is my favorite show. I won't go into that here but a lot of love went into this model.

Detailed Samples

Detailed Samples

Model Usage

This model was trained on multiple concepts. Use the tokens below:

Token Description
๐Ÿ‘ค dwspop style Uses concepts trained on people
๐ŸŒŒ dwspop space Uses concepts trained on outer space
๐ŸŒฒ dwspop forest Uses concepts trained on forests
๐ŸŒ„ dwspop landscape Uses concepts trained on landscapes

๐Ÿ‘ค dwspop style examples

Detailed Samples

This token is capable of handling multiple genders and uses person which can be then used for woman, man, or cat-like woman, or even lizard, dog, snoop dog... it's awesome:

โ›” Negative prompt: ((out of focus body)), ((out of focus face)), ((((ugly)))), (((duplicate))), ((morbid)), ((mutilated)), [out of frame], extra fingers, mutated hands, ((poorly drawn hands)), ((poorly drawn face)), (((mutation))), (((deformed))), ((ugly)), blurry, ((bad anatomy)), (((bad proportions))), ((extra limbs)), cloned face, (((disfigured))), out of frame, ugly, extra limbs, (bad anatomy), gross proportions, (malformed limbs), ((missing arms)), ((missing legs)), (((extra arms))), (((extra legs))), mutated hands, (fused fingers), (too many fingers), (((long neck)))

๐ŸŒŒ dwspop space examples

Detailed Samples

The space token is versatile when prompting, especially when generating galaxies and solar systems. This token is capable of handling different camera angles by desribing in your prompts as a scene.

โ›” Negative prompt: ((out of focus face)), (((duplicate))), [out of frame], blurry, out of frame, ugly, blur, motion blur

๐ŸŒฒ dwspop forest forest examples

Detailed Samples

The forest token is able to generate random forest scenes due to the regularization images that were used. When prompting, additional enviromental objects are supported, such as crystals, rocks, flowers, cottage. Finally, mix in time of day: sunrise, dawn, sunset, evening.

โ›” Negative prompt: ((out of focus face)), (((duplicate))), [out of frame], blurry, out of frame, ugly, blur, motion blur

๐ŸŒ„ dwspop landscape examples:

Detailed Samples

The landscape token is primarly for landscapes but also supports a small percentage of architecture. Blending your prompts to have both an establishing shot of a landscape with architecture woven in and out is where this token shines.

โ›” Negative prompt: ((out of focus face)), (((duplicate))), [out of frame], blurry, out of frame, ugly, blur, motion blur


๐Ÿงจ Diffusers

This model can be used just like any other Stable Diffusion model. For more information, see Stable Diffusion.

Export the model:

from diffusers import StableDiffusionPipeline
import torch

model_id = "zuleo/spop"
pipe = StableDiffusionPipeline.from_pretrained(model_id, torch_dtype=torch.float16)
pipe = pipe.to("cuda")

prompt = "Perfectly-centered close up portrait-photograph of a person, marketplace in the background, sunrise, dwspop style"
image = pipe(prompt).images[0]

image.save("./spop_person.png")

Detailed Samples

๐Ÿ“… text2img Range Grids

It's always great to get a visual of what's going on with sampler, CFG scale, and other settings. See the examples below and tune them to your liking.

Sampler

Using different samplers can produce different results. My favorites are using DPM++ 2S a Karras, DPM++ SDE Karras, DPM adaptive for cartoon scenes.

๐Ÿ”ฅ DPM Adaptive: DPM Adaptive does not use steps. This sampler is fixed depending on the CFG scale and additional configurations.

View the XY grids below for details:

Sampling Steps for person

Values between 25 - 38 is a good range for most samplers but not all. See the Sampling Steps grid with each sampler below:

Sampling Steps Grid

CFG Scale

Values between 7 - 11 is a good range. See the CFG Scale grid:

CFG Scale Grid


๐Ÿ“… img2img Grids

This model works with img2img with a balanced configuration between CFG scale, denoising, and adding more detail with sampling steps.

Denoising & Steps

Steps: 39 - 46, Denoising: 0.49 - 0.6:

Samplers & Denoising

Samplers: all, Denoising: 0.6 - 0.7:

Samplers & CFG Scale

Samplers: all, CFG Scale: 7.0 - 11.0:


๐ŸŒ Regularization images

If you would like to use the regularization images from this training, see the datasets below:


โ˜• If you enjoy this model, buy me a coffee Buy a coffee