computer vision stable-diffusion stable-diffusion-2-1 photography photoreal

Deprecation notice

This model was a research project that is deprecated in favour of ptx0/pseudo-flex-base

Capabilities

This model is capable of producing photorealistic images of people.

It retains much of the base 2.1-v model knowledge, as its text encoder is minimally tuned.

Limitations

This model does not produce perfect results every time.

This model cannot reproduce most real people. Instead, it makes "Derp-a-Like" equivalents to real people, which I prefer.

This model is not great at abstract imagery or digital art, though it certainly can produce a variety of amazing art styles.

Dataset

Training parameters

Training goals

Observations

Future work

This model inspired the search for a solution to the proliferation issue that led me to ttj/flex-diffusion-2-1, which led to the creation of ptx0/pseudo-flex-base, another photoreal model with multiple aspect support.

This model was trained purely on 768x768 square images, which were randomly resized and cropped. It can produce some higher resolution landscapes, but it cannot reliably do higher resolution subjects without deformities.