Model Card: Stable-Cats-Generator

Model Information

Model Description

Stable-Cats-Generator is an image generation model fine-tuned to generate images of white cats from text prompts. It is built on Stable Diffusion v2 and uses a pretrained text encoder (OpenCLIP-ViT/H) for text-to-image generation. Stable Diffusion v2, released in November 2022, follows the same core principles as the original Stable Diffusion model but includes a number of improvements.

Intended Use

Potential Use Cases

Model Capabilities

Limitations

Ethical Considerations

Responsible AI

Disclaimer

This model card serves as a documentation tool and does not constitute legal or ethical guidance. Users of the model are responsible for adhering to ethical and legal standards in their use of the model.

Usage

Install the dependencies:

pip install diffusers==0.11.1
pip install transformers scipy ftfy accelerate

Then load the pipeline and generate an image:

import torch
from diffusers import StableDiffusionPipeline

pipe = StableDiffusionPipeline.from_pretrained("ayoubkirouane/Stable-Cats-Generator", torch_dtype=torch.float16)
pipe = pipe.to("cuda")

prompt = "A photo of a picture-perfect white cat."
image = pipe(prompt).images[0]  # a PIL image (https://pillow.readthedocs.io/en/stable/)

# To keep the image, save it to disk:
image.save("cat.png")

# In a notebook such as Google Colab, evaluating the variable displays it inline:
image
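
The snippet above generates a single image with default settings. As a minimal sketch, and not part of the original model card, the helper below shows one way to generate several images reproducibly. The function names build_prompts and generate, the prompt template, and the default values for steps and guidance_scale are illustrative assumptions; the pipeline arguments (num_inference_steps, guidance_scale, generator) are standard StableDiffusionPipeline parameters.

```python
from typing import List

# Model ID taken from the usage snippet above.
MODEL_ID = "ayoubkirouane/Stable-Cats-Generator"


def build_prompts(settings: List[str], subject: str = "a white cat") -> List[str]:
    """Return one prompt per non-empty setting (hypothetical prompt template)."""
    return [
        f"A photo of {subject} {setting.strip()}."
        for setting in settings
        if setting.strip()
    ]


def generate(prompts: List[str], seed: int = 0, steps: int = 30,
             guidance_scale: float = 7.5):
    """Generate one PIL image per prompt. Defaults are illustrative assumptions."""
    # Heavy imports are deferred so build_prompts works without torch installed.
    import torch
    from diffusers import StableDiffusionPipeline

    # Use half precision only on CUDA; fall back to float32 on CPU.
    device = "cuda" if torch.cuda.is_available() else "cpu"
    dtype = torch.float16 if device == "cuda" else torch.float32

    pipe = StableDiffusionPipeline.from_pretrained(MODEL_ID, torch_dtype=dtype)
    pipe = pipe.to(device)

    # A seeded generator should make repeated runs on the same setup reproducible.
    generator = torch.Generator(device=device).manual_seed(seed)
    result = pipe(
        prompts,
        num_inference_steps=steps,
        guidance_scale=guidance_scale,
        generator=generator,
    )
    return result.images
```

For example, generate(build_prompts(["on a sofa", "in the snow"]), seed=42) should return two PIL images, each of which can be saved with image.save(...) as shown earlier. Generation on CPU works with this fallback but is likely to be slow.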