pytorch diffusers stable-diffusion text-to-image diffusion-models-class dreambooth-hackathon landscape

Dreambooth Hackaton 23': How can we use a text-to-image generative model to explore the cinematographic appeal of Torres del Paine ๐Ÿ‡จ๐Ÿ‡ฑ?

Torres del Paine National Park is a national park encompassing mountains, glaciers, lakes, and rivers in southern Chilean Patagonia. It is also part of the End of the World Route, a tourist scenic route. Wikipedia

<figure> <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/snowscat-H3oXiq7_bII-unsplash.jpg" alt="Torres del Paine Snowcatt photo, Unsplash"> <figcaption><a href="https://unsplash.com/@snowscat?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Snowscat</a>'s' Photo, <a href="https://unsplash.com/es/fotos/H3oXiq7_bII?utm_source=unsplash&utm_medium=referral&utm_content=creditCopyText">Unsplash</a>

</figcaption> </figure>

Description

DreamBooth model for the ppaine concept trained by alkzar90 on the alkzar90/torres-del-paine dataset.

This is a Stable Diffusion model fine-tuned on the ppaine concept with DreamBooth. It can be used by modifying the instance_prompt: a photo of ppaine landscape

This model was created as part of the DreamBooth Hackathon ๐Ÿ”ฅ. Visit the organisation page for instructions on how to take part!

This is a Stable Diffusion model fine-tuned on landscape images for the landscape theme.

Cinematographics rendering & Object/Artifacts insertion

<figure> <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-cinematographics.png" alt="Torres del Paine Landscape Model - Cinematographic Renderings/Artifacts Inmersion"> <figcaption>Figure 1: <b>Cinematographics renderings and object/artifacts insertions in the Chilean Torres del Paine national park</b>. Text prompts for generated images up-to-down rows and left-to-right; (i) <i>"The ppaine landscape in the middle earth, cinematic light, lord of the ring style, epic"</i>, (ii) <i>"The ppaine landscape in the middle earth, a visible dragon skeleton bones, cinematic light, lord of the ring style, epic"</i>, (iii) <i>"A long branches forest in the ppaine landscape, mountain peaks at the background, cinematic light, realistic, lord of the ring style, epic"</i>, (iv) <i>"A futuristic jeep riding in ppaine landscape, cinematic light, technology</i>, (v) <i>"A futuristic tensor airship flying over the ppaine landscape at night, NIKON-Z-FX"</i>, (vi) <i>"A huge tensor bridge in the ppaine landscape, cinematic light, majestic, architecture"</i>. </figcaption> </figure>

Object/Artifact insertions

<figure> <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/animal-statues/a-photo-of-an-ancient-stone-condor-statue-in-the-ppaine-landscape%2C-michaelangelo%2C-majestic%2C-NIKON-Z-FX%2C-28mm.png" alt="Condor statue in Torres del Paine landscape"> <figcaption>Figure 2-a: <b>Animal statues in the Chilean Torres del Paine national park</b>. Text prompts for the image: <i>"A photo of an ancient stone condor statue in the ppaine landscape, michaelangelo, majestic, NIKON-Z-FX, 28mm"</i>, </figcaption> </figure>

<figure> <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/animal-statues/a-photo-of-a-marble-huemul-statue-in-the-ppaine-landscape%2C-majestic%2C-michaelangelo%2C-NIKON-Z-FX%2C-28mm%20(1).png" alt="Huemul marble statue in Torres del Paine landscape"> <figcaption>Figure 2-b: <b>Animal statues in the Chilean Torres del Paine national park</b>. Text prompts for the image: <i>"A photo of a marble huemul statue in the ppaine landscape, majestic, michaelangelo, NIKON-Z-FX, 28mm"</i>, </figcaption> </figure>

<figure> <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/structures/a-panoramic-photo-of-ppaine-landscale-with-an-Eiffel-tower-in-the-middle%2C-NIKON-Z-FX%2C-realistic%2C-cinematic-light.png" alt="Eiffel tower in Torres del Paine landscape"> <figcaption>Figure 2-c: <b>Structures in the Chilean Torres del Paine national park</b>. Text prompts for the image: <i>"A panoramic photo of ppaine landscape with an Eiffel in the middle, NIKON-Z-FX, realistic, cinematic light"</i>, </figcaption> </figure>

Director's eye view

What does the director's cut concept mean? The definition by the Merriam-Webster dictionary is: "a version of a motion picture that is edited according to the director's wishes and that usually includes scenes cut from the version created for general distribution".

<figure> <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-wes-anderson-cut.png" alt="Torres del Paine Landscape Model - Wes Anderson's cut"> <figcaption>Figure 3: <b>Illustration of the director cuts of the Chilean Torres del Paine national park, in Wes Anderson's eyes</b>. Text prompts for generated images left-to-right; (i) <i>"The ppaine landscape, Wes Anderson style, cinematic light"</i>, (ii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>, (iii) <i>"The ppaine landscape with a small house in the middle, Wes Anderson style, fish eye"</i>. </figcaption> </figure>

Artistic Style Transfer

One way to monitor the fine-tuning process is to look at the model capabilities for transferring well-known artistic styles into the Torres del Paine landscape.

<figure> <img src="https://huggingface.co/alkzar90/ppaine-landscape/resolve/main/assets/dreambooth-hackaton-patagonia-landscape-painting.png" alt="Torres del Paine Landscape Model - Artist Style Painting"> <figcaption>Figure 4: <b>Artistic renderings of the Chilean Torres del Paine national park in the style of famous painters</b>. Text prompts for generated images up-to-down rows and left-to-right; (i) <i>"A painting of the ppaine landscape, Vincent Van Gogh style"</i>, (ii) <i>"A painting of the ppaine landscape, Michelangelo style"</i>, (iii) <i>"A painting of the ppaine landscape, Botero style"</i>, (iv) <i>"A painting of the ppaine landscape, Pierre-Auguste Renoir style"</i>, (v) <i>"A painting of the ppaine landscape, Leonardo Da Vinci style"</i>, (vi) <i>"A painting of the ppaine landscape, Rembrandt style"</i>. </figcaption> </figure>

Usage

from diffusers import StableDiffusionPipeline

pipeline = StableDiffusionPipeline.from_pretrained('alkzar90/ppaine-landscape')
image = pipeline().images[0]
image

References

Thanks to John Whitaker and Lewis Tunstall

Thanks to John Whitaker and Lewis Tunstall for writing out and describing the initial hackathon parameters at https://huggingface.co/dreambooth-hackathon.