A dramatic image of a crow flying with spread wings displaying fiery orange feathers against a moody sky background.
A robotic terminator covered in dice patterns stands on a glowing lava floor surrounded by scattered dice in a surrealistic hellish cave.
Close-up black and white image of female parted lips with teeth visible, overlaid by abstract interference patterns.
Black and white cityscape showing silhouettes of people walking through a foggy urban environment with tall buildings in the background.
Close-up view of a curious alien with large reflective eyes, detailed alien skin texture, standing among alien flora with mountains in the background under a wide angle lens and film grain effect.
Close-up view of a highly detailed alien face with large reflective eyes showing an alien landscape, captured by an interstellar probe with film grain effect.
A hyper realistic portrait of a sculptural young redhead woman with curly hair, outdoors in a dreamy panorama with a blurred barren landscape in the background.
Close-up hyper realistic image of a green eye surrounded by freckles, with red ginger hair and black painted lips.
Portrait of a woman with long red hair, freckles on white skin, light green eyes, black lips, and intricate detailing in a hyper-realistic style.

Recommended Parameters

resolution: 525x525

Tips

The model is intended for research purposes, including artwork generation, educational and creative tools, and research on the safe deployment of models that can generate harmful content.

It is not intended to generate factual or true depictions of people or events.

Limitations include imperfect photorealism, inability to render legible text, challenges with compositional prompts, and possible improper face generation.

The model uses two pretrained text encoders: OpenCLIP-ViT/G and CLIP-ViT/L.

The two-step pipeline includes base latent generation followed by high-resolution refinement using SDEdit (img2img).


Originally Posted to Hugging Face and shared here with permission from Stability AI.


SDXL consists of a two-step pipeline for latent diffusion: First, we use a base model to generate latents of the desired output size. In the second step, we use a specialized high-resolution model and apply a technique called SDEdit (https://arxiv.org/abs/2108.01073, also known as "img2img") to the latents generated in the first step, using the same prompt.
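The two steps described above can be sketched with the Hugging Face diffusers library. This is a minimal sketch under stated assumptions, not the reference implementation: the repository IDs, the `fp16` variant, the default `strength` of 0.3, and the helper names `refiner_denoising_steps` and `generate_two_step` are not given on this page. The step-count rule in the helper reflects how diffusers img2img pipelines appear to interpret `strength` and should be treated as an assumption.

```python
# Assumed Hugging Face repository IDs; they are not stated on this page.
BASE_ID = "stabilityai/stable-diffusion-xl-base-1.0"
REFINER_ID = "stabilityai/stable-diffusion-xl-refiner-1.0"


def refiner_denoising_steps(num_inference_steps: int, strength: float) -> int:
    """Number of steps the img2img stage actually runs (assumed rule)."""
    return min(int(num_inference_steps * strength), num_inference_steps)


def generate_two_step(prompt: str, strength: float = 0.3, device: str = "cuda"):
    """Generate base latents, then refine them with img2img (SDEdit)."""
    # Imported lazily so this module loads without torch/diffusers installed.
    import torch
    from diffusers import (
        StableDiffusionXLImg2ImgPipeline,
        StableDiffusionXLPipeline,
    )

    base = StableDiffusionXLPipeline.from_pretrained(
        BASE_ID, torch_dtype=torch.float16, variant="fp16", use_safetensors=True
    ).to(device)
    refiner = StableDiffusionXLImg2ImgPipeline.from_pretrained(
        REFINER_ID, torch_dtype=torch.float16, variant="fp16", use_safetensors=True
    ).to(device)

    # Step 1: the base model returns latents instead of a decoded image.
    latents = base(prompt=prompt, output_type="latent").images

    # Step 2: the refiner partially re-noises the latents according to
    # `strength` and denoises them again using the same prompt.
    return refiner(prompt=prompt, image=latents, strength=strength).images[0]
```

Calling `generate_two_step("a crow with fiery orange feathers")` should return a PIL image that can be saved with `.save("out.png")`. It needs a CUDA-capable GPU and downloads both checkpoints on first use. A lower `strength` keeps the refined result closer to the base latents.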

Model Description

  • Developed by: Stability AI

  • Model type: Diffusion-based text-to-image generative model

  • Model Description: This is a model that can be used to generate and modify images based on text prompts. It is a Latent Diffusion Model that uses two fixed, pretrained text encoders (OpenCLIP-ViT/G and CLIP-ViT/L).

  • Resources for more information: GitHub Repository.
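As a rough illustration of how two text encoders can condition a single diffusion model: the SDXL technical report (not this page) describes concatenating the per-token outputs of both encoders along the channel axis. The sketch below shows only that shape bookkeeping in plain Python. The widths 768 and 1280 and the 77-token length are assumptions taken from that report, the function name is hypothetical, and the zero-filled lists stand in for real encoder outputs.

```python
CLIP_VIT_L_WIDTH = 768       # assumed per-token width of CLIP-ViT/L
OPENCLIP_VIT_G_WIDTH = 1280  # assumed per-token width of OpenCLIP-ViT/G
NUM_TOKENS = 77              # assumed prompt length after padding


def concat_token_embeddings(emb_l, emb_g):
    """Join two per-token embedding sequences along the channel axis."""
    if len(emb_l) != len(emb_g):
        raise ValueError("both encoders must return the same number of tokens")
    return [tok_l + tok_g for tok_l, tok_g in zip(emb_l, emb_g)]


# Placeholder outputs: one vector per token from each encoder.
emb_l = [[0.0] * CLIP_VIT_L_WIDTH for _ in range(NUM_TOKENS)]
emb_g = [[0.0] * OPENCLIP_VIT_G_WIDTH for _ in range(NUM_TOKENS)]

context = concat_token_embeddings(emb_l, emb_g)
print(len(context), len(context[0]))  # 77 2048
```

Under these assumptions, each of the 77 token positions carries a 2048-wide vector that the diffusion model can attend to.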

Model Sources

Uses

Direct Use

The model is intended for research purposes only. Possible research areas and tasks include:

  • Generation of artworks and use in design and other artistic processes.

  • Applications in educational or creative tools.

  • Research on generative models.

  • Safe deployment of models which have the potential to generate harmful content.

  • Probing and understanding the limitations and biases of generative models.

Excluded uses are described below.

Out-of-Scope Use

The model was not trained to produce factual or true representations of people or events, so using it to generate such content is out of scope for its abilities.

Limitations and Bias

Limitations

  • The model does not achieve perfect photorealism.

  • The model cannot render legible text.

  • The model struggles with more difficult tasks that involve compositionality, such as rendering an image corresponding to “A red cube on top of a blue sphere”.

  • Faces and people in general may not be generated properly.

  • The autoencoding part of the model is lossy.

Bias

While the capabilities of image generation models are impressive, they can also reinforce or exacerbate social biases.

The chart above evaluates user preference for SDXL (with and without refinement) over Stable Diffusion 1.5 and 2.1. The SDXL base model performs significantly better than the previous variants, and the model combined with the refinement module achieves the best overall performance.


Model Details

Model type: Checkpoint

Base model: SDXL 1.0

Model version: v1.0

Model hash: 31e35c80fc


