models stabilityai stable diffusion xl refiner 1 0

stabilityai-stable-diffusion-xl-refiner-1-0

Overview

stabilityai/stable-diffusion-xl-refiner-1.0 employs an ensemble of expert modules in a pipeline for latent diffusion. The process involves using a base model to generate noisy latents, which are then refined using a specialized denoising model. The base model can function independently. Alternatively, a two-stage pipeline involves generating latents with the base model and then refining them using a high-resolution model and the SDEdit technique. The second approach is slightly slower due to more function evaluations.

The above summary was generated using ChatGPT. Review the original-model-card to understand the data used to train the model, evaluation metrics, license, intended uses, limitations and bias before using the model.

Inference samples

Inference type	Python sample (Notebook)	CLI with YAML
Real time	image-text-to-image-online-endpoint.ipynb	image-text-to-image-online-endpoint.sh
Batch	image-text-to-image-batch-endpoint.ipynb	image-text-to-image-batch-endpoint.sh

Inference with Azure AI Content Safety (AACS) samples

Inference type	Python sample (Notebook)
Real time	safe-image-text-to-image-online-endpoint.ipynb
Batch	safe-image-text-to-image-batch-endpoint.ipynb

Sample inputs and outputs (for real-time inference)

Sample input

{
   "input_data": {
        "columns": ["prompt", "image"],
        "data": [
            {
                "prompt": "Face of a yellow cat, high resolution, sitting on a park bench",
                "image": "image1",
            },
            {
                "prompt": "Face of a green cat, high resolution, sitting on a park bench",
                "image": "image2",
            }
        ],
        "index": [0, 1]
    }
}

Note:

"image1" and "image2" strings are base64 format.

Sample output

[
    {
        "prompt": "Face of a yellow cat, high resolution, sitting on a park bench",
        "generated_image": "generated_image1",
        "nsfw_content_detected": null
    },
    {
        "prompt": "Face of a green cat, high resolution, sitting on a park bench",
        "generated_image": "generated_image2",
        "nsfw_content_detected": null
    }
]

Note:

"generated_image1" and "generated_image2" strings are in base64 format.

The stabilityai-stable-diffusion-xl-refiner-1-0 model doesn't check for the NSFW content in generated image. We highly recommend to use the model with Azure AI Content Safety (AACS). Please refer sample online and batch notebooks for AACS integrated deployments.

Model inference: visualization for the prompt - "gandalf, lord of the rings, detailed, fantasy, cute, adorable, Pixar, Disney, 8k"

stabilityai-stable-diffusion-xl-refiner-1-0 input image and output visualization

Version: 1

Wiki menu

Home
Reference Documentation
- Components
- Data
- Environments
- Models
Contributing

Provide feedback

Saved searches

Use saved searches to filter your results more quickly