Skip to content

models stabilityai stable diffusion xl refiner 1 0

github-actions[bot] edited this page Nov 28, 2023 · 13 revisions

stabilityai-stable-diffusion-xl-refiner-1-0

Overview

stabilityai/stable-diffusion-xl-refiner-1.0 employs an ensemble of expert modules in a pipeline for latent diffusion. The process involves using a base model to generate noisy latents, which are then refined using a specialized denoising model. The base model can function independently. Alternatively, a two-stage pipeline involves generating latents with the base model and then refining them using a high-resolution model and the SDEdit technique. The second approach is slightly slower due to more function evaluations.

The above summary was generated using ChatGPT. Review the original-model-card to understand the data used to train the model, evaluation metrics, license, intended uses, limitations and bias before using the model.

Inference samples

Inference type Python sample (Notebook) CLI with YAML
Real time image-text-to-image-online-endpoint.ipynb image-text-to-image-online-endpoint.sh
Batch image-text-to-image-batch-endpoint.ipynb image-text-to-image-batch-endpoint.sh

Inference with Azure AI Content Safety (AACS) samples

Inference type Python sample (Notebook)
Real time safe-image-text-to-image-online-endpoint.ipynb
Batch safe-image-text-to-image-batch-endpoint.ipynb

Sample inputs and outputs (for real-time inference)

Sample input

{
   "input_data": {
        "columns": ["prompt", "image"],
        "data": [
            {
                "prompt": "Face of a yellow cat, high resolution, sitting on a park bench",
                "image": "image1",
            },
            {
                "prompt": "Face of a green cat, high resolution, sitting on a park bench",
                "image": "image2",
            }
        ],
        "index": [0, 1]
    }
}

Note:

  • "image1" and "image2" strings are base64 format.

Sample output

[
    {
        "prompt": "Face of a yellow cat, high resolution, sitting on a park bench",
        "generated_image": "generated_image1",
        "nsfw_content_detected": null
    },
    {
        "prompt": "Face of a green cat, high resolution, sitting on a park bench",
        "generated_image": "generated_image2",
        "nsfw_content_detected": null
    }
]

Note:

  • "generated_image1" and "generated_image2" strings are in base64 format.
  • The stabilityai-stable-diffusion-xl-refiner-1-0 model doesn't check for the NSFW content in generated image. We highly recommend to use the model with Azure AI Content Safety (AACS). Please refer sample online and batch notebooks for AACS integrated deployments.

Model inference: visualization for the prompt - "gandalf, lord of the rings, detailed, fantasy, cute, adorable, Pixar, Disney, 8k"

stabilityai-stable-diffusion-xl-refiner-1-0 input image and output visualization

Version: 1

Tags

Preview license : creativeml-openrail++-m task : image-to-image

View in Studio: https://ml.azure.com/registries/azureml/models/stabilityai-stable-diffusion-xl-refiner-1-0/version/1

License: creativeml-openrail++-m

Properties

SHA: 5d4cfe854c9a9a87939ff3653551c2b3c99a4356

datasets: LAION-5B

inference-min-sku-spec: 4|1|28|176

inference-recommended-sku: Standard_NC6s_v3, Standard_NC12s_v3, Standard_NC24s_v3, Standard_NC24rs_v3, Standard_NC16as_T4_v3, Standard_NC24ads_A100_v4, Standard_NC48ads_A100_v4, Standard_NC4as_T4_v3, Standard_NC64as_T4_v3, Standard_NC8as_T4_v3, Standard_NC96ads_A100_v4, Standard_ND40rs_v2, Standard_ND96amsr_A100_v4, Standard_ND96asr_v4

model_id: stabilityai/stable-diffusion-xl-refiner-1.0

Clone this wiki locally