-
Notifications
You must be signed in to change notification settings - Fork 130
models stabilityai stable diffusion xl refiner 1 0
stabilityai/stable-diffusion-xl-refiner-1.0 employs an ensemble of expert modules in a pipeline for latent diffusion. The process involves using a base model to generate noisy latents, which are then refined using a specialized denoising model. The base model can function independently. Alternatively, a two-stage pipeline involves generating latents with the base model and then refining them using a high-resolution model and the SDEdit technique. The second approach is slightly slower due to more function evaluations.
The above summary was generated using ChatGPT. Review the original-model-card to understand the data used to train the model, evaluation metrics, license, intended uses, limitations and bias before using the model.
Inference type | Python sample (Notebook) | CLI with YAML |
---|---|---|
Real time | image-text-to-image-online-endpoint.ipynb | image-text-to-image-online-endpoint.sh |
Batch | image-text-to-image-batch-endpoint.ipynb | image-text-to-image-batch-endpoint.sh |
Inference with Azure AI Content Safety (AACS) samples
Inference type | Python sample (Notebook) |
---|---|
Real time | safe-image-text-to-image-online-endpoint.ipynb |
Batch | safe-image-text-to-image-batch-endpoint.ipynb |
{
"input_data": {
"columns": ["prompt", "image"],
"data": [
{
"prompt": "Face of a yellow cat, high resolution, sitting on a park bench",
"image": "image1",
},
{
"prompt": "Face of a green cat, high resolution, sitting on a park bench",
"image": "image2",
}
],
"index": [0, 1]
}
}
Note:
- "image1" and "image2" strings are base64 format.
[
{
"prompt": "Face of a yellow cat, high resolution, sitting on a park bench",
"generated_image": "generated_image1",
"nsfw_content_detected": null
},
{
"prompt": "Face of a green cat, high resolution, sitting on a park bench",
"generated_image": "generated_image2",
"nsfw_content_detected": null
}
]
Note:
- "generated_image1" and "generated_image2" strings are in base64 format.
- The
stabilityai-stable-diffusion-xl-refiner-1-0
model doesn't check for the NSFW content in generated image. We highly recommend to use the model with Azure AI Content Safety (AACS). Please refer sample online and batch notebooks for AACS integrated deployments.
Model inference: visualization for the prompt - "gandalf, lord of the rings, detailed, fantasy, cute, adorable, Pixar, Disney, 8k"
Version: 1
Preview
license : creativeml-openrail++-m
task : image-to-image
View in Studio: https://ml.azure.com/registries/azureml/models/stabilityai-stable-diffusion-xl-refiner-1-0/version/1
License: creativeml-openrail++-m
SHA: 5d4cfe854c9a9a87939ff3653551c2b3c99a4356
datasets: LAION-5B
inference-min-sku-spec: 4|1|28|176
inference-recommended-sku: Standard_NC6s_v3, Standard_NC12s_v3, Standard_NC24s_v3, Standard_NC24rs_v3, Standard_NC16as_T4_v3, Standard_NC24ads_A100_v4, Standard_NC48ads_A100_v4, Standard_NC4as_T4_v3, Standard_NC64as_T4_v3, Standard_NC8as_T4_v3, Standard_NC96ads_A100_v4, Standard_ND40rs_v2, Standard_ND96amsr_A100_v4, Standard_ND96asr_v4
model_id: stabilityai/stable-diffusion-xl-refiner-1.0