DEV Community

Cover image for Comparison of Stable Diffusion XL (SDXL) 0.9 vs 1.0 For DreamBooth Training - Surprising Results
Furkan Gözükara
Furkan Gözükara

Posted on

Comparison of Stable Diffusion XL (SDXL) 0.9 vs 1.0 For DreamBooth Training - Surprising Results

You can download SDXL 0.9 from here : https://huggingface.co/stabilityai/stable-diffusion-xl-base-0.9/tree/main

SDXL 0.9 was the first released beta version of Stable Diffusion XL.

I have used Kohya GUI SS and the config I shared here for training : https://www.patreon.com/posts/89213064

Video of how to use config : https://youtu.be/EEV8RPohsbw

For training: 15 training images (show below), 140 repeat, 1 epoch (so total 15*140*2 = 4200 steps — takes less than 2 hours on RTX 3090 with 17 GB VRAM) and the real unsplash manually collected reg images from here : https://www.patreon.com/posts/massive-4k-woman-87700469 are used

Both for SDXL 0.9 and SDXL 1.0 exactly same training parameters and configuration used. For SDXL 0.9 I used the embedded VAE and for SDXL 1.0 I used the later released VAE which is supposed to be same as SDXL 0.9 VAE.

You can download original full resolution (6194 x 4034 pixels) and quality PNG images from attachments and see their PNG info (only PNG ones some failed so I uploaded as JPG) from Automatic1111 SD Web UI PNG info tab.

You can download full resolution images from here (public post don’t require membership) : https://www.patreon.com/posts/96924966

Prompt 1 PNG Info:

Medium shot photo of ohwx man wearing a very expensive suit in a studio with good lightning , hd, hdr, 2k, 4k, uhd
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: “model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]”, Version: v1.7.0

Image description

Prompt 2 PNG Info:

closeshot photo of ohwx man wearing a suit in a surreal outworldly garden, sunlight, hd, hdr, 2k, 4k, uhd
Negative prompt: sunglasses, cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: “model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]”, Version: v1.7.0

Image description

Prompt 3 PNG Info:

cinematic photo ohwx man riding dinosaur in a jungle with mud, sunny day shiny clear sky 35mm photograph,film,professional,4k,highly detailed
Negative prompt: sunglasses, cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: “model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]”, Version: v1.7.0

Image description

Prompt 4 PNG Info:

picture of (ohwx man) wearing a suit near a lake, simple flat color, 2 dimensional, flat 2d art style, cartoon
Negative prompt: photo, photograph, ugly, deformed, noisy, blurry, low contrast, realistic, distant shot, close shot, medium shot, 3d, cgi, render, studio shot, studio, shot, camera
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 3103186800, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: “picture of (ohwx man), simple flat color, 2 dimensional, flat 2d art style”, ADetailer confidence: 0.3, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: “model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]”, Version: v1.7.0

Image description

Prompt 5 PNG Info:

closeshot handsome photo of (ohwx man) (in a warrior armor ) in a coliseum, hdr, canon, hd, 8k, 4k, sharp focus
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 129509750, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: “model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]”, Version: v1.7.0

Image description

Prompt 6 PNG Info:

photo of warrior ohwx man with a pet dragon , epic, cinematic, sunlight, hd, hdr, 2k, 4k, uhd
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 2991427470, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: “model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]”, Version: v1.7.0

Image description

Prompt 7 PNG Info:

handsome portrait photo of (ohwx man) wearing a space armor on a space station, hdr, canon, hd, 8k, 4k, sharp focus
Negative prompt: cartoon, drawing, ugly, deformed, noisy, blurry, low contrast, realistic, 3d, cgi, render, anime, blender, graphic, drawing, digital art, sketch, line art, disfigured, mutated, abstract, 2d, minimalist, vintage, distorted, glitch, manga, Blurred, Hazy
Steps: 20, Sampler: DPM++ 2M SDE Karras, CFG scale: 7, Seed: 2897227315, Size: 1024x1024, Model hash: 1f6557fa7c, Model: 140_epoch_sdxl_0_9, ADetailer model: face_yolov8n.pt, ADetailer prompt: photo of ohwx man, ADetailer confidence: 0.3, ADetailer mask only top k largest: 1, ADetailer dilate erode: 4, ADetailer mask blur: 4, ADetailer denoising strength: 0.5, ADetailer inpaint only masked: True, ADetailer inpaint padding: 32, ADetailer use separate steps: True, ADetailer steps: 70, ADetailer version: 24.1.1, Script: X/Y/Z plot, X Type: Checkpoint name, X Values: “model\140_epoch_sdxl_0_9.safetensors [1f6557fa7c],model\140_epoch_sdxl_1_0.safetensors [cdaf2f236f]”, Version: v1.7.0

Image description

Top comments (2)

Collapse
 
sfleroy profile image
Leroy

So whats the comparison, two runs with the same model might just as well have resulted in the same images.

Collapse
 
furkangozukara profile image
Furkan Gözükara

The comparison is between SDXL 0.9 version and SDXL 1.0 version. you know 0.9 beta version?