Add SD3.5 large and large turbo presets #1960

james77777778 · 2024-10-29T05:37:50Z

SD3.5 large and large turbo share the same model architecture, with the main difference being the use of QK RMS normalization, which SD3 medium doesn't.

This PR implements QR RMS normalization in MMDiT. Additionally, the inference pipeline differs in turbo version, as it doesn't require classifier-free guidance, so I have made some changes to the APIs.

The presets have been uploaded to kaggle/kerashub path. Let me know if any changes are needed.

@divyashreepathihalli @mattdangerw

Prompt	Large	Large Turbo
"A cat holding a sign that says hello world"

Parameters of generate (ref: huggingface/diffusers)

Large: num_steps=40, guidance_scale=4.5
Large turbo: num_steps=4, guidance_scale=None (much faster that large version)

divyashreepathihalli

LGTM

james77777778 added 2 commits October 29, 2024 13:28

Add SD3.5 large and large turbo

685f0fb

Fix model cards

5923867

divyashreepathihalli added the kokoro:force-run Runs Tests on GPU label Oct 29, 2024

kokoro-team removed the kokoro:force-run Runs Tests on GPU label Oct 29, 2024

divyashreepathihalli approved these changes Oct 29, 2024

View reviewed changes

divyashreepathihalli merged commit 991bced into keras-team:master Oct 29, 2024
10 checks passed

james77777778 deleted the add-sd3-large branch October 30, 2024 01:22

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Add SD3.5 large and large turbo presets #1960

Add SD3.5 large and large turbo presets #1960

james77777778 commented Oct 29, 2024 •

edited

Loading

divyashreepathihalli left a comment

Add SD3.5 large and large turbo presets #1960

Add SD3.5 large and large turbo presets #1960

Conversation

james77777778 commented Oct 29, 2024 • edited Loading

divyashreepathihalli left a comment

Choose a reason for hiding this comment

james77777778 commented Oct 29, 2024 •

edited

Loading