
Allow models to run without all text encoder(s) #645


Open
wants to merge 3 commits into master

Conversation

stduhpf
Contributor

@stduhpf stduhpf commented Apr 2, 2025

For now, only Flux and SD3.x are supported.

Instead of crashing when text encoders are missing, this just prints a warning and proceeds without them.

TODO:

  • Re-enable GPU prompt processing if T5 isn't actually used
  • Support UNet models (SDXL?)

Comparisons:

  • Using clip_l/clip_g q8_0 and t5xxl q4_k.
  • 5 steps
  • 1024x1024
  • default seed
  • cfg 1
  • default guidance
  • tiled vae
  • prompt: 'Illustration of a cute cat holding a sign saying "You do not need all text encoders!"'
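For reference, a run like the ones above could look roughly like this with sd.cpp's CLI, simply omitting `--t5xxl` (paths are placeholders; flag names are as I recall them from stable-diffusion.cpp and may differ slightly):

```shell
# Hypothetical invocation: only clip_l and clip_g supplied; with this PR
# the missing t5xxl triggers a warning instead of an abort.
./sd --diffusion-model sd3.5_large_turbo-iq4_nl.gguf \
     --clip_l clip_l-q8_0.gguf \
     --clip_g clip_g-q8_0.gguf \
     --vae-tiling \
     --cfg-scale 1.0 --steps 5 -H 1024 -W 1024 \
     -p 'Illustration of a cute cat holding a sign saying "You do not need all text encoders!"'
```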

SD3.5 Large Turbo (iq4_nl):

With t5_xxl:

|                | with clip_l | without clip_l |
|----------------|-------------|----------------|
| with clip_g    | 3.5lt-all   | 3.5lt-nocl     |
| without clip_g | 3.5lt-nocg  | 3.5lt-t5       |

Without t5_xxl:

|                | with clip_l | without clip_l |
|----------------|-------------|----------------|
| with clip_g    | 3.5lt-not5  | 3.5lt-cg       |
| without clip_g | 3.5lt-cl    | 3.5lt-nop      |

Flux Schnell (iq4_nl imatrix):

|                | with clip_l | without clip_l |
|----------------|-------------|----------------|
| with T5_xxl    | fs-all      | fs-noclip      |
| without T5_xxl | fs-not5     | fs-nop         |

@rmatif

rmatif commented Apr 5, 2025

Thanks to this, one can now run Flux on an 8GB Android phone.


@Green-Sky
Contributor

@rmatif is your comment about this PR specifically? It kind of sounds unrelated.

BTW, did you try one of the Flux 8B "lite" prunes?
https://huggingface.co/Green-Sky/flux.1-lite-8B-GGUF/tree/main/lora-experiments
Those have the Hyper-SD LoRA merged in, for a lower step count.

@stduhpf
Contributor Author

stduhpf commented Apr 6, 2025

@Green-Sky I think @rmatif meant that with this PR it's possible to drop T5, which makes Flux fit in only 8GB of system memory.

@rmatif

rmatif commented Apr 6, 2025

> @Green-Sky I think @rmatif meant that with this PR it's possible to drop T5, which makes Flux fit in only 8GB of system memory.

This is exactly what I meant, sorry if I wasn't clear. With this PR, we can drop the heavy T5, so we can squeeze Flux into just an 8GB phone.

@Green-Sky I just tested Flux.1-lite, and the q4_k version can now also fit into those kinds of devices, although you can't run inference at resolutions larger than 512x512 due to the compute buffer size. I bet q3_k will do just fine.

@stduhpf stduhpf changed the title from "Allow models to run with without all text encoder(s)" to "Allow models to run without all text encoder(s)" on Apr 25, 2025