Instructions to use zai-org/GLM-Image with libraries, inference providers, notebooks, and local apps. Follow these links to get started.
- Libraries
- Diffusers
How to use zai-org/GLM-Image with Diffusers:
pip install -U diffusers transformers accelerate
import torch from diffusers import DiffusionPipeline # switch to "mps" for apple devices pipe = DiffusionPipeline.from_pretrained("zai-org/GLM-Image", dtype=torch.bfloat16, device_map="cuda") prompt = "Astronaut in a jungle, cold color palette, muted colors, detailed, 8k" image = pipe(prompt).images[0] - Inference
- Notebooks
- Google Colab
- Kaggle
- Local Apps
- Draw Things
- DiffusionBee
Text generation quality is low with diffusers
Because the model uses an autoregressive generation structure, it exhibits greater diversity compared to diffusion models. Therefore, you can try generating multiple images, and occasional random errors in individual letters are within normal expectations.
I expected better text quality based on benchmark results, but completely understandable. Thank you for the explanation.
Because the model uses an autoregressive generation structure, it exhibits greater diversity compared to diffusion models. Therefore, you can try generating multiple images, and occasional random errors in individual letters are within normal expectations.
How can we train Loras for this model? And when will the optimisations be released?

