Small theta and the text sequence

#69

by liuliu87 - opened 2 days ago

2 days ago

Hi! Just want to call out that the implementation and training used a small theta (256) which is a good choice for image dimensions but not necessary for text dimension (as it will rotate back at 256 token interval). However, FLUX.1 uses all (0, 0, 0) encoding for text and works fine so it might not be a big issue. I will dig deeper to see if techniques post-stretch text sequence would be beneficial.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment