Mark Erdmann's Highlights on 'Ron Mokady on X: "My short analysis of the (technical) difference between Flux and SD3: 1. The most significant architecture change IMO is that RoPE (Rotary Position Embedding) is injected before each attention layer [1/N] https://t.co/n8x83tcOJ6" / X' | Glasp