VisorGPT: Learning Visual Prior via Generative Pre-Training Paper • 2305.13777 • Published May 23, 2023
BoxDiff: Text-to-Image Synthesis with Training-Free Box-Constrained Diffusion Paper • 2307.10816 • Published Jul 20, 2023 • 1
Cross-Attention Makes Inference Cumbersome in Text-to-Image Diffusion Models Paper • 2404.02747 • Published Apr 3, 2024 • 13
Learning Video Context as Interleaved Multimodal Sequences Paper • 2407.21757 • Published Jul 31, 2024