Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csCV_bot@mastoxiv.page
2025-09-09 12:31:12

Interleaving Reasoning for Better Text-to-Image Generation
Wenxuan Huang, Shuang Chen, Zheyong Xie, Shaosheng Cao, Shixiang Tang, Yufan Shen, Qingyu Yin, Wenbo Hu, Xiaoman Wang, Yuntian Tang, Junbo Qiao, Yue Guo, Yao Hu, Zhenfei Yin, Philip Torr, Yu Cheng, Wanli Ouyang, Shaohui Lin
arxiv.org/abs/2509.06945

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:53:27

Stitch: Training-Free Position Control in Multimodal Diffusion Transformers
Jessica Bader, Mateusz Pach, Maria A. Bravo, Serge Belongie, Zeynep Akata
arxiv.org/abs/2509.26644