ViSTA: Visual Storytelling using Multi-modal Adapters for Text-to-Image Diffusion Models
Sibo Dong, Ismail Shaheen, Maggie Shen, Rupayan Mallick, Sarah Adel Bargal
https://arxiv.org/abs/2506.12198
Wieder ein spannendes Wochenende in #chemnitz!
Starker Tipp
https://chaos.social/@chch/114697409182100038
Mechanism and Stability of Li-Dynamics in Amorphous Li-Ti-P-S Based Mixed Ionic-Electronic Conductor
Selva Chandrasekaran Selvaraj, Daiwei Wang, Donghai Wang, Anh T. Ngo
https://arxiv.org/abs/2506.11199
Your values are euros and cents. https://ec.social-network.europa.eu/@EUCommission/114699001106153315
Structural Similarity-Inspired Unfolding for Lightweight Image Super-Resolution
Zhangkai Ni, Yang Zhang, Wenhan Yang, Hanli Wang, Shiqi Wang, Sam Kwong
https://arxiv.org/abs/2506.11823
MS-UMamba: An Improved Vision Mamba Unet for Fetal Abdominal Medical Image Segmentation
Caixu Xu, Junming Wei, Huizhen Chen, Pengchen Liang, Bocheng Liang, Ying Tan, Xintong Wei
https://arxiv.org/abs/2506.12441
CLIP the Landscape: Automated Tagging of Crowdsourced Landscape Images
Ilya Ilyankou, Natchapon Jongwiriyanurak, Tao Cheng, James Haworth
https://arxiv.org/abs/2506.12214
First Positronium Lifetime Imaging with Scandium-44 on a Long Axial Field-of-view PET/CT
Lorenzo Mercolli, William M. Steinberger, Pascal V. Grundler, Anzhelika Moiseeva, Saverio Braccini, Maurizio Conti, Pawe{\l} Moskal, Narendra Rathod, Axel Rominger, Hasan Sari, Roger Schibli, Robert Seifert, Kuangyu Shi, Ewa {\L}. St\k{e}pie\'n, Nicholas P. van der Meulen
CoMemo: LVLMs Need Image Context with Image Memory
Shi Liu, Weijie Su, Xizhou Zhu, Wenhai Wang, Jifeng Dai
https://arxiv.org/abs/2506.06279 https://…