PersonaTeaming: Exploring How Introducing Personas Can Improve Automated AI Red-Teaming
Wesley Hanwen Deng, Sunnie S. Y. Kim, Akshita Jha, Ken Holstein, Motahhare Eslami, Lauren Wilcox, Leon A Gatys
https://arxiv.org/abs/2509.03728
heute & morgen nehme ich an der VDB- und @CrossAsia-Online-Fortbildung für Fachreferent:innen und Bibliothekar:innen der Asienwissenschaften 2025: https://blog.crossasia.org/vdb-online-fortbildung-2025/. 🎧
ich kann die fächer, die wir aus dem themenspektrum in münster h…
Towards a Unified View of Large Language Model Post-Training
Xingtai Lv, Yuxin Zuo, Youbang Sun, Hongyi Liu, Yuntian Wei, Zhekai Chen, Lixuan He, Xuekai Zhu, Kaiyan Zhang, Bingning Wang, Ning Ding, Bowen Zhou
https://arxiv.org/abs/2509.04419
Format Inertia: A Failure Mechanism of LLMs in Medical Pre-Consultation
Seungseop Lim, Gibaeg Kim, Wooseok Han, Jean Seo, Hyunkyung Lee, Jaehyo Yoo, Eunho Yang
https://arxiv.org/abs/2510.01688
AI-CNet3D: An Anatomically-Informed Cross-Attention Network with Multi-Task Consistency Fine-tuning for 3D Glaucoma Classification
Roshan Kenia, Anfei Li, Rishabh Srivastava, Kaveri A. Thakoor
https://arxiv.org/abs/2510.00882
The Very Faint X-ray Transient 4XMM J174610.7-290020 at the Galactic center
Giovanni Stel, Gabriele Ponti, Nathalie Degenaar, Lara Sidoli, Sandro Mereghetti, Kaya Mori, Tong Bao, Giulia Illiano, Samaresh Mondal, Mark Reynolds, Chichuan Jin, Tianying Lian, Shifra Mandel, Simone Scaringi, Shuo Zhang, Grace Sanger-Johnson, Rudy Wijnands, Jon M. Miller, Jamie Kennea, Zhenlin Zhu
How Well Do Vision--Language Models Understand Cities? A Comparative Study on Spatial Reasoning from Street-View Images
Juneyoung Ro, Namwoo Kim, Yoonjin Yoon
https://arxiv.org/abs/2508.21565
Adaptive Planning for Multi-Attribute Controllable Summarization with Monte Carlo Tree Search
Sangwon Ryu, Heejin Do, Yunsu Kim, Gary Geunbae Lee, Jungseul Ok
https://arxiv.org/abs/2509.26435
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[3/5]:
- Medical Red Teaming Protocol of Language Models: On the Importance of User Perspectives in Health...
Jean-Philippe Corbeil, Minseon Kim, Alessandro Sordoni, Francois Beaulieu, Paul Vozila
ReSURE: Regularizing Supervision Unreliability for Multi-turn Dialogue Fine-tuning
Yiming Du, Yifan Xiang, Bin Liang, Dahua Lin, Kam-Fai Wong, Fei Tan
https://arxiv.org/abs/2508.19996