Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csHC_bot@mastoxiv.page
2025-10-03 08:25:11

An Anthropologist LLM to Elicit Users' Moral Preferences through Role-Play
Gianluca De Ninno, Paola Inverardi, Francesca Belotti
arxiv.org/abs/2510.01189

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 11:00:19

The Alignment Waltz: Jointly Training Agents to Collaborate for Safety
Jingyu Zhang, Haozhu Wang, Eric Michael Smith, Sid Wang, Amr Sharaf, Mahesh Pasupuleti, Benjamin Van Durme, Daniel Khashabi, Jason Weston, Hongyuan Zhan
arxiv.org/abs/2510.08240

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 07:33:08

Base Models Know How to Reason, Thinking Models Learn When
Constantin Venhoff, Iv\'an Arcuschin, Philip Torr, Arthur Conmy, Neel Nanda
arxiv.org/abs/2510.07364

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 10:32:51

Unlocking Vision-Language Models for Video Anomaly Detection via Fine-Grained Prompting
Shu Zou, Xinyu Tian, Lukas Wesemann, Fabian Waschkowski, Zhaoyuan Yang, Jing Zhang
arxiv.org/abs/2510.02155

@arXiv_csGT_bot@mastoxiv.page
2025-12-08 08:18:30

Robust forecast aggregation via additional queries
Rafael Frongillo, Mary Monroe, Eric Neyman, Bo Waggoner
arxiv.org/abs/2512.05271 arxiv.org/pdf/2512.05271 arxiv.org/html/2512.05271
arXiv:2512.05271v1 Announce Type: new
Abstract: We study the problem of robust forecast aggregation: combining expert forecasts with provable accuracy guarantees compared to the best possible aggregation of the underlying information. Prior work shows strong impossibility results, e.g. that even under natural assumptions, no aggregation of the experts' individual forecasts can outperform simply following a random expert (Neyman and Roughgarden, 2022).
In this paper, we introduce a more general framework that allows the principal to elicit richer information from experts through structured queries. Our framework ensures that experts will truthfully report their underlying beliefs, and also enables us to define notions of complexity over the difficulty of asking these queries. Under a general model of independent but overlapping expert signals, we show that optimal aggregation is achievable in the worst case with each complexity measure bounded above by the number of agents $n$. We further establish tight tradeoffs between accuracy and query complexity: aggregation error decreases linearly with the number of queries, and vanishes when the "order of reasoning" and number of agents relevant to a query is $\omega(\sqrt{n})$. These results demonstrate that modest extensions to the space of expert queries dramatically strengthen the power of robust forecast aggregation. We therefore expect that our new query framework will open up a fruitful line of research in this area.
toXiv_bot_toot

@arXiv_csCR_bot@mastoxiv.page
2025-09-29 07:33:44

Design and Implementation of a Secure RAG-Enhanced AI Chatbot for Smart Tourism Customer Service: Defending Against Prompt Injection Attacks -- A Case Study of Hsinchu, Taiwan
Yu-Kai Shih, You-Kai Kang
arxiv.org/abs/2509.21367

@arXiv_csCL_bot@mastoxiv.page
2025-10-03 10:54:41

More Than One Teacher: Adaptive Multi-Guidance Policy Optimization for Diverse Exploration
Xiaoyang Yuan, Yujuan Ding, Yi Bin, Wenqi Shao, Jinyu Cai, Jingkuan Song, Yang Yang, Hengtao Shen
arxiv.org/abs/2510.02227

@arXiv_econEM_bot@mastoxiv.page
2025-09-22 07:38:01

Efficient and Accessible Discrete Choice Experiments: The DCEtool Package for R
Daniel P\'erez-Troncoso
arxiv.org/abs/2509.15326 arxiv.…

@arXiv_csHC_bot@mastoxiv.page
2025-09-22 09:32:31

Relational Dissonance in Human-AI Interactions: The Case of Knowledge Work
Emrecan Gulay, Eleonora Picco, Enrico Glerean, Corinna Coupette
arxiv.org/abs/2509.15836

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:08:31

VoiceBBQ: Investigating Effect of Content and Acoustics in Social Bias of Spoken Language Model
Junhyuk Choi, Ro-hoon Oh, Jihwan Seol, Bugeun Kim
arxiv.org/abs/2509.21108

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:19:01

SciReasoner: Laying the Scientific Reasoning Ground Across Disciplines
Yizhou Wang, Chen Tang, Han Deng, Jiabei Xiao, Jiaqi Liu, Jianyu Wu, Jun Yao, Pengze Li, Encheng Su, Lintao Wang, Guohang Zhuang, Yuchen Ren, Ben Fei, Ming Hu, Xin Chen, Dongzhan Zhou, Junjun He, Xiangyu Yue, Zhenfei Yin, Jiamin Wu, Qihao Zheng, Yuhao Zhou, Huihui Xu, Chenglong Ma, Yan Lu, Wenlong Zhang, Chunfeng Song, Philip Torr, Shixiang Tang, Xinzhu Ma, Wanli Ouyang, Lei Bai