ZORRO: Zero-Knowledge Robustness and Privacy for Split Learning (Full Version)
Nojan Sheybani, Alessandro Pegoraro, Jonathan Knauer, Phillip Rieger, Elissa Mollakuqe, Farinaz Koushanfar, Ahmad-Reza Sadeghi
https://arxiv.org/abs/2509.09787
The large-scale kinematics of young stars in the Milky Way disc: first results from SDSS-V
Eleonora Zari, Jaime Villaseñor, Marina Kounkel, Hans-Walter Rix, Neige Frankel, Andrew Tkachenko, Sergey Khoperskov, Elena D'Onghia, Alexandre Roman-Lopes, Carlos Román-Zúñiga, S. Guy Stringfellow, C. Jonathan Tan, Aida Wofford, Dmitry Bizyaev, John Donor, G. José Fernández-Trincado, Sean Morrison, Kaike Pan, F. Sebastian Sanchez, Andrew Saydjari
Bag of Tricks for Subverting Reasoning-based Safety Guardrails
Shuo Chen, Zhen Han, Haokun Chen, Bailan He, Shengyun Si, Jingpei Wu, Philip Torr, Volker Tresp, Jindong Gu
https://arxiv.org/abs/2510.11570
Officials botch overtime coin toss during Berlin matchup between Colts, Falcons
https://www.cbssports.com/nfl/news/colts-falcons-overtime-coin-toss-berlin-ind…
EDUMATH: Generating Standards-aligned Educational Math Word Problems
Bryan R. Christ, Penelope Molitz, Jonathan Kropko, Thomas Hartvigsen
https://arxiv.org/abs/2510.06965 https:…
TIT-Score: Evaluating Long-Prompt Based Text-to-Image Alignment via Text-to-Image-to-Text Consistency
Juntong Wang, Huiyu Duan, Jiarui Wang, Ziheng Jia, Guangtao Zhai, Xiongkuo Min
https://arxiv.org/abs/2510.02987
Simultaneous Multi-objective Alignment Across Verifiable and Non-verifiable Rewards
Yiran Shen, Yu Xia, Jonathan Chang, Prithviraj Ammanabrolu
https://arxiv.org/abs/2510.01167 h…
Safety Instincts: LLMs Learn to Trust Their Internal Compass for Self-Defense
Guobin Shen, Dongcheng Zhao, Haibo Tong, Jindong Li, Feifei Zhao, Yi Zeng
https://arxiv.org/abs/2510.01088
Fairness in Token Delegation: Mitigating Voting Power Concentration in DAOs
Johnnatan Messias, Ayae Ide
https://arxiv.org/abs/2510.05830 https://arxiv.org/…