Inoculation Prompting: Instructing LLMs to misbehave at train-time improves test-time alignment
Nevan Wichers, Aram Ebtekar, Ariana Azarbal, Victor Gillioz, Christine Ye, Emil Ryd, Neil Rathi, Henry Sleight, Alex Mallen, Fabien Roger, Samuel Marks
https://arxiv.org/abs/2510.05024
For me I have two scanners for papers, pictures, negatives and slides. For floppies and ZIP disks I have an old SFF machine where I can extract all of the files and save them to USB sticks.
Friends and family knows that if they don't want the pictures to send them to me. I digitize and store them in archival quality holders in a room where the humidity is controlled and the temperature doesn't change very much. Last batch were family images that no one knew of and everyone r…
SecInfer: Preventing Prompt Injection via Inference-time Scaling
Yupei Liu, Yanting Wang, Yuqi Jia, Jinyuan Jia, Neil Zhenqiang Gong
https://arxiv.org/abs/2509.24967 https://
kabr-tools: Automated Framework for Multi-Species Behavioral Monitoring
Jenna Kline, Maksim Kholiavchenko, Samuel Stevens, Nina van Tiel, Alison Zhong, Namrata Banerji, Alec Sheets, Sowbaranika Balasubramaniam, Isla Duporge, Matthew Thompson, Elizabeth Campolongo, Jackson Miliko, Neil Rosser, Tanya Berger-Wolf, Charles V. Stewart, Daniel I. Rubenstein
https://
WAInjectBench: Benchmarking Prompt Injection Detections for Web Agents
Yinuo Liu, Ruohan Xu, Xilong Wang, Yuqi Jia, Neil Zhenqiang Gong
https://arxiv.org/abs/2510.01354 https://…
RangeSAM: Leveraging Visual Foundation Models for Range-View repesented LiDAR segmentation
Paul Julius K\"uhn, Duc Anh Nguyen, Arjan Kuijper, Holger Graf, Dieter Fellner, Saptarshi Neil Sinha
https://arxiv.org/abs/2509.15886