Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets
Lei Hsiung, Tianyu Pang, Yung-Chen Tang, Linyue Song, Tsung-Yi Ho, Pin-Yu Chen, Yaoqing Yang
https://arxiv.org/abs/2506.05346
Buyer With Ties to Chinese Communist Party Got V.I.P. Treatment at Trump Crypto Dinner (New York Times)
https://www.nytimes.com/2025/06/06/us/politics/trump-crypto-dinner-china-he-tianying.html
http://www.memeorandum.com/250606/p144#a250606p144
The Chinese Pulsar Timing Array data release I. Single pulsar noise analysis
Siyuan Chen, Heng Xu, Yanjun Guo, Bojun Wang, R. Nicolas Caballero, Jinchen Jiang, Jiangwei Xu, Zihan Xue, Kejia Lee, Jianping Yuan, Yonghua Xu, Jingbo Wang, Longfei Hao, Jintao Luo, Jinlin Han, Peng Jiang, Zhiqiang Shen, Min Wang, Na Wang, Renxin Xu, Xiangping Wu, Lei Qian, Xin Guan, Menglin Huang, Chun Sun, Yan Zhu
Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning
Hasin Us Sami, Swapneel Sen, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy, Basak Guler
https://arxiv.org/abs/2506.04453
Precise Timing Analysis of Four Magnetic Cataclysmic Variables with TESS
Srinivas M Rao, Jeewan C Pandey, Nikita Rawat, Arti Joshi, Ajay Kumar Singh
https://arxiv.org/abs/2506.04371
ROSGuard: A Bandwidth Regulation Mechanism for ROS2-based Applications
Jon Altonaga Puente, Enrico Mezzetti, Irune Agirre Troncoso, Jaume Abella Ferrer, Francisco J. Cazorla Almeida
https://arxiv.org/abs/2506.04640
Teaming in the AI Era: AI-Augmented Frameworks for Forming, Simulating, and Optimizing Human Teams
Mohammed Almutairi
https://arxiv.org/abs/2506.05265 http…
VisCoder: Fine-Tuning LLMs for Executable Python Visualization Code Generation
Yuansheng Ni, Ping Nie, Kai Zou, Xiang Yue, Wenhu Chen
https://arxiv.org/abs/2506.03930
For the more metal-minded among our pampered listeners, we also have music releases: «God Of Angels Trust» by the Danish band Volbeat.
#Volbeat
Learning to Diagnose Privately: DP-Powered LLMs for Radiology Report Classification
Payel Bhattacharjee, Fengwei Tian, Ravi Tandon, Joseph Lo, Heidi Hanson, Geoffrey Rubin, Nirav Merchant, John Gounley
https://arxiv.org/abs/2506.04450