Why LLM Safety Guardrails Collapse After Fine-tuning: A Similarity Analysis Between Alignment and Fine-tuning Datasets
Lei Hsiung, Tianyu Pang, Yung-Chen Tang, Linyue Song, Tsung-Yi Ho, Pin-Yu Chen, Yaoqing Yang
https://arxiv.org/abs/2506.05346
Buyer With Ties to Chinese Communist Party Got V.I.P. Treatment at Trump Crypto Dinner (New York Times)
https://www.nytimes.com/2025/06/06/us/politics/trump-crypto-dinner-china-he-tianying.html
http://www.memeorandum.com/250606/p144#a250606p144
The Chinese Pulsar Timing Array data release I. Single pulsar noise analysis
Siyuan Chen, Heng Xu, Yanjun Guo, Bojun Wang, R. Nicolas Caballero, Jinchen Jiang, Jiangwei Xu, Zihan Xue, Kejia Lee, Jianping Yuan, Yonghua Xu, Jingbo Wang, Longfei Hao, Jintao Luo, Jinlin Han, Peng Jiang, Zhiqiang Shen, Min Wang, Na Wang, Renxin Xu, Xiangping Wu, Lei Qian, Xin Guan, Menglin Huang, Chun Sun, Yan Zhu
Gradient Inversion Attacks on Parameter-Efficient Fine-Tuning
Hasin Us Sami, Swapneel Sen, Amit K. Roy-Chowdhury, Srikanth V. Krishnamurthy, Basak Guler
https://arxiv.org/abs/2506.04453
🇺🇦 #NowPlaying on KEXP's #MiddayShow
Tomo Nakayama:
🎵 Get To Know You
#TomoNakayama
https://tomomusic.bandcamp.com/track/get-to-know-you
https://open.spotify.com/track/6LpfuMXJuFc1WQbv9xqizX
CO-RFT: Efficient Fine-Tuning of Vision-Language-Action Models through Chunked Offline Reinforcement Learning
Dongchi Huang, Zhirui Fang, Tianle Zhang, Yihang Li, Lin Zhao, Chunhe Xia
https://arxiv.org/abs/2508.02219
Precise Timing Analysis of Four Magnetic Cataclysmic Variables with TESS
Srinivas M Rao, Jeewan C Pandey, Nikita Rawat, Arti Joshi, Ajay Kumar Singh
https://arxiv.org/abs/2506.04371
ROSGuard: A Bandwidth Regulation Mechanism for ROS2-based Applications
Jon Altonaga Puente, Enrico Mezzetti, Irune Agirre Troncoso, Jaume Abella Ferrer, Francisco J. Cazorla Almeida
https://arxiv.org/abs/2506.04640
ASTRA: Autonomous Spatial-Temporal Red-teaming for AI Software Assistants
Xiangzhe Xu, Guangyu Shen, Zian Su, Siyuan Cheng, Hanxi Guo, Lu Yan, Xuan Chen, Jiasheng Jiang, Xiaolong Jin, Chengpeng Wang, Zhuo Zhang, Xiangyu Zhang
https://arxiv.org/abs/2508.03936