Chain of Thought Monitorability: A New and Fragile Opportunity for AI Safety
Tomek Korbak, Mikita Balesni, Elizabeth Barnes, Yoshua Bengio, Joe Benton, Joseph Bloom, Mark Chen, Alan Cooney, Allan Dafoe, Anca Dragan, Scott Emmons, Owain Evans, David Farhi, Ryan Greenblatt, Dan Hendrycks, Marius Hobbhahn, Evan Hubinger, Geoffrey Irving, Erik Jenner, Daniel Kokotajlo, Victoria Krakovna, Shane Legg, David Lindner, David Luan, Aleksander Mądry, Julian Michael, Neel Nanda, Dave Orr, Jaku…
Been volunteering with @… and @… since January.
We are in the Toronto Star today!
Thanks @…
ICPR SPONSORED TWO DAY MULTIDISCIPLINARY INTERNATIONAL SEMINAR ON EXPLORING THE PHILOSOPHY OF YOGA: THE PURSUIT OF HEALTH, HAPPINESS, HARMONY AND BEYOND
https://ift.tt/j1JM3nr
updated: Monday, July 7, 2025 - 2:17pm
full name / name of organization: ICPR AND DEPARTMENT OF…
via Input 4 RELCFP
The year 2038 will be "interesting".
At 03:14:07 UTC on 19 January 2038 we will hit what may be the real Y2K problem.
And will the US exist as a single nation-state in the year 2038? (I have serious doubts.)
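For anyone wondering where 03:14:07 UTC on 19 January 2038 comes from: it is the last second a signed 32-bit count of seconds since the Unix epoch (1970-01-01 00:00:00 UTC) can represent. A minimal sketch of the arithmetic follows; the function names `last_int32_second` and `wrap_int32` are made up for illustration, and the wraparound is simulated in pure Python rather than taken from any real system's clock.

```python
from datetime import datetime, timedelta, timezone

INT32_MAX = 2**31 - 1  # largest value a signed 32-bit integer can hold
EPOCH = datetime(1970, 1, 1, tzinfo=timezone.utc)  # Unix epoch


def last_int32_second():
    """Return the last moment representable as signed 32-bit epoch seconds."""
    return EPOCH + timedelta(seconds=INT32_MAX)


def wrap_int32(value):
    """Simulate two's-complement wraparound of a signed 32-bit integer."""
    return (value + 2**31) % 2**32 - 2**31


# Last representable second.
print(last_int32_second().isoformat())  # 2038-01-19T03:14:07+00:00

# One second later the counter wraps to the most negative value,
# which a naive conversion would interpret as a date in 1901.
wrapped = wrap_int32(INT32_MAX + 1)
print((EPOCH + timedelta(seconds=wrapped)).isoformat())  # 1901-12-13T20:45:52+00:00
```

Systems that store time in a 64-bit integer do not hit this limit, so the practical exposure depends on how much 32-bit timekeeping is still in use by then.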
LLM-Driven Self-Refinement for Embodied Drone Task Planning
Deyu Zhang, Xicheng Zhang, Jiahao Li, Tingting Long, Xunhua Dai, Yongjian Fu, Jinrui Zhang, Ju Ren, Yaoxue Zhang
https://arxiv.org/abs/2508.15501
Data-driven optimized high-order WENO schemes with low-dissipation and low-dispersion
Jinrui Zhou, Yiqi Gu, Song Jiang, Hua Shen, Liwei Xu, Guanyu Zhou
https://arxiv.org/abs/2508.13190
Prompt-aware of Frame Sampling for Efficient Text-Video Retrieval
Deyu Zhang, Tingting Long, Jinrui Zhang, Ligeng Chen, Ju Ren, Yaoxue Zhang
https://arxiv.org/abs/2507.15491