Vier Raumfahrer von der ISS abgedockt
Rund eine Woche nach der Ankunft ihrer Ablöse-Crew haben sich vier Raumfahrer von der ISS auf den Weg zurück zur Erde gemacht.
https://www.heise.de/news/Vier-Raumfah…
Oil exploration in the Congo basin rainforest could be a disaster for nature and the climate https://www.theguardian.com/environment/2025/aug/07/opening-up-the-congo-basin-rainforest-could-be-a-disaster-for-na…
Reinforcement Learning Optimization for Large-Scale Learning: An Efficient and User-Friendly Scaling Library
Weixun Wang, Shaopan Xiong, Gengru Chen, Wei Gao, Sheng Guo, Yancheng He, Ju Huang, Jiaheng Liu, Zhendong Li, Xiaoyang Li, Zichen Liu, Haizhou Zhao, Dakai An, Lunxi Cao, Qiyang Cao, Wanxi Deng, Feilei Du, Yiliang Gu, Jiahe Li, Xiang Li, Mingjie Liu, Yijia Luo, Zihe Liu, Yadao Wang, Pei Wang, Tianyuan Wu, Yanan Wu, Yuheng Zhao, Shuaibing Zhao, Jin Yang, Siran Yang, Yingshui Tan, …
Video-RTS: Rethinking Reinforcement Learning and Test-Time Scaling for Efficient and Enhanced Video Reasoning
Ziyang Wang, Jaehong Yoon, Shoubin Yu, Md Mohaiminul Islam, Gedas Bertasius, Mohit Bansal
https://arxiv.org/abs/2507.06485
Deep reinforcement learning for near-deterministic preparation of cubic- and quartic-phase gates in photonic quantum computing
Amanuel Anteneh L\'eandre Brunel, Carlos Gonz\'alez-Arciniegas, Olivier Pfister
https://arxiv.org/abs/2506.07859
Improving Long-Range Navigation with Spatially-Enhanced Recurrent Memory via End-to-End Reinforcement Learning
Fan Yang, Per Frivik, David Hoeller, Chen Wang, Cesar Cadena, Marco Hutter
https://arxiv.org/abs/2506.05997
Reinforcement Learning for Trade Execution with Market Impact
Patrick Cheridito, Moritz Weiss
https://arxiv.org/abs/2507.06345 https://
Detecting and Mitigating Reward Hacking in Reinforcement Learning Systems: A Comprehensive Empirical Study
Ibne Farabi Shihab, Sanjeda Akter, Anuj Sharma
https://arxiv.org/abs/2507.05619