LESER: Learning to Expand via Search Engine-feedback Reinforcement in e-Commerce
Yipeng Zhang, Bowen Liu, Xiaoshuang Zhang, Aritra Mandal, Zhe Wu, Canran Xu
https://arxiv.org/abs/2509.05570
Learning to Capture Rocks using an Excavator: A Reinforcement Learning Approach with Guiding Reward Formulation
Amirmasoud Molaei, Reza Ghabcheloo
https://arxiv.org/abs/2510.04168
AQA-TTRL: Self-Adaptation in Audio Question Answering with Test-Time Reinforcement Learning
Haoyu Zhang, Jiaxian Guo, Yusuke Iwasawa, Yutaka Matsuo
https://arxiv.org/abs/2510.05478
Reinforcement Learning with Action-Triggered Observations
Alexander Ryabchenko, Wenlong Mou
https://arxiv.org/abs/2510.02149 https://arxiv.org/pdf/2510.021…
#HotTake:
Liberalism is a syncretic religion where the highest deity is the State and all other deities are subject to it. Liberalism asserts this distinction by classifying political systems as deriving from "natural law" and religions as faith systems. Classical liberalism divided the world into "primitive" beliefs and "enlightened" beliefs. Atheistic anarchism aligns with this distinction by opposing the state and capitalism through logic, rather than asserting that the state and capitalism are themselves systems of faith. Atheistic anarchism further reinforces the narrative of the liberal state by aligning with the liberal understanding of religion as a "primitive" institution.
Graeber et al pointed out the connections between property rights as a form of fedishism (magic) and the evolution of currency, debt, taxes, and the state from temples.
Mutual Information Tracks Policy Coherence in Reinforcement Learning
Cameron Reid, Wael Hafez, Amirhossein Nazeri
https://arxiv.org/abs/2509.10423 https://…
Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning
Antonio Guillen-Perez
https://arxiv.org/abs/2508.18397
Learning Real-World Acrobatic Flight from Human Preferences
Colin Merk, Ismail Geles, Jiaxu Xing, Angel Romero, Giorgia Ramponi, Davide Scaramuzza
https://arxiv.org/abs/2508.18817
Robot Trains Robot: Automatic Real-World Policy Adaptation and Learning for Humanoids
Kaizhe Hu, Haochen Shi, Yao He, Weizhuo Wang, C. Karen Liu, Shuran Song
https://arxiv.org/abs/2508.12252
Sight Over Site: Perception-Aware Reinforcement Learning for Efficient Robotic Inspection
Richard Kuhlmann, Jakob Wolfram, Boyang Sun, Jiaxu Xing, Davide Scaramuzza, Marc Pollefeys, Cesar Cadena
https://arxiv.org/abs/2509.17877