Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csIR_bot@mastoxiv.page
2025-09-09 07:52:32

LESER: Learning to Expand via Search Engine-feedback Reinforcement in e-Commerce
Yipeng Zhang, Bowen Liu, Xiaoshuang Zhang, Aritra Mandal, Zhe Wu, Canran Xu
arxiv.org/abs/2509.05570

@arXiv_csRO_bot@mastoxiv.page
2025-10-07 11:26:52

Learning to Capture Rocks using an Excavator: A Reinforcement Learning Approach with Guiding Reward Formulation
Amirmasoud Molaei, Reza Ghabcheloo
arxiv.org/abs/2510.04168

@arXiv_eessAS_bot@mastoxiv.page
2025-10-08 08:00:19

AQA-TTRL: Self-Adaptation in Audio Question Answering with Test-Time Reinforcement Learning
Haoyu Zhang, Jiaxian Guo, Yusuke Iwasawa, Yutaka Matsuo
arxiv.org/abs/2510.05478

@arXiv_csLG_bot@mastoxiv.page
2025-10-03 11:00:11

Reinforcement Learning with Action-Triggered Observations
Alexander Ryabchenko, Wenlong Mou
arxiv.org/abs/2510.02149 arxiv.org/pdf/2510.021…

@hex@kolektiva.social
2025-08-21 08:16:38

#HotTake:
Liberalism is a syncretic religion where the highest deity is the State and all other deities are subject to it. Liberalism asserts this distinction by classifying political systems as deriving from "natural law" and religions as faith systems. Classical liberalism divided the world into "primitive" beliefs and "enlightened" beliefs. Atheistic anarchism aligns with this distinction by opposing the state and capitalism through logic, rather than asserting that the state and capitalism are themselves systems of faith. Atheistic anarchism further reinforces the narrative of the liberal state by aligning with the liberal understanding of religion as a "primitive" institution.
Graeber et al pointed out the connections between property rights as a form of fedishism (magic) and the evolution of currency, debt, taxes, and the state from temples.

@arXiv_csAI_bot@mastoxiv.page
2025-09-15 09:32:41

Mutual Information Tracks Policy Coherence in Reinforcement Learning
Cameron Reid, Wael Hafez, Amirhossein Nazeri
arxiv.org/abs/2509.10423

@arXiv_csRO_bot@mastoxiv.page
2025-08-27 07:39:42

Mining the Long Tail: A Comparative Study of Data-Centric Criticality Metrics for Robust Offline Reinforcement Learning in Autonomous Motion Planning
Antonio Guillen-Perez
arxiv.org/abs/2508.18397

@arXiv_csRO_bot@mastoxiv.page
2025-08-27 09:15:42

Learning Real-World Acrobatic Flight from Human Preferences
Colin Merk, Ismail Geles, Jiaxu Xing, Angel Romero, Giorgia Ramponi, Davide Scaramuzza
arxiv.org/abs/2508.18817

@arXiv_csRO_bot@mastoxiv.page
2025-08-19 11:02:40

Robot Trains Robot: Automatic Real-World Policy Adaptation and Learning for Humanoids
Kaizhe Hu, Haochen Shi, Yao He, Weizhuo Wang, C. Karen Liu, Shuran Song
arxiv.org/abs/2508.12252

@arXiv_csRO_bot@mastoxiv.page
2025-09-23 12:13:20

Sight Over Site: Perception-Aware Reinforcement Learning for Efficient Robotic Inspection
Richard Kuhlmann, Jakob Wolfram, Boyang Sun, Jiaxu Xing, Davide Scaramuzza, Marc Pollefeys, Cesar Cadena
arxiv.org/abs/2509.17877