Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csLG_bot@mastoxiv.page
2025-07-17 10:13:20

Online Training and Pruning of Deep Reinforcement Learning Networks
Valentin Frank Ingmar Guenter, Athanasios Sideris
arxiv.org/abs/2507.11975

@arXiv_csAI_bot@mastoxiv.page
2025-07-16 10:17:21

Illuminating the Three Dogmas of Reinforcement Learning under Evolutionary Light
Mani Hamidi, Terrence W. Deacon
arxiv.org/abs/2507.11482

@radioeinsmusicbot@mastodonapp.uk
2025-08-17 16:28:11

🇺🇦 Auf radioeins läuft...
The Rembrandts:
🎵 I'll Be There For You (Theme from "Friends")
#NowPlaying #TheRembrandts
chrisaeson.bandcamp.com/track/
open.spotify.com/track/15tHagk

@arXiv_quantph_bot@mastoxiv.page
2025-07-17 10:02:20

BenchRL-QAS: Benchmarking reinforcement learning algorithms for quantum architecture search
Azhar Ikhtiarudin, Aditi Das, Param Thakkar, Akash Kundu
arxiv.org/abs/2507.12189

@arXiv_csRO_bot@mastoxiv.page
2025-07-16 10:17:51

Ocean Diviner: A Diffusion-Augmented Reinforcement Learning for AUV Robust Control in the Underwater Tasks
Weiyi Liu, Jingzehua Xu, Guanwen Xie, Yi Li
arxiv.org/abs/2507.11283

@arXiv_csHC_bot@mastoxiv.page
2025-06-17 10:52:09

Can you see how I learn? Human observers' inferences about Reinforcement Learning agents' learning processes
Bernhard Hilpert, Muhan Hou, Kim Baraka, Joost Broekens
arxiv.org/abs/2506.13583

@heiseonline@social.heise.de
2025-07-15 12:21:00

Private Astronautenmission zurück auf der Erde
Nach zwei Wochen an Bord der Internationalen Raumstation ISS sind vier Raumfahrer der Axiom-4-Crew zurück auf der Erde. Die Mission brachte viele Premieren.

@arXiv_mathOC_bot@mastoxiv.page
2025-06-17 12:24:17

Research on Optimal Control Problem Based on Reinforcement Learning under Knightian Uncertainty
Ziyu Li, Chen Fei, Weiyin Fei
arxiv.org/abs/2506.13207

@arXiv_csRO_bot@mastoxiv.page
2025-07-16 09:30:31

Versatile and Generalizable Manipulation via Goal-Conditioned Reinforcement Learning with Grounded Object Detection
Huiyi Wang, Fahim Shahriar, Alireza Azimi, Gautham Vasan, Rupam Mahmood, Colin Bellinger
arxiv.org/abs/2507.10814

@arXiv_csRO_bot@mastoxiv.page
2025-06-16 08:01:59

Multi-Loco: Unifying Multi-Embodiment Legged Locomotion via Reinforcement Learning Augmented Diffusion
Shunpeng Yang, Zhen Fu, Zhefeng Cao, Guo Junde, Patrick Wensing, Wei Zhang, Hua Chen
arxiv.org/abs/2506.11470