Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csDB_bot@mastoxiv.page
2025-10-01 08:12:47

ActorDB: A Unified Database Model Integrating Single-Writer Actors, Incremental View Maintenance, and Zero-Trust Messaging
Jun Kawasaki
arxiv.org/abs/2509.25285

@arXiv_csAI_bot@mastoxiv.page
2025-09-29 09:48:47

ProRe: A Proactive Reward System for GUI Agents via Reasoner-Actor Collaboration
Gaole Dai, Shiqi Jiang, Ting Cao, Yuqing Yang, Yuanchun Li, Rui Tan, Mo Li, Lili Qiu
arxiv.org/abs/2509.21823

@arXiv_quantph_bot@mastoxiv.page
2025-09-18 10:08:31

Quantum Reinforcement Learning-Guided Diffusion Model for Image Synthesis via Hybrid Quantum-Classical Generative Model Architectures
Chi-Sheng Chen, En-Jui Kuo
arxiv.org/abs/2509.14163

@arXiv_csRO_bot@mastoxiv.page
2025-09-26 10:07:01

MPC-based Deep Reinforcement Learning Method for Space Robotic Control with Fuel Sloshing Mitigation
Mahya Ramezani, M. Amin Alandihallaj, Bar{\i}\c{s} Can Yal\c{c}{\i}n, Miguel Angel Olivares Mendez, Holger Voos
arxiv.org/abs/2509.21045

@arXiv_csLG_bot@mastoxiv.page
2025-08-18 09:41:10

Fusing Rewards and Preferences in Reinforcement Learning
Sadegh Khorasani, Saber Salehkaleybar, Negar Kiyavash, Matthias Grossglauser
arxiv.org/abs/2508.11363

@arXiv_eessSP_bot@mastoxiv.page
2025-09-18 08:19:01

Dual Actor DDPG for Airborne STAR-RIS Assisted Communications
Danish Rizvi, David Boyle
arxiv.org/abs/2509.13328 arxiv.org/pdf/2509.13328…

@Mediagazer@mstdn.social
2025-09-11 08:55:55

People CEO Neil Vogel calls Google a "bad actor" for refusing to pay publishers for AI content and says Google Search drives 25%-30% of visits to People sites (Jeff John Roberts/Fortune)
fortune.com/2025/09/11/media-i

@arXiv_statME_bot@mastoxiv.page
2025-09-03 12:32:43

A Hybrid APIM-CFGM Model for Longitudinal Non-Exchangeable Dyads: Demonstrating and Comparing Estimation Approaches Using Multilevel Modeling
Liu Liu
arxiv.org/abs/2509.00993

@arXiv_csRO_bot@mastoxiv.page
2025-10-08 10:13:39

Learning to Crawl: Latent Model-Based Reinforcement Learning for Soft Robotic Adaptive Locomotion
Vaughn Gzenda, Robin Chhabra
arxiv.org/abs/2510.05957

@arXiv_csCL_bot@mastoxiv.page
2025-09-12 10:01:39

CDE: Curiosity-Driven Exploration for Efficient Reinforcement Learning in Large Language Models
Runpeng Dai, Linfeng Song, Haolin Liu, Zhenwen Liang, Dian Yu, Haitao Mi, Zhaopeng Tu, Rui Liu, Tong Zheng, Hongtu Zhu, Dong Yu
arxiv.org/abs/2509.09675

@arXiv_physicssocph_bot@mastoxiv.page
2025-08-06 08:16:30

Mapping Innovation Networks: A Network-Based Approach to Actor Heterogeneity in National Innovation Systems
Dawoon Jeong, Taewon Kang, Saerom Si, Sangnam Lee, Wonsub Eum
arxiv.org/abs/2508.03498

@arXiv_csLG_bot@mastoxiv.page
2025-10-15 10:46:41

Laminar: A Scalable Asynchronous RL Post-Training Framework
Guangming Sheng, Yuxuan Tong, Borui Wan, Wang Zhang, Chaobo Jia, Xibin Wu, Yuqi Wu, Xiang Li, Chi Zhang, Yanghua Peng, Haibin Lin, Xin Liu, Chuan Wu
arxiv.org/abs/2510.12633

@arXiv_csGR_bot@mastoxiv.page
2025-10-07 10:04:22

Pulp Motion: Framing-aware multimodal camera and human motion generation
Robin Courant, Xi Wang, David Loiseaux, Marc Christie, Vicky Kalogeiton
arxiv.org/abs/2510.05097