Tootfinder

Opt-in global Mastodon full-text search.

No exact results. Similar results found.
@arXiv_csLG_bot@mastoxiv.page
2025-07-04 10:13:51

On Efficient Bayesian Exploration in Model-Based Reinforcement Learning
Alberto Caron, Chris Hicks, Vasilios Mavroudis
arxiv.org/abs/2507.02639

@arXiv_csAI_bot@mastoxiv.page
2025-09-05 09:58:41

CoT-Space: A Theoretical Framework for Internal Slow-Thinking via Reinforcement Learning
Zeyu Gan, Hao Yi, Yong Liu
arxiv.org/abs/2509.04027

@aligyie@digitalcourage.social
2025-07-02 12:48:41

"While crop seed vaults are common around the world, nurseries for wild and native plants are rare, and many plant species quietly become extinct. This marks #Gurukula out as a Noah’s ark for endangered plant species."

"The primary vegetation around Gurukula Botanical Sanctuary is wet evergreen, medium elevation rainforest."
"Laly Joseph, the head of plant conservation at Gurukula Botanical Sanctuary, has spent most of her life learning about and caring for plants."
"An explosion of Alsophila spinulosa tree ferns, also known as flying spider-monkey tree fern. These are native species found in tropical and subtropical forests across Asia. An abundance of ferns can mean a healthy, high-quality habitat with minimal human disturbance."

@arXiv_csLG_bot@mastoxiv.page
2025-06-05 10:59:18

This arxiv.org/abs/2505.24298 has been replaced.
initial toot: mastoxiv.page/@arXiv_csLG_…

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 08:42:02

BED-LLM: Intelligent Information Gathering with LLMs and Bayesian Experimental Design
Deepro Choudhury, Sinead Williamson, Adam Goliński, Ning Miao, Freddie Bickford Smith, Michael Kirchhof, Yizhe Zhang, Tom Rainforth
arxiv.org/abs/2508.21184

@arXiv_csRO_bot@mastoxiv.page
2025-07-02 08:30:20

Control-Optimized Deep Reinforcement Learning for Artificially Intelligent Autonomous Systems
Oren Fivel, Matan Rudman, Kobi Cohen
arxiv.org/abs/2507.00268

@arXiv_csAI_bot@mastoxiv.page
2025-09-03 08:59:03

Know When to Explore: Difficulty-Aware Certainty as a Guide for LLM Reinforcement Learning
Ang Li, Zhihang Yuan, Yang Zhang, Shouda Liu, Yisen Wang
arxiv.org/abs/2509.00125

@arXiv_csLG_bot@mastoxiv.page
2025-09-01 09:52:42

Priors Matter: Addressing Misspecification in Bayesian Deep Q-Learning
Pascal R. van der Vaart, Neil Yorke-Smith, Matthijs T. J. Spaan
arxiv.org/abs/2508.21488

@arXiv_csLG_bot@mastoxiv.page
2025-07-31 09:18:31

Spatial-Temporal Reinforcement Learning for Network Routing with Non-Markovian Traffic
Molly Wang
arxiv.org/abs/2507.22174 arxiv.org/pdf/25…

@arXiv_csLG_bot@mastoxiv.page
2025-09-01 09:58:32

Neural Network Acceleration on MPSoC board: Integrating SLAC's SNL, Rogue Software and Auto-SNL
Hamza Ezzaoui Rahali, Abhilasha Dave, Larry Ruckman, Mohammad Mehdi Rahimifar, Audrey C. Therrien, James J. Russel, Ryan T. Herbst
arxiv.org/abs/2508.21739