Tootfinder

Opt-in global Mastodon full text search.

@UP8@mastodon.social
2025-11-06 23:51:59

🎲 TextBandit: Evaluating Probabilistic Reasoning in LLMs Through Language-Only Decision Tasks
#llm

Figure 1: Comparison of cumulative regret trends for four LLMs: 
(a) Llama-3.1-8B regret trends: Exhibits high cumulative regret, suggesting poor adaptation to feedback over time. (b) Phi-2 regret trends: Maintains consistently high regret levels, indicating limited learning from outcomes. (c) Qwen3-4B regret trends: Displays rapid reduction in regret, reflecting strong and consistent decision making. (d) Qwen3-8B regret trends: Consistently high regret across prompts, indicating overthinking an…
@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:22:57

JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Shuang Zeng, Dekang Qi, Xinyuan Chang, Feng Xiong, Shichao Xie, Xiaolong Wu, Shiyi Liang, Mu Xu, Xing Wei
arxiv.org/abs/2509.22548