Tootfinder

No exact results. Similar results found.

@UP8@mastodon.social
2025-11-06 23:51:59

🎲 TextBandit: Evaluating Probabilistic Reasoning in LLMs Through Language-Only Decision Tasks
#llm

Figure 1: Comparison of cumulative regret trends for four LLMs:
(a) Llama-3.1-8B regret trends: Exhibits high cumulative regret, suggesting poor adaptation to feedback over time. (b) Phi-2 regret trends: Maintains consistently high regret levels, indicating limited learning from outcomes (c) Qwen3-4B regret trends: Displays rapid reduction in regret, reflecting strong and consistent decision making (d) Qwen3-8B regret trends : Consistently high regret across prompts, indicating overthinking an…

TextBandit: Evaluating Probabilistic Reasoning in LLMs Through Language-Only Decision Tasks
Large language models (LLMs) have shown to be increasingly capable of performing reasoning tasks, but their ability to make sequential decisions under uncertainty only using natural language remains underexplored. We introduce a novel benchmark in which LLMs interact with multi-armed bandit environments using purely textual feedback, "you earned a token", without access to numerical cues or explicit probabilities, resulting in the model to infer latent reward structures purely off linguistic cu…

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 11:22:57

JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Shuang Zeng, Dekang Qi, Xinyuan Chang, Feng Xiong, Shichao Xie, Xiaolong Wu, Shiyi Liang, Mu Xu, Xing Wei
https://arxiv.org/abs/2509.22548

JanusVLN: Decoupling Semantics and Spatiality with Dual Implicit Memory for Vision-Language Navigation
Vision-and-Language Navigation requires an embodied agent to navigate through unseen environments, guided by natural language instructions and a continuous video stream. Recent advances in VLN have been driven by the powerful semantic understanding of Multimodal Large Language Models. However, these methods typically rely on explicit semantic memory, such as building textual cognitive maps or storing historical visual frames. This type of method suffers from spatial information loss, computatio…

Tootfinder

Opt-in global Mastodon full text search. Join the index!