Tootfinder

Opt-in global Mastodon full text search.

No exact results. Similar results found.
@arXiv_csCL_bot@mastoxiv.page
2025-08-13 10:19:12

OdysseyBench: Evaluating LLM Agents on Long-Horizon Complex Office Application Workflows
Weixuan Wang, Dongge Han, Daniel Madrigal Diaz, Jin Xu, Victor Rühle, Saravan Rajmohan
arxiv.org/abs/2508.09124

@arXiv_csRO_bot@mastoxiv.page
2025-08-12 12:01:03

ODYSSEY: Open-World Quadrupeds Exploration and Manipulation for Long-Horizon Tasks
Kaijun Wang, Liqin Lu, Mingyu Liu, Jianuo Jiang, Zeju Li, Bolin Zhang, Wancai Zheng, Xinyi Yu, Hao Chen, Chunhua Shen
arxiv.org/abs/2508.08240

@arXiv_csAI_bot@mastoxiv.page
2025-08-11 07:59:59

Mediator-Guided Multi-Agent Collaboration among Open-Source Models for Medical Decision-Making
Kaitao Chen, Mianxin Liu, Daoming Zong, Chaoyue Ding, Shaohao Rui, Yankai Jiang, Mu Zhou, Xiaosong Wang
arxiv.org/abs/2508.05996

@arXiv_astrophHE_bot@mastoxiv.page
2025-10-09 09:33:01

Testing new-physics scenarios with the combined LHAASO and Carpet-3 fluence spectrum of GRB 221009A: axion-like particles and Lorentz-invariance violation
P. S. Satunin, S. V. Troitsky
arxiv.org/abs/2510.07234

@arXiv_csCR_bot@mastoxiv.page
2025-08-11 09:55:09

ScamAgents: How AI Agents Can Simulate Human-Level Scam Calls
Sanket Badhe
arxiv.org/abs/2508.06457 arxiv.org/pdf/2508.06457

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 11:20:19

SciVideoBench: Benchmarking Scientific Video Reasoning in Large Multimodal Models
Andong Deng, Taojiannan Yang, Shoubin Yu, Lincoln Spencer, Mohit Bansal, Chen Chen, Serena Yeung-Levy, Xiaohan Wang
arxiv.org/abs/2510.08559

@arXiv_csCL_bot@mastoxiv.page
2025-10-09 10:21:11

Scaling LLM Multi-turn RL with End-to-end Summarization-based Context Management
Miao Lu, Weiwei Sun, Weihua Du, Zhan Ling, Xuesong Yao, Kang Liu, Jiecao Chen
arxiv.org/abs/2510.06727

@samvarma@fosstodon.org
2025-09-01 03:26:36

Just finished "Sinners".
Have a feeling it's going to stay with me for a while. Anyone who says "cinema" is dying is digging in the wrong place.
themoviedb.org/movie/1233413-s

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:31:19

R-Horizon: How Far Can Your Large Reasoning Model Really Go in Breadth and Depth?
Yi Lu, Jianing Wang, Linsen Guo, Wei He, Hongyin Tang, Tao Gui, Xuanjing Huang, Xuezhi Cao, Wei Wang, Xunliang Cai
arxiv.org/abs/2510.08189

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 10:52:19

DACIP-RC: Domain Adaptive Continual Instruction Pre-Training via Reading Comprehension on Business Conversations
Elena Khasanova, Harsh Saini, Md Tahmid Rahman Laskar, Xue-Yong Fu, Cheng Chen, Shashi Bhushan TN
arxiv.org/abs/2510.08152