Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCV_bot@mastoxiv.page
2025-09-17 14:04:13

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/4]:
- Evaluating the Robustness of Open-Source Vision-Language Models to Domain Shift in Object Captioning
Federico Tavella, Amber Drinkwater, Angelo Cangelosi

@arXiv_csSE_bot@mastoxiv.page
2025-09-17 09:49:30

Automating Code Generation for Semiconductor Equipment Control from Developer Utterances with LLMs
Youngkyoung Kim, Sanghyeok Park, Misoo Kim, Gangho Yoon, Eunseok Lee, Simon S. Woo
arxiv.org/abs/2509.13055

@harrysentonbury@social.linux.pizza
2025-10-16 20:10:01

this should keep yous out of trouble for a while :luna_moth: 🎹
#liveCoding

@arXiv_csAI_bot@mastoxiv.page
2025-08-15 09:33:42

Reverse Physician-AI Relationship: Full-process Clinical Diagnosis Driven by a Large Language Model
Shicheng Xu, Xin Huang, Zihao Wei, Liang Pang, Huawei Shen, Xueqi Cheng
arxiv.org/abs/2508.10492

@arXiv_csCV_bot@mastoxiv.page
2025-08-18 12:06:15

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/4]:
- IRL-VLA: Training an Vision-Language-Action Policy via Reward World Model
Jiang, Gao, Wang, Sun, Wang, Heng, Sun, Tang, Zhu, Chai, Wang, Gu, Jiang, Sun

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:44:11

Reasoning Pattern Matters: Learning to Reason without Human Rationales
Chaoxu Pang, Yixuan Cao, Ping Luo
arxiv.org/abs/2510.12643 arxiv.org…

@arXiv_qbioNC_bot@mastoxiv.page
2025-08-15 08:21:42

Large Language Models Show Signs of Alignment with Human Neurocognition During Abstract Reasoning
Christopher Pinier, Sonia Acu\~na Vargas, Mariia Steeghs-Turchina, Dora Matzke, Claire E. Stevenson, Michael D. Nunez
arxiv.org/abs/2508.10057

@sascha_wolfer@fediscience.org
2025-10-10 06:06:17

Finally, what Xia & Lindell call a "separation problem" is, in our view, a feature of our approach and not a bug.
If, e.g., all languages in a family are polysynthetic (or none are), that’s not a statistical artefact – it’s the signal. The outcome is well associated with genealogy, showing that family membership captures someth genuinely informative about the process. When the model finds that family explains a large share of the variance, that's not a failure–it's evidence that phylogenetic structure dominates the pattern.
So while Xia & Lindell insist that "autocorrelation due to relationships and distance cannot be captured in family or regional-level analyses", we see that as an empirical question – and we treated it as one.
The real test is whether a mixed model that explicitly represents phylogeny and geography performs worse than their alternative, where the entire shared history of languages and environments is effectively collapsed into a single dimension (an eigenvector).
In other words: we model relationships – Xia & Lindell summarise them into one number per language.

@arXiv_csDB_bot@mastoxiv.page
2025-10-14 08:38:58

Poseidon: A OneGraph Engine
Brad Bebee, \"Umit V. \c{C}ataly\"urek, Olaf Hartig, Ankesh Khandelwal, Simone Rondelli, Michael Schmidt, Lefteris Sidirourgos, Bryan Thompson
arxiv.org/abs/2510.11166

@arXiv_csHC_bot@mastoxiv.page
2025-08-11 09:11:39

Automatic Semantic Alignment of Flow Pattern Representations for Exploration with Large Language Models
Weihan Zhang, Jun Tao
arxiv.org/abs/2508.06300

@arXiv_csCR_bot@mastoxiv.page
2025-09-11 09:23:03

Architecting Resilient LLM Agents: A Guide to Secure Plan-then-Execute Implementations
Ron F. Del Rosario, Klaudia Krawiecka, Christian Schroeder de Witt
arxiv.org/abs/2509.08646

@arXiv_csCV_bot@mastoxiv.page
2025-09-16 17:49:35

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/5]:
- Video-based Sign Language Recognition without Temporal Segmentation
Jie Huang, Wengang Zhou, Qilin Zhang, Houqiang Li, Weiping Li

@arXiv_csRO_bot@mastoxiv.page
2025-10-09 10:06:51

TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
Yi Han, Cheng Chi, Enshen Zhou, Shanyu Rong, Jingkun An, Pengwei Wang, Zhongyuan Wang, Lu Sheng, Shanghang Zhang
arxiv.org/abs/2510.07181

@arXiv_csCL_bot@mastoxiv.page
2025-09-10 10:01:01

Talking with Oompa Loompas: A novel framework for evaluating linguistic acquisition of LLM agents
Sankalp Tattwadarshi Swain, Anshika Krishnatray, Dhruv Kumar, Jagat Sesh Challa
arxiv.org/abs/2509.07389

@arXiv_physicsoptics_bot@mastoxiv.page
2025-08-13 08:26:52

Outsmarting Linear Neural Networks via an Incoherent Light-Driven Optical Extreme Learner with Data Reverberation
Bofeng Liu, Xu Mei, Sadman Shafi, Tunan Xia, Iam-Choon Khoo, Zhiwen Liu, Xingjie Ni
arxiv.org/abs/2508.08428

@frankel@mastodon.top
2025-08-26 16:16:02

Architectural #Patterns, The Pattern Language of #SoftwareArchitecture
github.com/denyspoltorak/metap

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 13:59:38

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- SpaCE-10: A Comprehensive Benchmark for Multimodal Large Language Models in Compositional Spatial...
Ziyang Gong, Wenhao Li, Oliver Ma, Songyuan Li, Jiayi Ji, Xue Yang, Gen Luo, Junchi Yan, Rongrong Ji

@arXiv_csAI_bot@mastoxiv.page
2025-09-12 07:43:39

Understanding Economic Tradeoffs Between Human and AI Agents in Bargaining Games
Crystal Qian, Kehang Zhu, John Horton, Benjamin S. Manning, Vivian Tsai, James Wexler, Nithum Thain
arxiv.org/abs/2509.09071

@arXiv_csSD_bot@mastoxiv.page
2025-08-05 10:23:11

Hidden in the Noise: Unveiling Backdoors in Audio LLMs Alignment through Latent Acoustic Pattern Triggers
Liang Lin, Miao Yu, Kaiwen Luo, Yibo Zhang, Lilan Peng, Dexian Wang, Xuehai Tang, Yuanhe Zhang, Xikang Yang, Zhenhong Zhou, Kun Wang, Yang Liu
arxiv.org/abs/2508.02175

@arXiv_csPL_bot@mastoxiv.page
2025-07-30 07:41:31

One Weird Trick to Untie Landin's Knot
Paulette Koronkevich, William J. Bowman
arxiv.org/abs/2507.21317 arxiv.org/pdf/2507.21317

@arXiv_csLG_bot@mastoxiv.page
2025-08-22 10:18:51

Tutorial on the Probabilistic Unification of Estimation Theory, Machine Learning, and Generative AI
Mohammed Elmusrati
arxiv.org/abs/2508.15719

@arXiv_csNI_bot@mastoxiv.page
2025-07-31 08:34:01

OFCnetLLM: Large Language Model for Network Monitoring and Alertness
Hong-Jun Yoon, Mariam Kiran, Danial Ebling, Joe Breen
arxiv.org/abs/2507.22711

@arXiv_csCL_bot@mastoxiv.page
2025-10-01 11:38:37

Convergence and Divergence of Language Models under Different Random Seeds
Finlay Fehlauer (ETH Zurich), Kyle Mahowald (University of Texas at Austin), Tiago Pimentel (ETH Zurich)
arxiv.org/abs/2509.26643

@arXiv_csAI_bot@mastoxiv.page
2025-10-01 11:33:07

AI Playing Business Games: Benchmarking Large Language Models on Managerial Decision-Making in Dynamic Simulations
Berdymyrat Ovezmyradov
arxiv.org/abs/2509.26331

@arXiv_csCR_bot@mastoxiv.page
2025-09-22 07:34:01

Synergizing Static Analysis with Large Language Models for Vulnerability Discovery and beyond
Vaibhav Agrawal, Kiarash Ahi
arxiv.org/abs/2509.15433

@arXiv_csCV_bot@mastoxiv.page
2025-10-10 16:32:27

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- TIGeR: Tool-Integrated Geometric Reasoning in Vision-Language Models for Robotics
Han, Chi, Zhou, Rong, An, Wang, Wang, Sheng, Zhang

@arXiv_csIR_bot@mastoxiv.page
2025-08-19 09:30:39

Diagnostic-Guided Dynamic Profile Optimization for LLM-based User Simulators in Sequential Recommendation
Hongyang Liu, Zhu Sun, Tianjun Wei, Yan Wang, Jiajie Zhu, Xinghua Qu
arxiv.org/abs/2508.12645

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 07:30:44

SPADE: A Large Language Model Framework for Soil Moisture Pattern Recognition and Anomaly Detection in Precision Agriculture
Yeonju Lee, Rui Qi Chen, Joseph Oboamah, Po Nien Su, Wei-zhen Liang, Yeyin Shi, Lu Gan, Yongsheng Chen, Xin Qiao, Jing Li
arxiv.org/abs/2509.18123

@arXiv_csHC_bot@mastoxiv.page
2025-07-25 09:23:12

Multimodal Behavioral Patterns Analysis with Eye-Tracking and LLM-Based Reasoning
Dongyang Guo, Yasmeen Abdrabou, Enkeleda Thaqi, Enkelejda Kasneci
arxiv.org/abs/2507.18252

@arXiv_csCV_bot@mastoxiv.page
2025-10-07 20:41:53

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/7]:
- AutoDrive-QA: A Multiple-Choice Benchmark for Vision-Language Evaluation in Urban Autonomous Driving
Boshra Khalili, Andrew W. Smyth

@arXiv_csAI_bot@mastoxiv.page
2025-08-06 09:37:30

Beyond Surface-Level Detection: Towards Cognitive-Driven Defense Against Jailbreak Attacks via Meta-Operations Reasoning
Rui Pu, Chaozhuo Li, Rui Ha, Litian Zhang, Lirong Qiu, Xi Zhang
arxiv.org/abs/2508.03054

@arXiv_csCV_bot@mastoxiv.page
2025-09-08 10:57:34

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/1]:
- Sample-efficient Integration of New Modalities into Large Language Models
Osman Batur \.Ince, Andr\'e F. T. Martins, Oisin Mac Aodha, Edoardo M. Ponti

@arXiv_csCV_bot@mastoxiv.page
2025-08-08 14:04:29

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- CountingFruit: Language-Guided 3D Fruit Counting with Semantic Gaussian Splatting
Fengze Li, Yangle Liu, Jieming Ma, Hai-Ning Liang, Yaochun Shen, Huangxiang Li, Zhijing Wu

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 15:04:42

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/5]:
- RS-OOD: A Vision-Language Augmented Framework for Out-of-Distribution Detection in Remote Sensing
Chenhao Wang, Yingrui Ji, Yu Meng, Yunjian Zhang, Yao Zhu

@arXiv_csAI_bot@mastoxiv.page
2025-08-27 10:07:43

STARec: An Efficient Agent Framework for Recommender Systems via Autonomous Deliberate Reasoning
Chenghao Wu, Ruiyang Ren, Junjie Zhang, Ruirui Wang, Zhongrui Ma, Qi Ye, Wayne Xin Zhao
arxiv.org/abs/2508.18812

@arXiv_csCV_bot@mastoxiv.page
2025-10-03 15:05:08

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/5]:
- HAMLET: Switch your Vision-Language-Action Model into a History-Aware Policy
Myungkyu Koo, Daewon Choi, Taeyoung Kim, Kyungmin Lee, Changyeon Kim, Younggyo Seo, Jinwoo Shin

@arXiv_csCV_bot@mastoxiv.page
2025-08-05 19:49:36

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[6/10]:
- CountingFruit: Language-Guided 3D Fruit Counting with Semantic Gaussian Splatting
Fengze Li, Yangle Liu, Jieming Ma, Hai-Ning Liang, Yaochun Shen, Huangxiang Li, Zhijing Wu

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 17:36:23

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/3]:
- MultiStream-LLM: Bridging Modalities for Robust Sign Language Translation
Marshall Thomas, Edward Fish, Richard Bowden

@arXiv_csCV_bot@mastoxiv.page
2025-09-03 22:48:15

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/9]:
- DivScene: Towards Open-Vocabulary Object Navigation with Large Vision Language Models in Diverse ...
Wang, Zhang, Fang, Tian, Yang, Ma, Pan, Song, Yu

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:50:29

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/7]:
- LFTR: Learning-Free Token Reduction for Multimodal Large Language Models
Zihui Zhao, Yingxin Li, Yang Li

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 17:51:10

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/7]:
- CoFFT: Chain of Foresight-Focus Thought for Visual Language Models
Zhang, Dong, Zhang, Jia, Dang, Fernando, Liu, Shou

@arXiv_csCV_bot@mastoxiv.page
2025-07-29 18:30:03

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/9]:
- "Principal Components" Enable A New Language of Images
Xin Wen, Bingchen Zhao, Ismail Elezi, Jiankang Deng, Xiaojuan Qi

@arXiv_csCV_bot@mastoxiv.page
2025-09-22 10:35:31

Language-Instructed Reasoning for Group Activity Detection via Multimodal Large Language Model
Jihua Peng, Qianxiong Xu, Yichen Liu, Chenxi Liu, Cheng Long, Rui Zhao, Ziyue Li
arxiv.org/abs/2509.16054

@arXiv_csCV_bot@mastoxiv.page
2025-09-29 13:06:24

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[1/2]:
- SGAligner : Cross-Modal Language-Aided 3D Scene Graph Alignment
Binod Singh, Sayan Deb Sarkar, Iro Armeni

@arXiv_csCV_bot@mastoxiv.page
2025-07-23 14:03:25

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- VRU-Accident: A Vision-Language Benchmark for Video Question Answering and Dense Captioning for A...
Younggun Kim, Ahmed S. Abdelrahman, Mohamed Abdel-Aty

@arXiv_csCV_bot@mastoxiv.page
2025-09-26 11:37:49

Crosslisted article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/2]:
- CaTS-Bench: Can Language Models Describe Numeric Time Series?
Luca Zhou, Pratham Yashwante, Marshall Fisher, Alessio Sampieri, Zihao Zhou, Fabio Galasso, Rose Yu

@arXiv_csCV_bot@mastoxiv.page
2025-08-26 17:42:34

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/6]:
- Beyond Label Semantics: Language-Guided Action Anatomy for Few-shot Action Recognition
Zefeng Qian, Xincheng Yao, Yifei Huang, Chongyang Zhang, Jiangyong Ying, Hong Sun

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 14:33:57

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[4/5]:
- Investigating Traffic Accident Detection Using Multimodal Large Language Models
Ilhan Skender, Kailin Tong, Selim Solmaz, Daniel Watzenig

@arXiv_csCV_bot@mastoxiv.page
2025-09-23 20:09:30

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/8]:
- SafeEraser: Enhancing Safety in Multimodal Large Language Models through Multimodal Machine Unlea...
Chen, Deng, Zheng, Yan, Liu, Wu, Jiang, Liu, Hu

@arXiv_csCV_bot@mastoxiv.page
2025-09-23 20:10:43

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[8/8]:
- The Better You Learn, The Smarter You Prune: Towards Efficient Vision-language-action Models via ...
Jiang, Jiang, Ma, Wen, Li, Zhan, Jia, Liu, Sun, Lang

@arXiv_csCV_bot@mastoxiv.page
2025-07-23 14:03:05

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/5]:
- RadAlign: Advancing Radiology Report Generation with Vision-Language Concept Alignment
Difei Gu, Yunhe Gao, Yang Zhou, Mu Zhou, Dimitris Metaxas

@arXiv_csCV_bot@mastoxiv.page
2025-07-22 18:49:20

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[3/7]:
- Zero-AVSR: Zero-Shot Audio-Visual Speech Recognition with LLMs by Learning Language-Agnostic Spee...
Jeong Hun Yeo, Minsu Kim, Chae Won Kim, Stavros Petridis, Yong Man Ro

@arXiv_csCV_bot@mastoxiv.page
2025-08-19 16:47:09

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[2/7]:
- SLGaussian: Fast Language Gaussian Splatting in Sparse Views
Kangjie Chen, BingQuan Dai, Minghan Qin, Dongbin Zhang, Peihao Li, Yingshuang Zou, Haoqian Wang

@arXiv_csCV_bot@mastoxiv.page
2025-07-22 18:49:41

Replaced article(s) found for cs.CV. arxiv.org/list/cs.CV/new
[5/7]:
- Calibrated and Robust Foundation Models for Vision-Language and Medical Image Tasks Under Distrib...
Behraj Khan, Tahir Qasim Syed, Nouman M. Durrani, Bilal Naseem, Shabir Ahmad, Rizwan Qureshi