
2025-09-25 10:51:12
Video models are zero-shot learners and reasoners
Thaddäus Wiedemer, Yuxuan Li, Paul Vicol, Shixiang Shane Gu, Nick Matarese, Kevin Swersky, Been Kim, Priyank Jaini, Robert Geirhos
https://arxiv.org/abs/2509.20328
Advances in Large Language Models for Medicine
Zhiyu Kan, Wensheng Gan, Zhenlian Qi, Philip S. Yu
https://arxiv.org/abs/2509.18690 https://arxiv.org/pdf/25…
Political Ideology Shifts in Large Language Models
Pietro Bernardelle, Stefano Civelli, Leon Fröhling, Riccardo Lunardi, Kevin Roitero, Gianluca Demartini
https://arxiv.org/abs/2508.16013
VT-LVLM-AR: A Video-Temporal Large Vision-Language Model Adapter for Fine-Grained Action Recognition in Long-Term Videos
Kaining Li, Shuwei He, Zihan Xu
https://arxiv.org/abs/2508.15903
RECALLED: An Unbounded Resource Consumption Attack on Large Vision-Language Models
Haoran Gao, Yuanhe Zhang, Zhenhong Zhou, Lei Jiang, Fanyu Meng, Yujia Xiao, Kun Wang, Yang Liu, Junlan Feng
https://arxiv.org/abs/2507.18053
OpenNav: Open-World Navigation with Multimodal Large Language Models
Mingfeng Yuan, Letian Wang, Steven L. Waslander
https://arxiv.org/abs/2507.18033 https://
DIFFA: Large Language Diffusion Models Can Listen and Understand
Jiaming Zhou, Hongjie Chen, Shiwan Zhao, Jian Kang, Jie Li, Enzhi Wang, Yujie Guo, Haoqin Sun, Hui Wang, Aobo Kong, Yong Qin, Xuelong Li
https://arxiv.org/abs/2507.18452
Reverse Engineering User Stories from Code using Large Language Models
Mohamed Ouf, Haoyu Li, Michael Zhang, Mariam Guizani
https://arxiv.org/abs/2509.19587 https://
Zhu on AI/AGI: 'A sign of true intelligence... is the ability to reason towards a goal with minimal inputs ... a “small data, big task” approach, compared with the “big data, small task” approach employed by large language models like ChatGPT. AGI... is characterised by qualities such as resourcefulness in novel situations, social and physical intuition, and an understanding of cause and effect. Large language models... will never achieve this.'
Affective Computing and Emotional Data: Challenges and Implications in Privacy Regulations, The AI Act, and Ethics in Large Language Models
Nicola Fabiano
https://arxiv.org/abs/2509.20153
A Versatile Pathology Co-pilot via Reasoning Enhanced Multimodal Large Language Model
Zhe Xu, Ziyi Liu, Junlin Hou, Jiabo Ma, Cheng Jin, Yihui Wang, Zhixuan Chen, Zhengyu Zhang, Zhengrui Guo, Fengtao Zhou, Yingxue Xu, Xi Wang, Ronald Cheong Kin Chan, Li Liang, Hao Chen
https://arxiv.org/abs/2507.17303
A Query-Aware Multi-Path Knowledge Graph Fusion Approach for Enhancing Retrieval-Augmented Generation in Large Language Models
Qikai Wei, Huansheng Ning, Chunlong Han, Jianguo Ding
https://arxiv.org/abs/2507.16826
Leveraging Large Language Models to Detect Missed Peephole Optimizations
Zhenyang Xu, Hongxu Xu, Yongqiang Tian, Xintong Zhou, Chengnian Sun
https://arxiv.org/abs/2508.16125 htt…
Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems
Imran Latif, Muhammad Ali Shafique, Hayat Ullah, Alex C. Newkirk, Xi Yu, Arslan Munir
https://arxiv.org/abs/2507.16781
Hardwired-Neurons Language Processing Units as General-Purpose Cognitive Substrates
Yang Liu, Yi Chen, Yongwei Zhao, Yifan Hao, Zifu Zheng, Weihao Kong, Zhangmai Li, Dongchen Jiang, Ruiyang Xia, Zhihong Ma, Zisheng Liu, Zhaoyong Wan, Yunqi Lu, Ximing Liu, Hongrui Guo, Zhihao Yang, Zhe Wang, Tianrui Ma, Mo Zou, Rui Zhang, Ling Li, Xing Hu, Zidong Du, Zhiwei Xu, Qi Guo, Tianshi Chen, Yunji Chen
Congestion Control System Optimization with Large Language Models
Zhiyuan He, Aashish Gottipati, Lili Qiu, Yuqing Yang, Francis Y. Yan
https://arxiv.org/abs/2508.16074 https://
Benchmarking Gaslighting Attacks Against Speech Large Language Models
Jinyang Wu, Bin Zhu, Xiandong Zou, Qiquan Zhang, Xu Fang, Pan Zhou
https://arxiv.org/abs/2509.19858 https:/…
Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning
Situo Zhang, Hanqi Li, Lu Chen, Zihan Zhao, Xuanze Lin, Zichen Zhu, Bo Chen, Xin Chen, Kai Yu
https://arxiv.org/abs/2507.17448
Decoding Consumer Preferences Using Attention-Based Language Models
Joshua Foster, Fredrik Odegaard
https://arxiv.org/abs/2507.17564 https://arxiv.org/pdf/…
EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models
Botai Yuan, Yutian Zhou, Yingjie Wang, Fushuo Huo, Yongcheng Jing, Li Shen, Ying Wei, Zhiqi Shen, Ziwei Liu, Tianwei Zhang, Jie Yang, Dacheng Tao
https://arxiv.org/abs/2509.20146
Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning
Xiao Han, Zimo Zhao, Wanyu Wang, Maolin Wang, Zitao Liu, Yi Chang, Xiangyu Zhao
https://arxiv.org/abs/2509.18942
On the Evolution of Federated Post-Training Large Language Models: A Model Accessibility View
Tao Guo, Junxiao Wang, Fushuo Huo, Laizhong Cui, Song Guo, Jie Gui, Dacheng Tao
https://arxiv.org/abs/2508.16261
Building and Measuring Trust between Large Language Models
Maarten Buyl, Yousra Fettach, Guillaume Bied, Tijl De Bie
https://arxiv.org/abs/2508.15858 https://
Automated Code Review Using Large Language Models with Symbolic Reasoning
Busra Icoz, Goksel Biricik
https://arxiv.org/abs/2507.18476 https://arxiv.org/pdf…
Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models
Guangyu Yang, Jinghong Chen, Jingbiao Mei, Weizhe Lin, Bill Byrne
https://arxiv.org/abs/2508.16406 …
TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks
İrem Demirtaş, Burak Payzun, Seçil Arslan
https://arxiv.org/abs/2508.16243
Gödel Test: Can Large Language Models Solve Easy Conjectures?
Moran Feldman, Amin Karbasi
https://arxiv.org/abs/2509.18383 https://arxiv.org/pdf/250…
A Comprehensive Evaluation on Quantization Techniques for Large Language Models
Yutong Liu, Cairong Zhao, Guosheng Hu
https://arxiv.org/abs/2507.17417 http…
Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards
Honghao Chen, Xingzhou Lou, Xiaokun Feng, Kaiqi Huang, Xinlong Wang
https://arxiv.org/abs/2509.19003
V-GameGym: Visual Game Generation for Code Large Language Models
Wei Zhang, Jack Yang, Renshuai Tao, Lingzheng Chai, Shawn Guo, Jiajun Wu, Xiaoming Chen, Ganqu Cui, Ning Ding, Xander Xu, Hu Wei, Bowen Zhou
https://arxiv.org/abs/2509.20136
DiagR1: A Vision-Language Model Trained via Reinforcement Learning for Digestive Pathology Diagnosis
Minxi Ouyang, Lianghui Zhu, Yaqing Bao, Qiang Huang, Jingli Ouyang, Tian Guan, Xitong Ling, Jiawen Li, Song Duan, Wenbin Dai, Li Zheng, Xuemei Zhang, Yonghong He
https://arxiv.org/abs/2507.18433
When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models
Yingming Zheng, Hanqi Li, Kai Yu, Lu Chen
https://arxiv.org/abs/2509.18762
Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models
Andrii Balashov
https://arxiv.org/abs/2507.17107 https://
A Cost-Benefit Analysis of On-Premise Large Language Model Deployment: Breaking Even with Commercial LLM Services
Guanzhong Pan, Haibo Wang
https://arxiv.org/abs/2509.18101 http…
Lang2Morph: Language-Driven Morphological Design of Robotic Hands
Yanyuan Qiao, Kieran Gilday, Yutong Xie, Josie Hughes
https://arxiv.org/abs/2509.18937 https://
LLM4MEA: Data-free Model Extraction Attacks on Sequential Recommenders via Large Language Models
Shilong Zhao, Fei Sun, Kaike Zhang, Shaoling Jing, Du Su, Zhichao Shi, Zhiyi Yin, Huawei Shen, Xueqi Cheng
https://arxiv.org/abs/2507.16969
MizanQA: Benchmarking Large Language Models on Moroccan Legal Question Answering
Adil Bahaj, Mounir Ghogho
https://arxiv.org/abs/2508.16357 https://arxiv.o…
Assertion Messages with Large Language Models (LLMs) for Code
Ahmed Aljohani, Anamul Haque Mollah, Hyunsook Do
https://arxiv.org/abs/2509.19673 https://arx…
Scout: Leveraging Large Language Models for Rapid Digital Evidence Discovery
Shariq Murtuza
https://arxiv.org/abs/2507.18478 https://arxiv.org/pdf/2507.184…
Structuring GUI Elements through Vision Language Models: Towards Action Space Generation
Yi Xu, Yesheng Zhang, Jiajia Liu, Jingdong Chen
https://arxiv.org/abs/2508.16271 https:/…
SPADE: A Large Language Model Framework for Soil Moisture Pattern Recognition and Anomaly Detection in Precision Agriculture
Yeonju Lee, Rui Qi Chen, Joseph Oboamah, Po Nien Su, Wei-zhen Liang, Yeyin Shi, Lu Gan, Yongsheng Chen, Xin Qiao, Jing Li
https://arxiv.org/abs/2509.18123
SiLQ: Simple Large Language Model Quantization-Aware Training
Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha
https://arxiv.org/abs/2507.16933
Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Chaojun Nie, Jun Zhou, Guanxiang Wang, Shisong Wud, Zichen Wang
https://arxiv.org/abs/2509.20162
How Small is Enough? Empirical Evidence of Quantized Small Language Models for Automated Program Repair
Kazuki Kusama, Honglin Shu, Masanari Kondo, Yasutaka Kamei
https://arxiv.org/abs/2508.16499
LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models
Doohee You, Andy Parisi, Zach Vander Velden, Lara Dantas Inojosa
https://arxiv.org/abs/2508.16478
Temporal-Aware User Behaviour Simulation with Large Language Models for Recommender Systems
Xinye Wanyan, Danula Hettiachchi, Chenglong Ma, Ziqi Xu, Jeffrey Chan
https://arxiv.org/abs/2509.16895
ThinkFake: Reasoning in Multimodal Large Language Models for AI-Generated Image Detection
Tai-Ming Huang, Wei-Tung Lin, Kai-Lung Hua, Wen-Huang Cheng, Junichi Yamagishi, Jun-Cheng Chen
https://arxiv.org/abs/2509.19841
Enhancing Requirement Traceability through Data Augmentation Using Large Language Models
Jianzhang Zhang, Jialong Zhou, Nan Niu, Chuang Liu
https://arxiv.org/abs/2509.20149 http…
Evaluating the Safety and Skill Reasoning of Large Reasoning Models Under Compute Constraints
Adarsha Balaji, Le Chen, Rajeev Thakur, Franck Cappello, Sandeep Madireddy
https://arxiv.org/abs/2509.18382
AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field
Chen Liang, Zhaoqi Huang, Haofen Wang, Fu Chai, Chunying Yu, Huanhuan Wei, Zhengjie Liu, Yanpeng Li, Hongjun Wang, Ruifeng Luo, Xianzhong Zhao
https://arxiv.org/abs/2509.18776
Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
Sabri Boughorbel, Fahim Dalvi, Nadir Durrani, Majd Hawasly
https://arxiv.org/abs/2509.18792
Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning
Guoxin Wang, Jun Zhao, Xinyi Liu, Yanbo Liu, Xuyang Cao, Chao Li, Zhuoyun Liu, Qintian Sun, Fangru Zhou, Haoqiang Xing, Zhenhong Yang
https://arxiv.org/abs/2509.19090
CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage
Na Li, Yansong Gao, Hongsheng Hu, Boyu Kuang, Anmin Fu
https://arxiv.org/abs/2507.16872 https://
Seed&Steer: Guiding Large Language Models with Compilable Prefix and Branch Signals for Unit Test Generation
Shuaiyu Zhou, Zhengran Zeng, Xiaoling Zhou, Rui Xie, Shikun Zhang, Wei Ye
https://arxiv.org/abs/2507.17271
Retrieval Enhanced Feedback via In-context Neural Error-book
Jongyeop Hyun, Bumsoo Kim
https://arxiv.org/abs/2508.16313 https://arxiv.org/pdf/2508.16313
Memory in Large Language Models: Mechanisms, Evaluation and Evolution
Dianxing Zhang, Wendong Li, Kani Song, Jiaye Lu, Gang Li, Liuchun Yang, Sheng Li
https://arxiv.org/abs/2509.18868
Steering Multimodal Large Language Models Decoding for Context-Aware Safety
Zheyuan Liu, Zhangchen Xu, Guangyao Dou, Xiangchi Yuan, Zhaoxuan Tan, Radha Poovendran, Meng Jiang
https://arxiv.org/abs/2509.19212
Understanding the Supply Chain and Risks of Large Language Model Applications
Yujie Ma, Lili Quan, Xiaofei Xie, Qiang Hu, Jiongchi Yu, Yao Zhang, Sen Chen
https://arxiv.org/abs/2507.18105
CEQuest: Benchmarking Large Language Models for Construction Estimation
Yanzhao Wu, Lufan Wang, Rui Liu
https://arxiv.org/abs/2508.16081 https://arxiv.org/…
SilentStriker: Toward Stealthy Bit-Flip Attacks on Large Language Models
Haotian Xu, Qingsong Peng, Jie Shi, Huadi Zheng, Yu Li, Cheng Zhuo
https://arxiv.org/abs/2509.17371 https…
From latent factors to language: a user study on LLM-generated explanations for an inherently interpretable matrix-based recommender system
Maxime Manderlier, Fabian Lecron, Olivier Vu Thanh, Nicolas Gillis
https://arxiv.org/abs/2509.18980
Ethical Considerations of Large Language Models in Game Playing
Qingquan Zhang, Yuchen Li, Bo Yuan, Julian Togelius, Georgios N. Yannakakis, Jialin Liu
https://arxiv.org/abs/2508.16065
Beyond Language Barriers: Multi-Agent Coordination for Multi-Language Code Generation
Micheline Bénédicte Moumoula, Serge Lionel Nikiema, Albérick Euraste Djire, Abdoul Kader Kabore, Jacques Klein, Tegawendé F. Bissyande
https://arxiv.org/abs/2509.19918
iFinder: Structured Zero-Shot Vision-Based LLM Grounding for Dash-Cam Video Reasoning
Manyi Yao, Bingbing Zhuang, Sparsh Garg, Amit Roy-Chowdhury, Christian Shelton, Manmohan Chandraker, Abhishek Aich
https://arxiv.org/abs/2509.19552
The Moral Gap of Large Language Models
Maciej Skorski, Alina Landowska
https://arxiv.org/abs/2507.18523 https://arxiv.org/pdf/2507.18523
Understanding Post-Training Structural Changes in Large Language Models
Xinyu He, Xianghui Cao
https://arxiv.org/abs/2509.17866 https://arxiv.org/pdf/2509.…
LLM-GUARD: Large Language Model-Based Detection and Repair of Bugs and Security Vulnerabilities in C and Python
Akshay Mhatre, Noujoud Nader, Patrick Diehl, Deepti Gupta
https://arxiv.org/abs/2508.16419
Global-Recent Semantic Reasoning on Dynamic Text-Attributed Graphs with Large Language Models
Yunan Wang, Jianxin Li, Ziwei Zhang
https://arxiv.org/abs/2509.18742 https://
BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems
Malsha Ashani Mahawatta Dona, Beatriz Cabrero-Daniel, Yinan Yu, Christian Berger
https://arxiv.org/abs/2507.17722
BadReasoner: Planting Tunable Overthinking Backdoors into Large Reasoning Models for Fun or Profit
Biao Yi, Zekun Fei, Jianing Geng, Tong Li, Lihai Nie, Zheli Liu, Yiming Li
https://arxiv.org/abs/2507.18305
DeepThink3D: Enhancing Large Language Models with Programmatic Reasoning in Complex 3D Situated Reasoning Tasks
Jiayi Song, Rui Wan, Lipeng Ma, Weidong Yang, Qingyuan Zhou, Yixuan Li, Ben Fei
https://arxiv.org/abs/2508.15548
NoCode-bench: A Benchmark for Evaluating Natural Language-Driven Feature Addition
Le Deng, Zhonghao Jiang, Jialun Cao, Michael Pradel, Zhongxin Liu
https://arxiv.org/abs/2507.18130
Boardwalk: Towards a Framework for Creating Board Games with LLMs
Álvaro Guglielmin Becker, Gabriel Bauer de Oliveira, Lana Bertoldo Rossato, Anderson Rocha Tavares
https://arxiv.org/abs/2508.16447
Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates
Hy Dang, Tianyi Liu, Zhuofeng Wu, Jingfeng Yang, Haoming Jiang, Tao Yang, Pei Chen, Zhengyang Wang, Helen Wang, Huasheng Li, Bing Yin, Meng Jiang
https://arxiv.org/abs/2509.18076
Multi-Hierarchical Feature Detection for Large Language Model Generated Text
Luyan Zhang, Xinyu Xie
https://arxiv.org/abs/2509.18862 https://arxiv.org/pdf/…
Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models
Renjie Pi, Kehao Miao, Li Peihang, Runtao Liu, Jiahui Gao, Jipeng Zhang, Xiaofang Zhou
https://arxiv.org/abs/2509.16149
Towards Recommending Usability Improvements with Multimodal Large Language Models
Sebastian Lubos, Alexander Felfernig, Gerhard Leitner, Julian Schwazer
https://arxiv.org/abs/2508.16165
EngiBench: A Benchmark for Evaluating Large Language Models on Engineering Problem Solving
Xiyuan Zhou, Xinlei Wang, Yirui He, Yang Wu, Ruixi Zou, Yuheng Cheng, Yulu Xie, Wenxuan Liu, Huan Zhao, Yan Xu, Jinjin Gu, Junhua Zhao
https://arxiv.org/abs/2509.17677
RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution
Haodong He, Yancheng Bai, Rui Lan, Xu Duan, Lei Sun, Xiangxiang Chu, Gui-Song Xia
https://arxiv.org/abs/2508.16158
LOCOFY Large Design Models -- Design to code conversion solution
Sohaib Muhammad, Ashwati Vipin, Karan Shetti, Honey Mittal
https://arxiv.org/abs/2507.16208
Extractive Fact Decomposition for Interpretable Natural Language Inference in one Forward Pass
Nicholas Popović, Michael Färber
https://arxiv.org/abs/2509.18901 https…
Distilled Large Language Model in Confidential Computing Environment for System-on-Chip Design
Dong Ben, Hui Feng, Qian Wang
https://arxiv.org/abs/2507.16226
From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training
Tianqiao Liu, Xueyi Li, Hao Wang, Haoxuan Li, Zhichao Chen, Weiqi Luo, Zitao Liu
https://arxiv.org/abs/2509.20072
MedOmni-45°: A Safety-Performance Benchmark for Reasoning-Oriented LLMs in Medicine
Kaiyuan Ji, Yijin Guo, Zicheng Zhang, Xiangyang Zhu, Yuan Tian, Ning Liu, Guangtao Zhai
https://arxiv.org/abs/2508.16213
Not All Features Deserve Attention: Graph-Guided Dependency Learning for Tabular Data Generation with Language Models
Zheyu Zhang, Shuo Yang, Bardh Prenkaj, Gjergji Kasneci
https://arxiv.org/abs/2507.18504
TERAG: Token-Efficient Graph-Based Retrieval-Augmented Generation
Qiao Xiao, Hong Ting Tsang, Jiaxin Bai
https://arxiv.org/abs/2509.18667 https://arxiv.org…
CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition
Sina J. Semnani, Han Zhang, Xinyan He, Merve Tekgürler, Monica S. Lam
https://arxiv.org/abs/2509.19768
Exploring Large Language Models for Analyzing and Improving Method Names in Scientific Code
Gunnar Larsen, Carol Wong, Anthony Peruma
https://arxiv.org/abs/2507.16439
Restoring Rhythm: Punctuation Restoration Using Transformer Models for Bangla, a Low-Resource Language
Md Obyedullahil Mamun, Md Adyelullahil Mamun, Arif Ahmad, Md. Imran Hossain Emu
https://arxiv.org/abs/2507.18448
Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Yakup Abrek Er, Ilker Kesen, Gözde Gül Şahin, Aykut Erdem
https://arxiv.org/abs/2508.16431
D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models
Satyapriya Krishna, Andy Zou, Rahul Gupta, Eliot Krzysztof Jones, Nick Winter, Dan Hendrycks, J. Zico Kolter, Matt Fredrikson, Spyros Matsoukas
https://arxiv.org/abs/2509.17938
HAMSA: Hijacking Aligned Compact Models via Stealthy Automation
Alexey Krylov, Iskander Vagizov, Dmitrii Korzh, Maryam Douiba, Azidine Guezzaz, Vladimir Kokh, Sergey D. Erokhin, Elena V. Tutubalina, Oleg Y. Rogov
https://arxiv.org/abs/2508.16484
EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation
Sen Yang, Yu Bao, Yu Lu, Jiajun Chen, Shujian Huang, Shanbo Cheng
https://arxiv.org/abs/2509.19770 htt…
AraTable: Benchmarking LLMs' Reasoning and Understanding of Arabic Tabular Data
Rana Alshaikh, Israa Alghanmi, Shelan Jeawak
https://arxiv.org/abs/2507.18442 https://…
Variation in Verification: Understanding Verification Dynamics in Large Language Models
Yefan Zhou, Austin Xu, Yilun Zhou, Janvijay Singh, Jiang Gui, Shafiq Joty
https://arxiv.org/abs/2509.17995
Text Takes Over: A Study of Modality Bias in Multimodal Intent Detection
Ankan Mullick, Saransh Sharma, Abhik Jana, Pawan Goyal
https://arxiv.org/abs/2508.16122 https://
DeepMEL: A Multi-Agent Collaboration Framework for Multimodal Entity Linking
Fang Wang, Tianwei Yan, Zonghao Yang, Minghao Hu, Jun Zhang, Zhunchen Luo, Xiaoying Bai
https://arxiv.org/abs/2508.15876
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[4/5]:
- Macroeconomic Forecasting with Large Language Models
Andrea Carriero, Davide Pettenuzzo, Shubhranshu Shekhar
Crosslisted article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/2]:
- Safe-SAIL: Towards a Fine-grained Safety Landscape of Large Language Models via Sparse Autoencode...
Jiaqi Weng, Han Zheng, Hanyu Zhang, Qinqin He, Jialing Tao, Hui Xue, Zhixuan Chu, Xiting Wang
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[2/5]:
- Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models
Blanca Calvo Figueras, Rodrigo Agerri