Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csLG_bot@mastoxiv.page
2025-09-25 10:51:12

Video models are zero-shot learners and reasoners
Thadd\"aus Wiedemer, Yuxuan Li, Paul Vicol, Shixiang Shane Gu, Nick Matarese, Kevin Swersky, Been Kim, Priyank Jaini, Robert Geirhos
arxiv.org/abs/2509.20328

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 10:13:24

Advances in Large Language Models for Medicine
Zhiyu Kan, Wensheng Gan, Zhenlian Qi, Philip S. Yu
arxiv.org/abs/2509.18690 arxiv.org/pdf/25…

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:01:20

Political Ideology Shifts in Large Language Models
Pietro Bernardelle, Stefano Civelli, Leon Fr\"ohling, Riccardo Lunardi, Kevin Roitero, Gianluca Demartini
arxiv.org/abs/2508.16013

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 07:33:40

VT-LVLM-AR: A Video-Temporal Large Vision-Language Model Adapter for Fine-Grained Action Recognition in Long-Term Videos
Kaining Li, Shuwei He, Zihan Xu
arxiv.org/abs/2508.15903

@arXiv_csCR_bot@mastoxiv.page
2025-07-25 08:41:32

RECALLED: An Unbounded Resource Consumption Attack on Large Vision-Language Models
Haoran Gao, Yuanhe Zhang, Zhenhong Zhou, Lei Jiang, Fanyu Meng, Yujia Xiao, Kun Wang, Yang Liu, Junlan Feng
arxiv.org/abs/2507.18053

@arXiv_csRO_bot@mastoxiv.page
2025-07-25 08:43:42

OpenNav: Open-World Navigation with Multimodal Large Language Models
Mingfeng Yuan, Letian Wang, Steven L. Waslander
arxiv.org/abs/2507.18033

@arXiv_csSD_bot@mastoxiv.page
2025-07-25 08:50:42

DIFFA: Large Language Diffusion Models Can Listen and Understand
Jiaming Zhou, Hongjie Chen, Shiwan Zhao, Jian Kang, Jie Li, Enzhi Wang, Yujie Guo, Haoqin Sun, Hui Wang, Aobo Kong, Yong Qin, Xuelong Li
arxiv.org/abs/2507.18452

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 08:02:02

Reverse Engineering User Stories from Code using Large Language Models
Mohamed Ouf, Haoyu Li, Michael Zhang, Mariam Guizani
arxiv.org/abs/2509.19587

@mia@hcommons.social
2025-09-25 11:35:17

Zhu on AI/AGI: 'A sign of true intelligence... is the ability to reason towards a goal with minimal inputs ... a “small data, big task” approach, compared with the “big data, small task” approach employed by large language models like ChatGPT. AGI... is characterised by qualities such as resourcefulness in novel situations, social and physical intuition, and an understanding of cause and effect. Large language models... will never achieve this.'

@arXiv_csCY_bot@mastoxiv.page
2025-09-25 08:27:52

Affective Computing and Emotional Data: Challenges and Implications in Privacy Regulations, The AI Act, and Ethics in Large Language Models
Nicola Fabiano
arxiv.org/abs/2509.20153

@arXiv_eessIV_bot@mastoxiv.page
2025-07-24 09:04:50

A Versatile Pathology Co-pilot via Reasoning Enhanced Multimodal Large Language Model
Zhe Xu, Ziyi Liu, Junlin Hou, Jiabo Ma, Cheng Jin, Yihui Wang, Zhixuan Chen, Zhengyu Zhang, Zhengrui Guo, Fengtao Zhou, Yingxue Xu, Xi Wang, Ronald Cheong Kin Chan, Li Liang, Hao Chen
arxiv.org/abs/2507.17303

@arXiv_csIR_bot@mastoxiv.page
2025-07-24 07:34:59

A Query-Aware Multi-Path Knowledge Graph Fusion Approach for Enhancing Retrieval-Augmented Generation in Large Language Models
Qikai Wei, Huansheng Ning, Chunlong Han, Jianguo Ding
arxiv.org/abs/2507.16826

@arXiv_csPL_bot@mastoxiv.page
2025-08-25 08:33:10

Leveraging Large Language Models to Detect Missed Peephole Optimizations
Zhenyang Xu, Hongxu Xu, Yongqiang Tian, Xintong Zhou, Chengnian Sun
arxiv.org/abs/2508.16125

@arXiv_csDC_bot@mastoxiv.page
2025-07-23 08:27:32

Cooling Matters: Benchmarking Large Language Models and Vision-Language Models on Liquid-Cooled Versus Air-Cooled H100 GPU Systems
Imran Latif, Muhammad Ali Shafique, Hayat Ullah, Alex C. Newkirk, Xi Yu, Arslan Munir
arxiv.org/abs/2507.16781

@arXiv_csAR_bot@mastoxiv.page
2025-08-25 08:11:00

Hardwired-Neurons Language Processing Units as General-Purpose Cognitive Substrates
Yang Liu, Yi Chen, Yongwei Zhao, Yifan Hao, Zifu Zheng, Weihao Kong, Zhangmai Li, Dongchen Jiang, Ruiyang Xia, Zhihong Ma, Zisheng Liu, Zhaoyong Wan, Yunqi Lu, Ximing Liu, Hongrui Guo, Zhihao Yang, Zhe Wang, Tianrui Ma, Mo Zou, Rui Zhang, Ling Li, Xing Hu, Zidong Du, Zhiwei Xu, Qi Guo, Tianshi Chen, Yunji Chen

@arXiv_csNI_bot@mastoxiv.page
2025-08-25 08:58:30

Congestion Control System Optimization with Large Language Models
Zhiyuan He, Aashish Gottipati, Lili Qiu, Yuqing Yang, Francis Y. Yan
arxiv.org/abs/2508.16074

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:35:42

Benchmarking Gaslighting Attacks Against Speech Large Language Models
Jinyang Wu, Bin Zhu, Xiandong Zou, Qiquan Zhang, Xu Fang, Pan Zhou
arxiv.org/abs/2509.19858

@arXiv_csCE_bot@mastoxiv.page
2025-07-24 08:08:29

Reasoning-Driven Retrosynthesis Prediction with Large Language Models via Reinforcement Learning
Situo Zhang, Hanqi Li, Lu Chen, Zihan Zhao, Xuanze Lin, Zichen Zhu, Bo Chen, Xin Chen, Kai Yu
arxiv.org/abs/2507.17448

@arXiv_econEM_bot@mastoxiv.page
2025-07-24 07:40:39

Decoding Consumer Preferences Using Attention-Based Language Models
Joshua Foster, Fredrik Odegaard
arxiv.org/abs/2507.17564 arxiv.org/pdf/…

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:39:32

EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models
Botai Yuan, Yutian Zhou, Yingjie Wang, Fushuo Huo, Yongcheng Jing, Li Shen, Ying Wei, Zhiqi Shen, Ziwei Liu, Tianwei Zhang, Jie Yang, Dacheng Tao
arxiv.org/abs/2509.20146

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 10:31:04

Data Efficient Adaptation in Large Language Models via Continuous Low-Rank Fine-Tuning
Xiao Han, Zimo Zhao, Wanyu Wang, Maolin Wang, Zitao Liu, Yi Chang, Xiangyu Zhao
arxiv.org/abs/2509.18942

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 09:50:20

On the Evolution of Federated Post-Training Large Language Models: A Model Accessibility View
Tao Guo, Junxiao Wang, Fushuo Huo, Laizhong Cui, Song Guo, Jie Gui, Dacheng Tao
arxiv.org/abs/2508.16261

@arXiv_csMA_bot@mastoxiv.page
2025-08-25 09:03:40

Building and Measuring Trust between Large Language Models
Maarten Buyl, Yousra Fettach, Guillaume Bied, Tijl De Bie
arxiv.org/abs/2508.15858

@arXiv_csSE_bot@mastoxiv.page
2025-07-25 09:42:32

Automated Code Review Using Large Language Models with Symbolic Reasoning
Busra Icoz, Goksel Biricik
arxiv.org/abs/2507.18476 arxiv.org/pdf…

@arXiv_csCR_bot@mastoxiv.page
2025-08-25 09:25:20

Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models
Guangyu Yang, Jinghong Chen, Jingbiao Mei, Weizhe Lin, Bill Byrne
arxiv.org/abs/2508.16406

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:04:10

TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks
\.Irem Demirta\c{s}, Burak Payzun, Se\c{c}il Arslan
arxiv.org/abs/2508.16243

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 09:33:34

G\"odel Test: Can Large Language Models Solve Easy Conjectures?
Moran Feldman, Amin Karbasi
arxiv.org/abs/2509.18383 arxiv.org/pdf/250…

@arXiv_csLG_bot@mastoxiv.page
2025-07-24 10:18:59

A Comprehensive Evaluation on Quantization Techniques for Large Language Models
Yutong Liu, Cairong Zhao, Guosheng Hu
arxiv.org/abs/2507.17417

@arXiv_csCV_bot@mastoxiv.page
2025-09-24 11:05:54

Unveiling Chain of Step Reasoning for Vision-Language Models with Fine-grained Rewards
Honghao Chen, Xingzhou Lou, Xiaokun Feng, Kaiqi Huang, Xinlong Wang
arxiv.org/abs/2509.19003

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 09:38:22

V-GameGym: Visual Game Generation for Code Large Language Models
Wei Zhang, Jack Yang, Renshuai Tao, Lingzheng Chai, Shawn Guo, Jiajun Wu, Xiaoming Chen, Ganqu Cui, Ning Ding, Xander Xu, Hu Wei, Bowen Zhou
arxiv.org/abs/2509.20136

@arXiv_eessIV_bot@mastoxiv.page
2025-07-25 09:50:32

DiagR1: A Vision-Language Model Trained via Reinforcement Learning for Digestive Pathology Diagnosis
Minxi Ouyang, Lianghui Zhu, Yaqing Bao, Qiang Huang, Jingli Ouyang, Tian Guan, Xitong Ling, Jiawen Li, Song Duan, Wenbin Dai, Li Zheng, Xuemei Zhang, Yonghong He
arxiv.org/abs/2507.18433

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:34:34

When Long Helps Short: How Context Length in Supervised Fine-tuning Affects Behavior of Large Language Models
Yingming Zheng, Hanqi Li, Kai Yu, Lu Chen
arxiv.org/abs/2509.18762

@arXiv_csLG_bot@mastoxiv.page
2025-07-24 09:53:09

Reinforcement Learning Fine-Tunes a Sparse Subnetwork in Large Language Models
Andrii Balashov
arxiv.org/abs/2507.17107

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 07:30:24

A Cost-Benefit Analysis of On-Premise Large Language Model Deployment: Breaking Even with Commercial LLM Services
Guanzhong Pan, Haibo Wang
arxiv.org/abs/2509.18101

@arXiv_csRO_bot@mastoxiv.page
2025-09-24 10:40:04

Lang2Morph: Language-Driven Morphological Design of Robotic Hands
Yanyuan Qiao, Kieran Gilday, Yutong Xie, Josie Hughes
arxiv.org/abs/2509.18937

@arXiv_csIR_bot@mastoxiv.page
2025-07-24 07:39:39

LLM4MEA: Data-free Model Extraction Attacks on Sequential Recommenders via Large Language Models
Shilong Zhao, Fei Sun, Kaike Zhang, Shaoling Jing, Du Su, Zhichao Shi, Zhiyi Yin, Huawei Shen, Xueqi Cheng
arxiv.org/abs/2507.16969

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:05:10

MizanQA: Benchmarking Large Language Models on Moroccan Legal Question Answering
Adil Bahaj, Mounir Ghogho
arxiv.org/abs/2508.16357 arxiv.o…

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 08:34:02

Assertion Messages with Large Language Models (LLMs) for Code
Ahmed Aljohani, Anamul Haque Mollah, Hyunsook Do
arxiv.org/abs/2509.19673 arx…

@arXiv_csCR_bot@mastoxiv.page
2025-07-25 09:26:32

Scout: Leveraging Large Language Models for Rapid Digital Evidence Discovery
Shariq Murtuza
arxiv.org/abs/2507.18478 arxiv.org/pdf/2507.184…

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:51:10

Structuring GUI Elements through Vision Language Models: Towards Action Space Generation
Yi Xu, Yesheng Zhang, jiajia Liu, Jingdong Chen
arxiv.org/abs/2508.16271

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 07:30:44

SPADE: A Large Language Model Framework for Soil Moisture Pattern Recognition and Anomaly Detection in Precision Agriculture
Yeonju Lee, Rui Qi Chen, Joseph Oboamah, Po Nien Su, Wei-zhen Liang, Yeyin Shi, Lu Gan, Yongsheng Chen, Xin Qiao, Jing Li
arxiv.org/abs/2509.18123

@arXiv_csLG_bot@mastoxiv.page
2025-07-24 09:03:49

SiLQ: Simple Large Language Model Quantization-Aware Training
Steven K. Esser, Jeffrey L. McKinstry, Deepika Bablani, Rathinakumar Appuswamy, Dharmendra S. Modha
arxiv.org/abs/2507.16933

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:44:52

Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Chaojun Nie, Jun Zhou, Guanxiang Wang, Shisong Wud, Zichen Wang
arxiv.org/abs/2509.20162

@arXiv_csSE_bot@mastoxiv.page
2025-08-25 09:37:50

How Small is Enough? Empirical Evidence of Quantized Small Language Models for Automated Program Repair
Kazuki Kusama, Honglin Shu, Masanari Kondo, Yasutaka Kamei
arxiv.org/abs/2508.16499

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:06:20

LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models
Doohee You, Andy Parisi, Zach Vander Velden, Lara Dantas Inojosa
arxiv.org/abs/2508.16478

@arXiv_csIR_bot@mastoxiv.page
2025-09-23 09:35:50

Temporal-Aware User Behaviour Simulation with Large Language Models for Recommender Systems
Xinye Wanyan, Danula Hettiachchi, Chenglong Ma, Ziqi Xu, Jeffrey Chan
arxiv.org/abs/2509.16895

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:20:32

ThinkFake: Reasoning in Multimodal Large Language Models for AI-Generated Image Detection
Tai-Ming Huang, Wei-Tung Lin, Kai-Lung Hua, Wen-Huang Cheng, Junichi Yamagishi, Jun-Cheng Chen
arxiv.org/abs/2509.19841

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 09:45:52

Enhancing Requirement Traceability through Data Augmentation Using Large Language Models
Jianzhang Zhang, Jialong Zhou, Nan Niu, Chuang Liu
arxiv.org/abs/2509.20149

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 09:30:14

Evaluating the Safety and Skill Reasoning of Large Reasoning Models Under Compute Constraints
Adarsha Balaji, Le Chen, Rajeev Thakur, Franck Cappello, Sandeep Madireddy
arxiv.org/abs/2509.18382

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:37:14

AECBench: A Hierarchical Benchmark for Knowledge Evaluation of Large Language Models in the AEC Field
Chen Liang, Zhaoqi Huang, Haofen Wang, Fu Chai, Chunying Yu, Huanhuan Wei, Zhengjie Liu, Yanpeng Li, Hongjun Wang, Ruifeng Luo, Xianzhong Zhao
arxiv.org/abs/2509.18776

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:39:04

Beyond the Leaderboard: Understanding Performance Disparities in Large Language Models via Model Diffing
Sabri Boughorbel, Fahim Dalvi, Nadir Durrani, Majd Hawasly
arxiv.org/abs/2509.18792

@arXiv_csCV_bot@mastoxiv.page
2025-09-24 11:07:34

Citrus-V: Advancing Medical Foundation Models with Unified Medical Image Grounding for Clinical Reasoning
Guoxin Wang, Jun Zhao, Xinyi Liu, Yanbo Liu, Xuyang Cao, Chao Li, Zhuoyun Liu, Qintian Sun, Fangru Zhou, Haoqiang Xing, Zhenhong Yang
arxiv.org/abs/2509.19090

@arXiv_csCR_bot@mastoxiv.page
2025-07-24 07:37:09

CompLeak: Deep Learning Model Compression Exacerbates Privacy Leakage
Na Li, Yansong Gao, Hongsheng Hu, Boyu Kuang, Anmin Fu
arxiv.org/abs/2507.16872

@arXiv_csSE_bot@mastoxiv.page
2025-07-24 09:05:30

Seed&Steer: Guiding Large Language Models with Compilable Prefix and Branch Signals for Unit Test Generation
Shuaiyu Zhou, Zhengran Zeng, Xiaoling Zhou, Rui Xie, Shikun Zhang, Wei Ye
arxiv.org/abs/2507.17271

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 09:50:50

Retrieval Enhanced Feedback via In-context Neural Error-book
Jongyeop Hyun, Bumsoo Kim
arxiv.org/abs/2508.16313 arxiv.org/pdf/2508.16313

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 10:27:34

Memory in Large Language Models: Mechanisms, Evaluation and Evolution
Dianxing Zhang, Wendong Li, Kani Song, Jiaye Lu, Gang Li, Liuchun Yang, Sheng Li
arxiv.org/abs/2509.18868

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:55:34

Steering Multimodal Large Language Models Decoding for Context-Aware Safety
Zheyuan Liu, Zhangchen Xu, Guangyao Dou, Xiangchi Yuan, Zhaoxuan Tan, Radha Poovendran, Meng Jiang
arxiv.org/abs/2509.19212

@arXiv_csSE_bot@mastoxiv.page
2025-07-25 08:45:11

Understanding the Supply Chain and Risks of Large Language Model Applications
Yujie Ma, Lili Quan, Xiaofei Xie, Qiang Hu, Jiongchi Yu, Yao Zhang, Sen Chen
arxiv.org/abs/2507.18105

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:02:20

CEQuest: Benchmarking Large Language Models for Construction Estimation
Yanzhao Wu, Lufan Wang, Rui Liu
arxiv.org/abs/2508.16081 arxiv.org/…

@arXiv_csCR_bot@mastoxiv.page
2025-09-23 11:23:51

SilentStriker:Toward Stealthy Bit-Flip Attacks on Large Language Models
Haotian Xu, Qingsong Peng, Jie Shi, Huadi Zheng, Yu Li, Cheng Zhuo
arxiv.org/abs/2509.17371

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 10:32:54

From latent factors to language: a user study on LLM-generated explanations for an inherently interpretable matrix-based recommender system
Maxime Manderlier, Fabian Lecron, Olivier Vu Thanh, Nicolas Gillis
arxiv.org/abs/2509.18980

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:02:00

Ethical Considerations of Large Language Models in Game Playing
Qingquan Zhang, Yuchen Li, Bo Yuan, Julian Togelius, Georgios N. Yannakakis, Jialin Liu
arxiv.org/abs/2508.16065

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 09:08:02

Beyond Language Barriers: Multi-Agent Coordination for Multi-Language Code Generation
Micheline B\'en\'edicte Moumoula, Serge Lionel Nikiema, Alb\'erick Euraste Djire, Abdoul Kader Kabore, Jacques Klein, Tegawend\'e F. Bissyande
arxiv.org/abs/2509.19918

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 07:40:32

iFinder: Structured Zero-Shot Vision-Based LLM Grounding for Dash-Cam Video Reasoning
Manyi Yao, Bingbing Zhuang, Sparsh Garg, Amit Roy-Chowdhury, Christian Shelton, Manmohan Chandraker, Abhishek Aich
arxiv.org/abs/2509.19552

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:07:52

The Moral Gap of Large Language Models
Maciej Skorski, Alina Landowska
arxiv.org/abs/2507.18523 arxiv.org/pdf/2507.18523

@arXiv_csLG_bot@mastoxiv.page
2025-09-23 12:45:10

Understanding Post-Training Structural Changes in Large Language Models
Xinyu He, Xianghui Cao
arxiv.org/abs/2509.17866 arxiv.org/pdf/2509.…

@arXiv_csSE_bot@mastoxiv.page
2025-08-25 09:37:10

LLM-GUARD: Large Language Model-Based Detection and Repair of Bugs and Security Vulnerabilities in C and Python
Akshay Mhatre, Noujoud Nader, Patrick Diehl, Deepti Gupta
arxiv.org/abs/2508.16419

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:32:04

Global-Recent Semantic Reasoning on Dynamic Text-Attributed Graphs with Large Language Models
Yunan Wang, Jianxin Li, Ziwei Zhang
arxiv.org/abs/2509.18742

@arXiv_csCV_bot@mastoxiv.page
2025-07-24 10:30:09

BetterCheck: Towards Safeguarding VLMs for Automotive Perception Systems
Malsha Ashani Mahawatta Dona, Beatriz Cabrero-Daniel, Yinan Yu, Christian Berger
arxiv.org/abs/2507.17722

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 09:58:42

BadReasoner: Planting Tunable Overthinking Backdoors into Large Reasoning Models for Fun or Profit
Biao Yi, Zekun Fei, Jianing Geng, Tong Li, Lihai Nie, Zheli Liu, Yiming Li
arxiv.org/abs/2507.18305

@arXiv_csAI_bot@mastoxiv.page
2025-08-22 10:00:31

DeepThink3D: Enhancing Large Language Models with Programmatic Reasoning in Complex 3D Situated Reasoning Tasks
Jiayi Song, Rui Wan, Lipeng Ma, Weidong Yang, Qingyuan Zhou, Yixuan Li, Ben Fei
arxiv.org/abs/2508.15548

@arXiv_csSE_bot@mastoxiv.page
2025-07-25 08:52:42

NoCode-bench: A Benchmark for Evaluating Natural Language-Driven Feature Addition
Le Deng, Zhonghao Jiang, Jialun Cao, Michael Pradel, Zhongxin Liu
arxiv.org/abs/2507.18130

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 09:54:30

Boardwalk: Towards a Framework for Creating Board Games with LLMs
\'Alvaro Guglielmin Becker, Gabriel Bauer de Oliveira, Lana Bertoldo Rossato, Anderson Rocha Tavares
arxiv.org/abs/2508.16447

@arXiv_csAI_bot@mastoxiv.page
2025-09-23 12:06:20

Improving Large Language Models Function Calling and Interpretability via Guided-Structured Templates
Hy Dang, Tianyi Liu, Zhuofeng Wu, Jingfeng Yang, Haoming Jiang, Tao Yang, Pei Chen, Zhengyang Wang, Helen Wang, Huasheng Li, Bing Yin, Meng Jiang
arxiv.org/abs/2509.18076

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:39:54

Multi-Hierarchical Feature Detection for Large Language Model Generated Text
Luyan Zhang, Xinyu Xie
arxiv.org/abs/2509.18862 arxiv.org/pdf/…

@arXiv_csCV_bot@mastoxiv.page
2025-09-22 10:37:01

Pointing to a Llama and Call it a Camel: On the Sycophancy of Multimodal Large Language Models
Renjie Pi, Kehao Miao, Li Peihang, Runtao Liu, Jiahui Gao, Jipeng Zhang, Xiaofang Zhou
arxiv.org/abs/2509.16149

@arXiv_csSE_bot@mastoxiv.page
2025-08-25 09:15:40

Towards Recommending Usability Improvements with Multimodal Large Language Models
Sebastian Lubos, Alexander Felfernig, Gerhard Leitner, Julian Schwazer
arxiv.org/abs/2508.16165

@arXiv_csAI_bot@mastoxiv.page
2025-09-23 11:57:40

EngiBench: A Benchmark for Evaluating Large Language Models on Engineering Problem Solving
Xiyuan Zhou, Xinlei Wang, Yirui He, Yang Wu, Ruixi Zou, Yuheng Cheng, Yulu Xie, Wenxuan Liu, Huan Zhao, Yan Xu, Jinjin Gu, Junhua Zhao
arxiv.org/abs/2509.17677

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:42:40

RAGSR: Regional Attention Guided Diffusion for Image Super-Resolution
Haodong He, Yancheng Bai, Rui Lan, Xu Duan, Lei Sun, Xiangxiang Chu, Gui-Song Xia
arxiv.org/abs/2508.16158

@arXiv_csSE_bot@mastoxiv.page
2025-07-23 09:22:32

LOCOFY Large Design Models -- Design to code conversion solution
Sohaib Muhammad, Ashwati Vipin, Karan Shetti, Honey Mittal
arxiv.org/abs/2507.16208

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 10:44:24

Extractive Fact Decomposition for Interpretable Natural Language Inference in one Forward Pass
Nicholas Popovi\v{c}, Michael F\"arber
arxiv.org/abs/2509.18901

@arXiv_csAI_bot@mastoxiv.page
2025-07-23 09:52:22

Distilled Large Language Model in Confidential Computing Environment for System-on-Chip Design
Dong Ben, Hui Feng, Qian Wang
arxiv.org/abs/2507.16226

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:42:42

From Text to Talk: Audio-Language Model Needs Non-Autoregressive Joint Training
Tianqiao Liu, Xueyi Li, Hao Wang, Haoxuan Li, Zhichao Chen, Weiqi Luo, Zitao Liu
arxiv.org/abs/2509.20072

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 09:47:30

MedOmni-45{\deg}: A Safety-Performance Benchmark for Reasoning-Oriented LLMs in Medicine
Kaiyuan Ji, Yijin Guo, Zicheng Zhang, Xiangyang Zhu, Yuan Tian, Ning Liu, Guangtao Zhai
arxiv.org/abs/2508.16213

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:07:42

Not All Features Deserve Attention: Graph-Guided Dependency Learning for Tabular Data Generation with Language Models
Zheyu Zhang, Shuo Yang, Bardh Prenkaj, Gjergji Kasneci
arxiv.org/abs/2507.18504

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 10:10:14

TERAG: Token-Efficient Graph-Based Retrieval-Augmented Generation
Qiao Xiao, Hong Ting Tsang, Jiaxin Bai
arxiv.org/abs/2509.18667 arxiv.org…

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:32:02

CHURRO: Making History Readable with an Open-Weight Large Vision-Language Model for High-Accuracy, Low-Cost Historical Text Recognition
Sina J. Semnani, Han Zhang, Xinyan He, Merve Tekg\"urler, Monica S. Lam
arxiv.org/abs/2509.19768

@arXiv_csSE_bot@mastoxiv.page
2025-07-23 09:41:52

Exploring Large Language Models for Analyzing and Improving Method Names in Scientific Code
Gunnar Larsen, Carol Wong, Anthony Peruma
arxiv.org/abs/2507.16439

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:06:42

Restoring Rhythm: Punctuation Restoration Using Transformer Models for Bangla, a Low-Resource Language
Md Obyedullahil Mamun, Md Adyelullahil Mamun, Arif Ahmad, Md. Imran Hossain Emu
arxiv.org/abs/2507.18448

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:05:50

Cetvel: A Unified Benchmark for Evaluating Language Understanding, Generation and Cultural Capacity of LLMs for Turkish
Yakup Abrek Er, Ilker Kesen, G\"ozde G\"ul \c{S}ahin, Aykut Erdem
arxiv.org/abs/2508.16431

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:55:31

D-REX: A Benchmark for Detecting Deceptive Reasoning in Large Language Models
Satyapriya Krishna, Andy Zou, Rahul Gupta, Eliot Krzysztof Jones, Nick Winter, Dan Hendrycks, J. Zico Kolter, Matt Fredrikson, Spyros Matsoukas
arxiv.org/abs/2509.17938

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:06:30

HAMSA: Hijacking Aligned Compact Models via Stealthy Automation
Alexey Krylov, Iskander Vagizov, Dmitrii Korzh, Maryam Douiba, Azidine Guezzaz, Vladimir Kokh, Sergey D. Erokhin, Elena V. Tutubalina, Oleg Y. Rogov
arxiv.org/abs/2508.16484

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:32:42

EnAnchored-X2X: English-Anchored Optimization for Many-to-Many Translation
Sen Yang, Yu Bao, Yu Lu, Jiajun Chen, Shujian Huang, Shanbo Cheng
arxiv.org/abs/2509.19770

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:06:32

AraTable: Benchmarking LLMs' Reasoning and Understanding of Arabic Tabular Data
Rana Alshaikh, Israa Alghanmi, Shelan Jeawak
arxiv.org/abs/2507.18442

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:57:11

Variation in Verification: Understanding Verification Dynamics in Large Language Models
Yefan Zhou, Austin Xu, Yilun Zhou, Janvijay Singh, Jiang Gui, Shafiq Joty
arxiv.org/abs/2509.17995

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:03:10

Text Takes Over: A Study of Modality Bias in Multimodal Intent Detection
Ankan Mullick, Saransh Sharma, Abhik Jana, Pawan Goyal
arxiv.org/abs/2508.16122

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:00:10

DeepMEL: A Multi-Agent Collaboration Framework for Multimodal Entity Linking
Fang Wang, Tianwei Yan, Zonghao Yang, Minghao Hu, Jun Zhang, Zhunchen Luo, Xiaoying Bai
arxiv.org/abs/2508.15876

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 14:22:29

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[4/5]:
- Macroeconomic Forecasting with Large Language Models
Andrea Carriero, Davide Pettenuzzo, Shubhranshu Shekhar

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 12:25:23

Crosslisted article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[1/2]:
- Safe-SAIL: Towards a Fine-grained Safety Landscape of Large Language Models via Sparse Autoencode...
Jiaqi Weng, Han Zheng, Hanyu Zhang, Qinqin He, Jialing Tao, Hui Xue, Zhixuan Chu, Xiting Wang

@arXiv_csCL_bot@mastoxiv.page
2025-09-24 15:09:53

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[2/5]:
- Benchmarking Critical Questions Generation: A Challenging Reasoning Task for Large Language Models
Banca Calvo Figueras, Rodrigo Agerri