Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:12:16

Demographic Biases and Gaps in the Perception of Sexism in Large Language Models
Judith Tavarez-Rodr\'iguez, Fernando S\'anchez-Vega, A. Pastor L\'opez-Monroy
arxiv.org/abs/2508.18245

@arXiv_csCR_bot@mastoxiv.page
2025-08-26 11:02:26

Risk Assessment and Security Analysis of Large Language Models
Xiaoyan Zhang, Dongyang Lyu, Xiaoqi Li
arxiv.org/abs/2508.17329 arxiv.org/pd…

@arXiv_csAI_bot@mastoxiv.page
2025-09-26 07:38:21

Cognitive Load Limits in Large Language Models: Benchmarking Multi-Hop Reasoning
Sai Teja Reddy Adapala
arxiv.org/abs/2509.19517 arxiv.org/…

@arXiv_csLG_bot@mastoxiv.page
2025-08-26 12:25:46

CMPhysBench: A Benchmark for Evaluating Large Language Models in Condensed Matter Physics
Weida Wang, Dongchen Huang, Jiatong Li, Tengchao Yang, Ziyang Zheng, Di Zhang, Dong Han, Benteng Chen, Binzhao Luo, Zhiyu Liu, Kunling Liu, Zhiyuan Gao, Shiqi Geng, Wei Ma, Jiaming Su, Xin Li, Shuchen Pu, Yuhan Shui, Qianjia Cheng, Zhihao Dou, Dongfei Cui, Changyong He, Jin Zeng, Zeke Xie, Mao Su, Dongzhan Zhou, Yuqiang Li, Wanli Ouyang, Lei Bai, Yunqi Cai, Xi Dai, Shufei Zhang, Jinguang Cheng, Zh…

@arXiv_csSE_bot@mastoxiv.page
2025-08-27 09:06:02

Interleaving Large Language Models for Compiler Testing
Yunbo Ni, Shaohua Li
arxiv.org/abs/2508.18955 arxiv.org/pdf/2508.18955

@arXiv_csHC_bot@mastoxiv.page
2025-08-26 10:36:16

Measuring Large Language Models Dependency: Validating the Arabic Version of the LLM-D12 Scale
Sameha AlShakhsi, Ala Yankouskaya, Magnus Liebherr, Raian Ali
arxiv.org/abs/2508.17063

@arXiv_csCY_bot@mastoxiv.page
2025-09-26 08:28:41

Communication Bias in Large Language Models: A Regulatory Perspective
Adrian Kuenzler, Stefan Schmid
arxiv.org/abs/2509.21075 arxiv.org/pdf…

@arXiv_csDC_bot@mastoxiv.page
2025-08-27 08:54:32

Federated Fine-Tuning of Sparsely-Activated Large Language Models on Resource-Constrained Devices
Fahao Chen, Jie Wan, Peng Li, Zhou Su, Dongxiao Yu
arxiv.org/abs/2508.19078

@arXiv_csIR_bot@mastoxiv.page
2025-09-26 07:37:21

DELM: a Python toolkit for Data Extraction with Language Models
Eric Fithian, Kirill Skobelev
arxiv.org/abs/2509.20617 arxiv.org/pdf/2509.2…

@arXiv_csCE_bot@mastoxiv.page
2025-09-26 07:35:21

Difference-Guided Reasoning: A Temporal-Spatial Framework for Large Language Models
Hong Su
arxiv.org/abs/2509.20713 arxiv.org/pdf/2509.207…

@arXiv_csCV_bot@mastoxiv.page
2025-08-25 07:33:40

VT-LVLM-AR: A Video-Temporal Large Vision-Language Model Adapter for Fine-Grained Action Recognition in Long-Term Videos
Kaining Li, Shuwei He, Zihan Xu
arxiv.org/abs/2508.15903

@arXiv_csRO_bot@mastoxiv.page
2025-09-26 09:53:51

Digital Twin-Guided Robot Path Planning: A Beta-Bernoulli Fusion with Large Language Model as a Sensor
Mani Amani, Reza Akhavian
arxiv.org/abs/2509.20709

@arXiv_csMA_bot@mastoxiv.page
2025-08-27 07:36:32

Consensus Is All You Need: Gossip-Based Reasoning Among Large Language Models
Saksham Arora
arxiv.org/abs/2508.18292 arxiv.org/pdf/2508.182…

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:06:03

Beyond Quality: Unlocking Diversity in Ad Headline Generation with Large Language Models
Chang Wang, Siyu Yan, Depeng Yuan, Yuqi Chen, Yanhua Huang, Yuanhang Zheng, Shuhao Li, Yinqi Zhang, Kedi Chen, Mingrui Zhu, Ruiwen Xu
arxiv.org/abs/2508.18739

@arXiv_csDL_bot@mastoxiv.page
2025-08-26 07:37:36

Named Entity Recognition of Historical Text via Large Language Model
Shibingfeng Zhang, Giovanni Colavizza
arxiv.org/abs/2508.18090 arxiv.o…

@arXiv_csLG_bot@mastoxiv.page
2025-09-26 10:30:41

Go With The Flow: Churn-Tolerant Decentralized Training of Large Language Models
Nikolay Blagoev, Bart Cox, J\'er\'emie Decouchant, Lydia Y. Chen
arxiv.org/abs/2509.21221

@arXiv_csAI_bot@mastoxiv.page
2025-08-27 10:14:23

Investigating Advanced Reasoning of Large Language Models via Black-Box Interaction
Congchi Yin, Tianyi Wu, Yankai Shu, Alex Gu, Yunhan Wang, Jun Shao, Xun Jiang, Piji Li
arxiv.org/abs/2508.19035

@arXiv_csSD_bot@mastoxiv.page
2025-07-25 08:50:42

DIFFA: Large Language Diffusion Models Can Listen and Understand
Jiaming Zhou, Hongjie Chen, Shiwan Zhao, Jian Kang, Jie Li, Enzhi Wang, Yujie Guo, Haoqin Sun, Hui Wang, Aobo Kong, Yong Qin, Xuelong Li
arxiv.org/abs/2507.18452

@arXiv_csSE_bot@mastoxiv.page
2025-08-26 08:23:56

Cognitive Agents Powered by Large Language Models for Agile Software Project Management
Konrad Cinkusz, Jaros{\l}aw A. Chudziak, Ewa Niewiadomska-Szynkiewicz
arxiv.org/abs/2508.16678

@seeingwithsound@mas.to
2025-08-27 10:57:48

High-level visual representations in the human brain are aligned with large language models nature.com/articles/s42256-025
News release: Using AI to "see" what we see

A mapping from LLM embeddings captures visual responses to natural scenes.
@arXiv_eessAS_bot@mastoxiv.page
2025-09-26 09:38:01

Measuring Audio's Impact on Correctness: Audio-Contribution-Aware Post-Training of Large Audio Language Models
Haolin He, Xingjian Du, Renhe Sun, Zheqi Dai, Yujia Xiao, Mingru Yang, Jiayi Zhou, Xiquan Li, Zhengxi Liu, Zining Liang, Chunyat Wu, Qianhua He, Tan Lee, Xie Chen, Weilong Zheng, Weiqiang Wang, Mark Plumbley, Jian Liu, Qiuqiang Kong

@arXiv_csHC_bot@mastoxiv.page
2025-08-26 07:59:56

Adaptive Command: Real-Time Policy Adjustment via Language Models in StarCraft II
Weiyu Ma, Dongyu Xu, Shu Lin, Haifeng Zhang, Jun Wang
arxiv.org/abs/2508.16580

@arXiv_csCY_bot@mastoxiv.page
2025-08-26 09:36:36

Invisible Filters: Cultural Bias in Hiring Evaluations Using Large Language Models
Pooja S. B. Rao, Laxminarayen Nagarajan Venkatesan, Mauro Cherubini, Dinesh Babu Jayagopi
arxiv.org/abs/2508.16673

@arXiv_csCR_bot@mastoxiv.page
2025-09-26 09:01:41

A Framework for Rapidly Developing and Deploying Protection Against Large Language Model Attacks
Adam Swanda, Amy Chang, Alexander Chen, Fraser Burch, Paul Kassianik, Konstantin Berlin
arxiv.org/abs/2509.20639

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:08:26

DiscussLLM: Teaching Large Language Models When to Speak
Deep Anil Patel, Iain Melvin, Christopher Malon, Martin Renqiang Min
arxiv.org/abs/2508.18167

@arXiv_csDC_bot@mastoxiv.page
2025-08-26 08:35:06

Memory-Efficient Federated Fine-Tuning of Large Language Models via Layer Pruning
Yebo Wu, Jingguang Li, Chunlin Tian, Zhijiang Guo, Li Li
arxiv.org/abs/2508.17209

@arXiv_csAI_bot@mastoxiv.page
2025-08-27 10:08:53

Interactive Evaluation of Large Language Models for Multi-Requirement Software Engineering Tasks
Dimitrios Rontogiannis, Maxime Peyrard, Nicolas Baldwin, Martin Josifoski, Robert West, Dimitrios Gunopulos
arxiv.org/abs/2508.18905

@arXiv_csLG_bot@mastoxiv.page
2025-08-26 12:26:36

AdLoCo: adaptive batching significantly improves communications efficiency and convergence for Large Language Models
Nikolay Kutuzov, Makar Baderko, Stepan Kulibaba, Artem Dzhalilov, Daniel Bobrov, Maxim Mashtaler, Alexander Gasnikov
arxiv.org/abs/2508.18182

@arXiv_csRO_bot@mastoxiv.page
2025-08-27 09:40:52

An LLM-powered Natural-to-Robotic Language Translation Framework with Correctness Guarantees
ZhenDong Chen, ZhanShang Nie, ShiXing Wan, JunYi Li, YongTian Cheng, Shuai Zhao
arxiv.org/abs/2508.19074

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:39:32

EchoBench: Benchmarking Sycophancy in Medical Large Vision-Language Models
Botai Yuan, Yutian Zhou, Yingjie Wang, Fushuo Huo, Yongcheng Jing, Li Shen, Ying Wei, Zhiqi Shen, Ziwei Liu, Tianwei Zhang, Jie Yang, Dacheng Tao
arxiv.org/abs/2509.20146

@arXiv_csSE_bot@mastoxiv.page
2025-08-26 08:53:26

CelloAI: Leveraging Large Language Models for HPC Software Development in High Energy Physics
Mohammad Atif, Kriti Chopra, Ozgur Kilic, Tianle Wang, Zhihua Dong, Charles Leggett, Meifeng Lin, Paolo Calafiura, Salman Habib
arxiv.org/abs/2508.16713

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:09:26

Leveraging Large Language Models for Accurate Sign Language Translation in Low-Resource Scenarios
Luana Bulla, Gabriele Tuccio, Misael Mongiov\`i, Aldo Gangemi
arxiv.org/abs/2508.18183

@arXiv_csCR_bot@mastoxiv.page
2025-08-27 09:18:13

Collaborative Intelligence: Topic Modelling of Large Language Model use in Live Cybersecurity Operations
Martin Lochner, Keegan Keplinger
arxiv.org/abs/2508.18488

@arXiv_csIR_bot@mastoxiv.page
2025-08-26 09:16:56

A Universal Framework for Offline Serendipity Evaluation in Recommender Systems via Large Language Models
Yu Tokutake, Kazushi Okamoto, Kei Harada, Atsushi Shibata, Koki Karube
arxiv.org/abs/2508.17571

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:05:41

PerHalluEval: Persian Hallucination Evaluation Benchmark for Large Language Models
Mohammad Hosseini, Kimia Hosseini, Shayan Bali, Zahra Zanjani, Saeedeh Momtazi
arxiv.org/abs/2509.21104

@arXiv_csCV_bot@mastoxiv.page
2025-08-27 10:27:03

Enhancing Document VQA Models via Retrieval-Augmented Generation
Eric L\'opez, Artemis Llabr\'es, Ernest Valveny
arxiv.org/abs/2508.18984

@arXiv_csAI_bot@mastoxiv.page
2025-09-26 09:37:11

Embodied AI: From LLMs to World Models
Tongtong Feng, Xin Wang, Yu-Gang Jiang, Wenwu Zhu
arxiv.org/abs/2509.20021 arxiv.org/pdf/2509.20021

@arXiv_csDC_bot@mastoxiv.page
2025-08-26 07:58:46

Equinox: Holistic Fair Scheduling in Serving Large Language Models
Zhixiang Wei, James Yen, Jingyi Chen, Ziyang Zhang, Zhibai Huang, Chen Chen, Xingzi Yu, Yicheng Gu, Chenggang Wu, Yun Wang, Mingyuan Xia, Jie Wu, Hao Wang, Zhengwei Qi
arxiv.org/abs/2508.16646

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 09:48:43

Emotion Omni: Enabling Empathetic Speech Response Generation through Large Language Models
Haoyu Wang, Guangyan Zhang, Jiale Chen, Jingyu Li, Yuehai Wang, Yiwen Guo
arxiv.org/abs/2508.18655

@arXiv_csRO_bot@mastoxiv.page
2025-07-25 08:43:42

OpenNav: Open-World Navigation with Multimodal Large Language Models
Mingfeng Yuan, Letian Wang, Steven L. Waslander
arxiv.org/abs/2507.18033

@arXiv_csLG_bot@mastoxiv.page
2025-09-25 10:51:12

Video models are zero-shot learners and reasoners
Thadd\"aus Wiedemer, Yuxuan Li, Paul Vicol, Shixiang Shane Gu, Nick Matarese, Kevin Swersky, Been Kim, Priyank Jaini, Robert Geirhos
arxiv.org/abs/2509.20328

@arXiv_csCR_bot@mastoxiv.page
2025-08-26 11:10:36

Attacking LLMs and AI Agents: Advertisement Embedding Attacks Against Large Language Models
Qiming Guo, Jinwen Tang, Xingran Huang
arxiv.org/abs/2508.17674

@arXiv_csIR_bot@mastoxiv.page
2025-08-26 10:44:47

Retrieval Feedback Memory Enhancement Large Model Retrieval Generation Method
Leqian Li, Dianxi Shi, Jialu Zhou, Xinyu Wei, Mingyue Yang, Songchang Jin, Shaowu Yang
arxiv.org/abs/2508.17862

@arXiv_csSE_bot@mastoxiv.page
2025-08-27 07:37:02

Training Language Model Agents to Find Vulnerabilities with CTF-Dojo
Terry Yue Zhuo, Dingmin Wang, Hantian Ding, Varun Kumar, Zijian Wang
arxiv.org/abs/2508.18370

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:16:23

ConfTuner: Training Large Language Models to Express Their Confidence Verbally
Yibo Li, Miao Xiong, Jiaying Wu, Bryan Hooi
arxiv.org/abs/2508.18847

@arXiv_csDC_bot@mastoxiv.page
2025-08-27 08:25:23

Strata: Hierarchical Context Caching for Long Context Language Model Serving
Zhiqiang Xie, Ziyi Xu, Mark Zhao, Yuwei An, Vikram Sharma Mailthody, Scott Mahlke, Michael Garland, Christos Kozyrakis
arxiv.org/abs/2508.18572

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:02:16

Understanding Subword Compositionality of Large Language Models
Qiwei Peng, Yekun Chai, Anders S{\o}gaard
arxiv.org/abs/2508.17953 arxiv.or…

@arXiv_csLG_bot@mastoxiv.page
2025-08-27 10:31:53

PAX-TS: Model-agnostic multi-granular explanations for time series forecasting via localized perturbations
Tim Kreuzer, Jelena Zdravkovic, Panagiotis Papapetrou
arxiv.org/abs/2508.18982

@arXiv_csAI_bot@mastoxiv.page
2025-09-25 07:44:22

Cognitive Load Limits in Large Language Models: Benchmarking Multi-Hop Reasoning
Sai Teja Reddy Adapala
arxiv.org/abs/2509.19517 arxiv.org/…

@arXiv_csCR_bot@mastoxiv.page
2025-07-25 08:41:32

RECALLED: An Unbounded Resource Consumption Attack on Large Vision-Language Models
Haoran Gao, Yuanhe Zhang, Zhenhong Zhou, Lei Jiang, Fanyu Meng, Yujia Xiao, Kun Wang, Yang Liu, Junlan Feng
arxiv.org/abs/2507.18053

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:14:01

CLaw: Benchmarking Chinese Legal Knowledge in Large Language Models - A Fine-grained Corpus and Reasoning Analysis
Xinzhe Xu, Liang Zhao, Hongshen Xu, Chen Chen
arxiv.org/abs/2509.21208

@arXiv_csCV_bot@mastoxiv.page
2025-09-26 10:23:21

Instruction-tuned Self-Questioning Framework for Multimodal Reasoning
You-Won Jang, Yu-Jung Heo, Jaeseok Kim, Minsu Lee, Du-Seong Chang, Byoung-Tak Zhang
arxiv.org/abs/2509.21251

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:09:43

ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Qianyu He, Siyu Yuan, Xuefeng Li, Mingxuan Wang, Jiangjie Chen
arxiv.org/abs/2508.18773

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 08:02:02

Reverse Engineering User Stories from Code using Large Language Models
Mohamed Ouf, Haoyu Li, Michael Zhang, Mariam Guizani
arxiv.org/abs/2509.19587

@arXiv_csLG_bot@mastoxiv.page
2025-08-25 09:50:20

On the Evolution of Federated Post-Training Large Language Models: A Model Accessibility View
Tao Guo, Junxiao Wang, Fushuo Huo, Laizhong Cui, Song Guo, Jie Gui, Dacheng Tao
arxiv.org/abs/2508.16261

@arXiv_csAI_bot@mastoxiv.page
2025-08-27 10:13:13

AI Models Exceed Individual Human Accuracy in Predicting Everyday Social Norms
Pontus Strimling, Simon Karlsson, Irina Vartanova, Kimmo Eriksson
arxiv.org/abs/2508.19004

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:15:53

Arrows of Math Reasoning Data Synthesis for Large Language Models: Diversity, Complexity and Correctness
Sirui Chen, Changxin Tian, Binbin Hu, Kunlong Chen, Ziqi Liu, Zhiqiang Zhang, Jun Zhou
arxiv.org/abs/2508.18824

@arXiv_csCV_bot@mastoxiv.page
2025-09-26 10:19:41

MOSS-ChatV: Reinforcement Learning with Process Reasoning Reward for Video Temporal Reasoning
Sicheng Tao, Jungang Li, Yibo Yan, Junyan Zhang, Yubo Gao, Hanqian Li, ShuHang Xun, Yuxuan Fan, Hong Chen, Jianxiang He, Xuming Hu
arxiv.org/abs/2509.21113

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:24:13

Generative Interfaces for Language Models
Jiaqi Chen, Yanzhe Zhang, Yutong Zhang, Yijia Shao, Diyi Yang
arxiv.org/abs/2508.19227 arxiv.org/…

@arXiv_csCR_bot@mastoxiv.page
2025-08-26 08:50:26

Guarding Your Conversations: Privacy Gatekeepers for Secure Interactions with Cloud-Based AI Models
GodsGift Uzor, Hasan Al-Qudah, Ynes Ineza, Abdul Serwadda
arxiv.org/abs/2508.16765

@arXiv_csSE_bot@mastoxiv.page
2025-07-25 09:42:32

Automated Code Review Using Large Language Models with Symbolic Reasoning
Busra Icoz, Goksel Biricik
arxiv.org/abs/2507.18476 arxiv.org/pdf…

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 11:57:26

DRQA: Dynamic Reasoning Quota Allocation for Controlling Overthinking in Reasoning Large Language Models
Kaiwen Yan, Xuanqing Shi, Hongcheng Guo, Wenxuan Wang, Zhuosheng Zhang, Chengwei Qin
arxiv.org/abs/2508.17803

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 09:47:33

Breaking the Trade-Off Between Faithfulness and Expressiveness for Large Language Models
Chenxu Yang, Qingyi Si, Zheng Lin
arxiv.org/abs/2508.18651

@arXiv_csSE_bot@mastoxiv.page
2025-09-26 08:25:11

Dynamic ReAct: Scalable Tool Selection for Large-Scale MCP Environments
Nishant Gaurav, Adit Akarsh, Ankit Ranjan, Manoj Bajaj
arxiv.org/abs/2509.20386

@arXiv_csAI_bot@mastoxiv.page
2025-09-24 10:13:24

Advances in Large Language Models for Medicine
Zhiyu Kan, Wensheng Gan, Zhenlian Qi, Philip S. Yu
arxiv.org/abs/2509.18690 arxiv.org/pdf/25…

@arXiv_csLG_bot@mastoxiv.page
2025-09-26 10:29:31

Mixture of Thoughts: Learning to Aggregate What Experts Think, Not Just What They Say
Jacob Fein-Ashley, Dhruv Parikh, Rajgopal Kannan, Viktor Prasanna
arxiv.org/abs/2509.21164

@arXiv_csCR_bot@mastoxiv.page
2025-08-25 09:25:20

Retrieval-Augmented Defense: Adaptive and Controllable Jailbreak Prevention for Large Language Models
Guangyu Yang, Jinghong Chen, Jingbiao Mei, Weizhe Lin, Bill Byrne
arxiv.org/abs/2508.16406

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:12:51

GEP: A GCG-Based method for extracting personally identifiable information from chatbots built on small language models
Jieli Zhu, Vi Ngoc-Nha Tran
arxiv.org/abs/2509.21192

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:13:26

From BERT to LLMs: Comparing and Understanding Chinese Classifier Prediction in Language Models
ZiqiZhang, Jianfei Ma, Emmanuele Chersoni, Jieshun You, Zhaoxin Feng
arxiv.org/abs/2508.18253

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 09:38:22

V-GameGym: Visual Game Generation for Code Large Language Models
Wei Zhang, Jack Yang, Renshuai Tao, Lingzheng Chai, Shawn Guo, Jiajun Wu, Xiaoming Chen, Ganqu Cui, Ning Ding, Xander Xu, Hu Wei, Bowen Zhou
arxiv.org/abs/2509.20136

@arXiv_csAI_bot@mastoxiv.page
2025-08-26 09:12:46

Bridging the Gap in Ophthalmic AI: MM-Retinal-Reason Dataset and OphthaReason Model toward Dynamic Multimodal Reasoning
Ruiqi Wu, Yuang Yao, Tengfei Ma, Chenran Zhang, Na Su, Tao Zhou, Geng Chen, Wen Fan, Yi Zhou
arxiv.org/abs/2508.16129

@arXiv_csSE_bot@mastoxiv.page
2025-09-25 08:34:02

Assertion Messages with Large Language Models (LLMs) for Code
Ahmed Aljohani, Anamul Haque Mollah, Hyunsook Do
arxiv.org/abs/2509.19673 arx…

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:04:36

Neither Valid nor Reliable? Investigating the Use of LLMs as Judges
Khaoula Chehbouni, Mohammed Haddou, Jackie Chi Kit Cheung, Golnoosh Farnadi
arxiv.org/abs/2508.18076

@arXiv_csAI_bot@mastoxiv.page
2025-08-27 10:13:33

Sense of Self and Time in Borderline Personality. A Comparative Robustness Study with Generative AI
Marcin Moskalewicz, Anna Sterna, Marek Pokropski, Paula Flores
arxiv.org/abs/2508.19008

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:19:23

Automatic Prompt Optimization with Prompt Distillation
Viktor N. Zhuravlev, Artur R. Khairullin, Ernest A. Dyagin, Alena N. Sitkina, Nikita I. Kulin
arxiv.org/abs/2508.18992

@arXiv_csSE_bot@mastoxiv.page
2025-08-25 09:37:50

How Small is Enough? Empirical Evidence of Quantized Small Language Models for Automated Program Repair
Kazuki Kusama, Honglin Shu, Masanari Kondo, Yasutaka Kamei
arxiv.org/abs/2508.16499

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:06:26

Detecting and Characterizing Planning in Language Models
Jatin Nainani, Sankaran Vaidyanathan, Connor Watts, Andre N. Assis, Alice Rigg
arxiv.org/abs/2508.18098

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:04:46

How Quantization Shapes Bias in Large Language Models
Federico Marcuzzi, Xuefei Ning, Roy Schwartz, Iryna Gurevych
arxiv.org/abs/2508.18088

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:07:51

BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback
Hyunseo Kim, Sangam Lee, Kwangwook Seo, Dongha Lee
arxiv.org/abs/2509.21106

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 11:59:36

ILRe: Intermediate Layer Retrieval for Context Compression in Causal Language Models
Manlai Liang, Mandi Liu, Jiangzhou Ji, Huaijun Li, Haobo Yang, Yaohan He, Jinlong Li
arxiv.org/abs/2508.17892

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:01:56

AMELIA: A Family of Multi-task End-to-end Language Models for Argumentation
Henri Savigny, Bruno Yun
arxiv.org/abs/2508.17926 arxiv.org/pdf…

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:03:06

A Retail-Corpus for Aspect-Based Sentiment Analysis with Large Language Models
Oleg Silcenco, Marcos R. Machad, Wallace C. Ugulino, Daniel Braun
arxiv.org/abs/2508.17994

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:17:31

DisCoCLIP: A Distributional Compositional Tensor Network Encoder for Vision-Language Understanding
Kin Ian Lo, Hala Hawashin, Mina Abbaszadeh, Tilen Limback-Stokin, Hadi Wazni, Mehrnoosh Sadrzadeh
arxiv.org/abs/2509.21287

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:01:20

Political Ideology Shifts in Large Language Models
Pietro Bernardelle, Stefano Civelli, Leon Fr\"ohling, Riccardo Lunardi, Kevin Roitero, Gianluca Demartini
arxiv.org/abs/2508.16013

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:35:42

Benchmarking Gaslighting Attacks Against Speech Large Language Models
Jinyang Wu, Bin Zhu, Xiandong Zou, Qiquan Zhang, Xu Fang, Pan Zhou
arxiv.org/abs/2509.19858

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:04:10

TULIP: Adapting Open-Source Large Language Models for Underrepresented Languages and Specialized Financial Tasks
\.Irem Demirta\c{s}, Burak Payzun, Se\c{c}il Arslan
arxiv.org/abs/2508.16243

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:05:10

MizanQA: Benchmarking Large Language Models on Moroccan Legal Question Answering
Adil Bahaj, Mounir Ghogho
arxiv.org/abs/2508.16357 arxiv.o…

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:05:31

Which Cultural Lens Do Models Adopt? On Cultural Positioning Bias and Agentic Mitigation in LLMs
Yixin Wan, Xingrun Chen, Kai-Wei Chang
arxiv.org/abs/2509.21080

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:44:52

Embedding Domain Knowledge for Large Language Models via Reinforcement Learning from Augmented Generation
Chaojun Nie, Jun Zhou, Guanxiang Wang, Shisong Wud, Zichen Wang
arxiv.org/abs/2509.20162

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 12:11:46

MTalk-Bench: Evaluating Speech-to-Speech Models in Multi-Turn Dialogues via Arena-style and Rubrics Protocols
Yuhao Du, Qianwei Huang, Guo Zhu, Zhanchen Dai, Sunian Chen, Qiming Zhu, Yuhao Zhang, Li Zhou, Benyou Wang
arxiv.org/abs/2508.18240

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 11:58:56

Speech Discrete Tokens or Continuous Features? A Comparative Analysis for Spoken Language Understanding in SpeechLLMs
Dingdong Wang, Junan Li, Mingyu Cui, Dongchao Yang, Xueyuan Chen, Helen Meng
arxiv.org/abs/2508.17863

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:06:20

LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models
Doohee You, Andy Parisi, Zach Vander Velden, Lara Dantas Inojosa
arxiv.org/abs/2508.16478

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:02:20

CEQuest: Benchmarking Large Language Models for Construction Estimation
Yanzhao Wu, Lufan Wang, Rui Liu
arxiv.org/abs/2508.16081 arxiv.org/…

@arXiv_csCL_bot@mastoxiv.page
2025-08-25 10:02:00

Ethical Considerations of Large Language Models in Game Playing
Qingquan Zhang, Yuchen Li, Bo Yuan, Julian Togelius, Georgios N. Yannakakis, Jialin Liu
arxiv.org/abs/2508.16065

@arXiv_csCL_bot@mastoxiv.page
2025-08-27 10:16:43

ReflectivePrompt: Reflective evolution in autoprompting algorithms
Viktor N. Zhuravlev, Artur R. Khairullin, Ernest A. Dyagin, Alena N. Sitkina, Nikita I. Kulin
arxiv.org/abs/2508.18870

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 10:07:52

The Moral Gap of Large Language Models
Maciej Skorski, Alina Landowska
arxiv.org/abs/2507.18523 arxiv.org/pdf/2507.18523

@arXiv_csCL_bot@mastoxiv.page
2025-07-25 09:58:42

BadReasoner: Planting Tunable Overthinking Backdoors into Large Reasoning Models for Fun or Profit
Biao Yi, Zekun Fei, Jianing Geng, Tong Li, Lihai Nie, Zheli Liu, Yiming Li
arxiv.org/abs/2507.18305

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 17:16:41

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[1/6]:
- Recursively Summarizing Enables Long-Term Dialogue Memory in Large Language Models
Qingyue Wang, Yanhe Fu, Yanan Cao, Shuai Wang, Zhiliang Tian, Liang Ding

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 17:16:54

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[2/6]:
- EmoBench-M: Benchmarking Emotional Intelligence for Multimodal Large Language Models
Hu, Zhou, You, Xu, Wang, Lian, Yu, Ma, Cui

@arXiv_csCL_bot@mastoxiv.page
2025-08-26 17:17:07

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[3/6]:
- Exploring the Vulnerability of the Content Moderation Guardrail in Large Language Models via Inte...
Jun Zhuang, Haibo Jin, Ye Zhang, Zhengjian Kang, Wenbin Zhang, Gaby G. Dagher, Haohan Wang