Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/5]:
- Beyond In-Distribution Success: Scaling Curves of CoT Granularity for Language Model Generalization
Ru Wang, Wei Huang, Selena Song, Haoyu Zhang, Qian Niu, Yusuke Iwasawa, Yutaka Matsuo, Jiaxian Guo
https://arxiv.org/abs/2502.18273 https://mastoxiv.page/@arXiv_csCL_bot/114069031700102129
- Benchmarking NLP-supported Language Sample Analysis for Swiss Children's Speech
Anja Ryser, Yingqiang Gao, Sarah Ebling
https://arxiv.org/abs/2504.00780 https://mastoxiv.page/@arXiv_csCL_bot/114267149909002069
- Cultural Biases of Large Language Models and Humans in Historical Interpretation
Fabio Celli, Georgios Spathulas
https://arxiv.org/abs/2504.02572 https://mastoxiv.page/@arXiv_csCL_bot/114278467094094490
- BRIDGE: Benchmarking Large Language Models for Understanding Real-world Clinical Practice Text
Jiageng Wu, et al.
https://arxiv.org/abs/2504.19467 https://mastoxiv.page/@arXiv_csCL_bot/114420036189999973
- Understanding the Anchoring Effect of LLM with Synthetic Data: Existence, Mechanism, and Potentia...
Yiming Huang, Biquan Bie, Zuqiu Na, Weilin Ruan, Songxin Lei, Yutao Yue, Xinlei He
https://arxiv.org/abs/2505.15392 https://mastoxiv.page/@arXiv_csCL_bot/114550277171100272
- Just as Humans Need Vaccines, So Do Models: Model Immunization to Combat Falsehoods
Raza, Qureshi, Farooq, Lotif, Chadha, Pandya, Emmanouilidis
https://arxiv.org/abs/2505.17870 https://mastoxiv.page/@arXiv_csCL_bot/114572956853819813
- LingoLoop Attack: Trapping MLLMs via Linguistic Context and State Entrapment into Endless Loops
Fu, Jiang, Hong, Li, Guo, Yang, Chen, Zhang
https://arxiv.org/abs/2506.14493 https://mastoxiv.page/@arXiv_csCL_bot/114703502552989170
- GHTM: A Graph-based Hybrid Topic Modeling Approach with a Benchmark Dataset for the Low-Resource ...
Farhana Haque, Md. Abdur Rahman, Sumon Ahmed
https://arxiv.org/abs/2508.00605 https://mastoxiv.page/@arXiv_csCL_bot/114969875643478303
- Link Prediction for Event Logs in the Process Industry
Anastasia Zhukova, Thomas Walton, Christian E. Lobm\"uller, Bela Gipp
https://arxiv.org/abs/2508.09096 https://mastoxiv.page/@arXiv_csCL_bot/115020938764936882
- AirQA: A Comprehensive QA Dataset for AI Research with Instance-Level Evaluation
Huang, Cao, Zhang, Kang, Wang, Wang, Luo, Zheng, Qian, Chen, Yu
https://arxiv.org/abs/2509.16952 https://mastoxiv.page/@arXiv_csCL_bot/115253526588472475
- Multi-View Attention Multiple-Instance Learning Enhanced by LLM Reasoning for Cognitive Distortio...
Jun Seo Kim, Hyemi Kim, Woo Joo Oh, Hongjin Cho, Hochul Lee, Hye Hyeon Kim
https://arxiv.org/abs/2509.17292 https://mastoxiv.page/@arXiv_csCL_bot/115253586227941157
- Dual-Space Smoothness for Robust and Balanced LLM Unlearning
Han Yan, Zheyuan Liu, Meng Jiang
https://arxiv.org/abs/2509.23362 https://mastoxiv.page/@arXiv_csCL_bot/115293308293558024
- The Rise of AfricaNLP: Contributions, Contributors, Community Impact, and Bibliometric Analysis
Tadesse Destaw Belay, et al.
https://arxiv.org/abs/2509.25477 https://mastoxiv.page/@arXiv_csCL_bot/115298213432594791
- Open ASR Leaderboard: Towards Reproducible and Transparent Multilingual and Long-Form Speech Reco...
Srivastav, Zheng, Bezzam, Le Bihan, Koluguri, \.Zelasko, Majumdar, Moumen, Gandhi
https://arxiv.org/abs/2510.06961 https://mastoxiv.page/@arXiv_csCL_bot/115343748052193267
- Neuron-Level Analysis of Cultural Understanding in Large Language Models
Taisei Yamamoto, Ryoma Kumon, Danushka Bollegala, Hitomi Yanaka
https://arxiv.org/abs/2510.08284 https://mastoxiv.page/@arXiv_csCL_bot/115349533441895984
- CLMN: Concept based Language Models via Neural Symbolic Reasoning
Yibo Yang
https://arxiv.org/abs/2510.10063 https://mastoxiv.page/@arXiv_csCL_bot/115372392366793754
- Schema for In-Context Learning
Chen, Chen, Wang, Leong, Fung, Bernales, Aspuru-Guzik
https://arxiv.org/abs/2510.13905 https://mastoxiv.page/@arXiv_csCL_bot/115389057899856601
- Evaluating Latent Knowledge of Public Tabular Datasets in Large Language Models
Matteo Silvestri, Fabiano Veglianti, Flavio Giorgi, Fabrizio Silvestri, Gabriele Tolomei
https://arxiv.org/abs/2510.20351 https://mastoxiv.page/@arXiv_csCL_bot/115428615784704418
- LuxIT: A Luxembourgish Instruction Tuning Dataset from Monolingual Seed Data
Julian Valline, Cedric Lothritz, Siwen Guo, Jordi Cabot
https://arxiv.org/abs/2510.24434 https://mastoxiv.page/@arXiv_csCL_bot/115457025096322944
- Surfacing Subtle Stereotypes: A Multilingual, Debate-Oriented Evaluation of Modern LLMs
Muhammed Saeed, Muhammad Abdul-mageed, Shady Shehata
https://arxiv.org/abs/2511.01187 https://mastoxiv.page/@arXiv_csCL_bot/115491321130591723
toXiv_bot_toot
Amazon MGM's Project Hail Mary becomes its highest-grossing film ever, crossing $300M globally, including $54.1M just this weekend; the movie cost $200M to make (Brent Lang/Variety)
https://variety.com/2026/film/news/pro…
Crosslisted article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[2/2]:
- The Geometry of Harmful Intent: Training-Free Anomaly Detection via Angular Deviation in LLM Resi...
Isaac Llorente-Saguer
https://arxiv.org/abs/2603.27412 https://mastoxiv.page/@arXiv_csLG_bot/116323180390164201
- LongCat-Next: Lexicalizing Modalities as Discrete Tokens
Meituan LongCat Team, et al.
https://arxiv.org/abs/2603.27538 https://mastoxiv.page/@arXiv_csCV_bot/116323299668026852
- Emergent Social Intelligence Risks in Generative Multi-Agent Systems
Huang, Jiang, Wang, Zhuang, Luo, Ma, Xu, Chen, Moniz, Lin, Chen, Chawla, Dziri, Sun, Zhang
https://arxiv.org/abs/2603.27771 https://mastoxiv.page/@arXiv_csMA_bot/116322908437739020
- KVSculpt: KV Cache Compression as Distillation
Bo Jiang, Sian Jin
https://arxiv.org/abs/2603.27819 https://mastoxiv.page/@arXiv_csLG_bot/116323241993833314
- Q-Bridge: Code Translation for Quantum Machine Learning via LLMs
Runjia Zeng, Priyabrata Senapati, Ruixiang Tang, Dongfang Liu, Qiang Guan
https://arxiv.org/abs/2603.27836 https://mastoxiv.page/@arXiv_quantph_bot/116323164660887506
- EffiSkill: Agent Skill Based Automated Code Efficiency Optimization
Zimu Wang, Yuling Shi, Mengfan Li, Zijun Liu, Jie M. Zhang, Chengcheng Wan, Xiaodong Gu
https://arxiv.org/abs/2603.27850 https://mastoxiv.page/@arXiv_csSE_bot/116322989347928729
- Efficient Inference of Large Vision Language Models
Surendra Pathak
https://arxiv.org/abs/2603.27960 https://mastoxiv.page/@arXiv_csLG_bot/116323256085918152
- CDH-Bench: A Commonsense-Driven Hallucination Benchmark for Evaluating Visual Fidelity in Vision-...
Kesheng Chen, Yamin Hu, Qi Zhou, Zhenqian Zhu, Wenjian Luo
https://arxiv.org/abs/2603.27982 https://mastoxiv.page/@arXiv_csCV_bot/116323319000206060
- MOSS-VoiceGenerator: Create Realistic Voices with Natural Language Descriptions
Huang, Fan, Jiang, Jiang, Tu, Zhu, Zhang, Zhao, Yang, Fei, Li, Yang, Cheng, Qiu
https://arxiv.org/abs/2603.28086 https://mastoxiv.page/@arXiv_csSD_bot/116322971980743316
- Does Claude's Constitution Have a Culture?
Parham Pourdavood
https://arxiv.org/abs/2603.28123 https://mastoxiv.page/@arXiv_csCY_bot/116322911684465443
- MiroEval: Benchmarking Multimodal Deep Research Agents in Process and Outcome
Fangda Ye, et al.
https://arxiv.org/abs/2603.28407 https://mastoxiv.page/@arXiv_csAI_bot/116323220038984883
- IsoQuant: Hardware-Aligned SO(4) Isoclinic Rotations for LLM KV Cache Compression
Zhongping Ji
https://arxiv.org/abs/2603.28430 https://mastoxiv.page/@arXiv_csLG_bot/116323286231537351
- Entropic Claim Resolution: Uncertainty-Driven Evidence Selection for RAG
Davide Di Gioia
https://arxiv.org/abs/2603.28444 https://mastoxiv.page/@arXiv_csAI_bot/116323220366355511
- Moving Beyond Review: Applying Language Models to Planning and Translation in Reflection
Seyed Parsa Neshaei, Richard Lee Davis, Tanja K\"aser
https://arxiv.org/abs/2603.28596 https://mastoxiv.page/@arXiv_csHC_bot/116323161382060848
- ResAdapt: Adaptive Resolution for Efficient Multimodal Reasoning
Huanxuan Liao, Zhongtao Jiang, Yupu Hao, Yuqiao Tan, Shizhu He, Jun Zhao, Kun Xu, Kang Liu
https://arxiv.org/abs/2603.28610 https://mastoxiv.page/@arXiv_csCV_bot/116323344559859277
- The Ultimate Tutorial for AI-driven Scale Development in Generative Psychometrics: Releasing AIGE...
Lara Russell-Lasalandra, Hudson Golino, Luis Eduardo Garrido, Alexander P. Christensen
https://arxiv.org/abs/2603.28643 https://mastoxiv.page/@arXiv_csAI_bot/116323236095523987
- SOLE-R1: Video-Language Reasoning as the Sole Reward for On-Robot Reinforcement Learning
Philip Schroeder, Thomas Weng, Karl Schmeckpeper, Eric Rosen, Stephen Hart, Ondrej Biza
https://arxiv.org/abs/2603.28730 https://mastoxiv.page/@arXiv_csRO_bot/116323253135037252
- ParaSpeechCLAP: A Dual-Encoder Speech-Text Model for Rich Stylistic Language-Audio Pretraining
Anuj Diwan, Eunsol Choi, David Harwath
https://arxiv.org/abs/2603.28737 https://mastoxiv.page/@arXiv_eessAS_bot/116322903493463665
toXiv_bot_toot
Empowering the Ecosystem: MISP’s 2025 Progress and the Open Source Future
In 2025, the MISP project hit its stride with the transition to the 2.5 branch, delivering a major UI/UX overhaul and modernized background processing to enhance platform performance. This progress was bolstered by significant updates to satellite projects, including taxonomies,
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[3/5]:
- Can Small Language Models Handle Context-Summarized Multi-Turn Customer-Service QA? A Synthetic D...
Lakshan Cooray, Deshan Sumanathilaka, Pattigadapa Venkatesh Raju
https://arxiv.org/abs/2602.00665 https://mastoxiv.page/@arXiv_csCL_bot/116006686092324902
- SEAD: Self-Evolving Agent for Multi-Turn Service Dialogue
Dai, Gao, Zhang, Wang, Luo, Wang, Wang, Wu, Wang
https://arxiv.org/abs/2602.03548
- OmniRAG-Agent: Agentic Omnimodal Reasoning for Low-Resource Long Audio-Video Question Answering
Yifan Zhu, Xinyu Mu, Tao Feng, Zhonghong Ou, Yuning Gong, Haoran Luo
https://arxiv.org/abs/2602.03707
- GreekMMLU: A Native-Sourced Multitask Benchmark for Evaluating Language Models in Greek
Zhang, Konomi, Xypolopoulos, Divriotis, Skianis, Nikolentzos, Stamou, Shang, Vazirgiannis
https://arxiv.org/abs/2602.05150
- Using LLMs for Knowledge Component-level Correctness Labeling in Open-ended Coding Problems
Zhangqi Duan, Arnav Kankaria, Dhruv Kartik, Andrew Lan
https://arxiv.org/abs/2602.17542 https://mastoxiv.page/@arXiv_csCL_bot/116102514058414603
- MetaState: Persistent Working Memory Enhances Reasoning in Discrete Diffusion Language Models
Kejing Xia, Mingzhe Li, Lixuan Wei, Zhenbang Du, Xiangchi Yuan, Dachuan Shi, Qirui Jin, Wenke Lee
https://arxiv.org/abs/2603.01331 https://mastoxiv.page/@arXiv_csCL_bot/116165314672421581
- A Browser-based Open Source Assistant for Multimodal Content Verification
Milner, Foster, Karmakharm, Razuvayevskaya, Roberts, Porcellini, Teyssou, Bontcheva
https://arxiv.org/abs/2603.02842 https://mastoxiv.page/@arXiv_csCL_bot/116170368271004704
- Nw\=ach\=a Mun\=a: A Devanagari Speech Corpus and Proximal Transfer Benchmark for Nepal Bhasha ASR
Sharma, Shrestha, Poudel, Tiwari, Shrestha, Ghimire, Bal
https://arxiv.org/abs/2603.07554 https://mastoxiv.page/@arXiv_csCL_bot/116204797995674104
- Model Merging in the Era of Large Language Models: Methods, Applications, and Future Directions
Mingyang Song, Mao Zheng
https://arxiv.org/abs/2603.09938 https://mastoxiv.page/@arXiv_csCL_bot/116210189810004206
- AgentDrift: Unsafe Recommendation Drift Under Tool Corruption Hidden by Ranking Metrics in LLM Ag...
Zekun Wu, Adriano Koshiyama, Sahan Bulathwela, Maria Perez-Ortiz
https://arxiv.org/abs/2603.12564 https://mastoxiv.page/@arXiv_csCL_bot/116237800898328349
- GhanaNLP Parallel Corpora: Comprehensive Multilingual Resources for Low-Resource Ghanaian Languages
Gyamfi, Azunre, Moore, Budu, Asare, Owusu, Asiamah
https://arxiv.org/abs/2603.13793 https://mastoxiv.page/@arXiv_csCL_bot/116243544688031749
- sebis at ArchEHR-QA 2026: How Much Can You Do Locally? Evaluating Grounded EHR QA on a Single Not...
Ibrahim Ebrar Yurt, Fabian Karl, Tejaswi Choppa, Florian Matthes
https://arxiv.org/abs/2603.13962 https://mastoxiv.page/@arXiv_csCL_bot/116243646346563497
- ExPosST: Explicit Positioning with Adaptive Masking for LLM-Based Simultaneous Machine Translation
Yuzhe Shang, Pengzhi Gao, Yazheng Yang, Jiayao Ma, Wei Liu, Jian Luan, Jinsong Su
https://arxiv.org/abs/2603.14903 https://mastoxiv.page/@arXiv_csCL_bot/116243711232778054
- BanglaSocialBench: A Benchmark for Evaluating Sociopragmatic and Cultural Alignment of LLMs in Ba...
Tanvir Ahmed Sijan, S. M Golam Rifat, Pankaj Chowdhury Partha, Md. Tanjeed Islam, Md. Musfique Anwar
https://arxiv.org/abs/2603.15949 https://mastoxiv.page/@arXiv_csCL_bot/116249122231759766
- EngGPT2: Sovereign, Efficient and Open Intelligence
G. Ciarfaglia, et al.
https://arxiv.org/abs/2603.16430 https://mastoxiv.page/@arXiv_csCL_bot/116249228411487178
- HypeLoRA: Hyper-Network-Generated LoRA Adapters for Calibrated Language Model Fine-Tuning
Bartosz Trojan, Filip G\k{e}bala
https://arxiv.org/abs/2603.19278 https://mastoxiv.page/@arXiv_csCL_bot/116277612915482857
- Automatic Analysis of Collaboration Through Human Conversational Data Resources: A Review
Yi Yu, Maria Boritchev, Chlo\'e Clavel
https://arxiv.org/abs/2603.19292 https://mastoxiv.page/@arXiv_csCL_bot/116277620779254916
- Alignment Whack-a-Mole : Finetuning Activates Verbatim Recall of Copyrighted Books in Large Langu...
Xinyue Liu, Niloofar Mireshghallah, Jane C. Ginsburg, Tuhin Chakrabarty
https://arxiv.org/abs/2603.20957 https://mastoxiv.page/@arXiv_csCL_bot/116283538317671552
- KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning
Shuai Wang, Yinan Yu
https://arxiv.org/abs/2603.21440 https://mastoxiv.page/@arXiv_csCL_bot/116283595007808076
toXiv_bot_toot
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[5/5]:
- AppellateGen: A Benchmark for Appellate Legal Judgment Generation
Yang, Wang, Fan, Hu, Wang, Liu, Zeng, Fu, Gong, Zhang, Li, Zheng, Xu
https://arxiv.org/abs/2601.01331 https://mastoxiv.page/@arXiv_csCY_bot/115847038572575387
- Vision-Language Agents for Interactive Forest Change Analysis
James Brock, Ce Zhang, Nantheera Anantrasirichai
https://arxiv.org/abs/2601.04497 https://mastoxiv.page/@arXiv_csCV_bot/115864542639529766
- FigEx2: Visual-Conditioned Panel Detection and Captioning for Scientific Compound Figures
Jifeng Song, Arun Das, Pan Wang, Hui Ji, Kun Zhao, Yufei Huang
https://arxiv.org/abs/2601.08026 https://mastoxiv.page/@arXiv_csCV_bot/115892719657942341
- Sparse-RL: Breaking the Memory Wall in LLM Reinforcement Learning via Stable Sparse Rollouts
Luo, Zhang, Hu, Zhang, Wang, Su, Sun, Liang, Zhang
https://arxiv.org/abs/2601.10079 https://mastoxiv.page/@arXiv_csLG_bot/115904206341755873
- Compounding Disadvantage: Auditing Intersectional Bias in LLM-Generated Explanations Across India...
Amogh Gupta (Neil), Niharika Patil (Neil), Sourojit Ghosh (Neil), SnehalKumar (Neil), S Gaikwad
https://arxiv.org/abs/2601.14506 https://mastoxiv.page/@arXiv_csCY_bot/115937624654783353
- Measuring Complexity at the Requirements Stage: Spectral Metrics as Development Effort Predictors
Vierlboeck, Pugliese, Nilchian, Grogan, Babu
https://arxiv.org/abs/2602.07182 https://mastoxiv.page/@arXiv_csSE_bot/116045826365214235
- CoPE-VideoLM: Leveraging Codec Primitives For Efficient Video Language Modeling
Sarkar, Pautrat, Miksik, Pollefeys, Armeni, Rad, Dusmanu
https://arxiv.org/abs/2602.13191 https://mastoxiv.page/@arXiv_csCV_bot/116079824094529198
- MoD-DPO: Towards Mitigating Cross-modal Hallucinations in Omni LLMs using Modality Decoupled Pref...
Ashutosh Chaubey, Jiacheng Pang, Mohammad Soleymani
https://arxiv.org/abs/2603.03192 https://mastoxiv.page/@arXiv_csCV_bot/116170511143131333
- Image Generation Models: A Technical History
Rouzbeh Shirvani
https://arxiv.org/abs/2603.07455 https://mastoxiv.page/@arXiv_csCV_bot/116204960613280699
- Rethinking Attention Output Projection: Structured Hadamard Transforms for Efficient Transformers
Shubham Aggarwal, Lokendra Kumar
https://arxiv.org/abs/2603.08343 https://mastoxiv.page/@arXiv_csLG_bot/116205064359384079
- FGTR: Fine-Grained Multi-Table Retrieval via Hierarchical LLM Reasoning
Chaojie Sun, Bin Cao, Tiantian Li, Chenyu Hou, Ruizhe Li, Jing Fan
https://arxiv.org/abs/2603.12702 https://mastoxiv.page/@arXiv_csIR_bot/116237827836520478
- CausalEvolve: Towards Open-Ended Discovery with Causal Scratchpad
Yongqiang Chen, Chenxi Liu, Zhenhao Chen, Tongliang Liu, Bo Han, Kun Zhang
https://arxiv.org/abs/2603.14575 https://mastoxiv.page/@arXiv_csLG_bot/116243782215605653
- Silicon Bureaucracy and AI Test-Oriented Education: Contamination Sensitivity and Score Confidenc...
Yiliang Song, Hongjun An, Jiangan Chen, Xuanchen Yan, Huan Song, Jiawei Shao, Xuelong Li
https://arxiv.org/abs/2603.21636 https://mastoxiv.page/@arXiv_csAI_bot/116283590092117172
- Problems with Chinchilla Approach 2: Systematic Biases in IsoFLOP Parabola Fits
Eric Czech, Zhiwei Xu, Yael Elmatad, Yixin Wang, William Held
https://arxiv.org/abs/2603.22339 https://mastoxiv.page/@arXiv_csLG_bot/116288991182888131
- X-OPD: Cross-Modal On-Policy Distillation for Capability Alignment in Speech LLMs
Di Cao, Dongjie Fu, Hai Yu, Siqi Zheng, Xu Tan, Tao Jin
https://arxiv.org/abs/2603.24596 https://mastoxiv.page/@arXiv_eessAS_bot/116300009464853696
toXiv_bot_toot