mmBERT: A Modern Multilingual Encoder with Annealed Language Learning
Marc Marone, Orion Weller, William Fleshman, Eugene Yang, Dawn Lawrie, Benjamin Van Durme
https://arxiv.org/abs/2509.06888
"Multilingual Scholarly Publishing and Artificial Intelligence Translation Tools: Weighing Social Justice and Climate Justice"
https://doi.org/10.3998/jep.7100
Checklist Engineering Empowers Multilingual LLM Judges
Mohammad Ghiasvand Mohammadkhani, Hamid Beigy
https://arxiv.org/abs/2507.06774 https://
AIxcellent Vibes at GermEval 2025 Shared Task on Candy Speech Detection: Improving Model Performance by Span-Level Training
Christian Rene Thelen, Patrick Gustav Blaneck, Tobias Bornheim, Niklas Grieger, Stephan Bialonski
https://arxiv.org/abs/2509.07459
OleSpeech-IV: A Large-Scale Multispeaker and Multilingual Conversational Speech Dataset with Diverse Topics
Wei Chu, Yuanzhe Dong, Ke Tan, Dong Han, Xavier Menendez-Pidal, Ruchao Fan, Chenfeng Miao, Chanwoo Kim, Bhiksha Raj, Rita Singh
https://arxiv.org/abs/2509.04702
Do LLMs exhibit the same commonsense capabilities across languages?
Ivan Mart\'inez-Murillo, Elena Lloret, Paloma Moreda, Albert Gatt
https://arxiv.org/abs/2509.06401 https:…
PRIM: Towards Practical In-Image Multilingual Machine Translation
Yanzhi Tian, Zeming Liu, Zhengyang Liu, Chong Feng, Xin Li, Heyan Huang, Yuhang Guo
https://arxiv.org/abs/2509.05146
Learning the Topic, Not the Language: How LLMs Classify Online Immigration Discourse Across Languages
Andrea Nasuto, Stefano Maria Iacus, Francisco Rowe, Devika Jain
https://arxiv.org/abs/2508.06435
fact check AI at SemEval-2025 Task 7: Multilingual and Crosslingual Fact-checked Claim Retrieval
Pranshu Rastogi
https://arxiv.org/abs/2508.03475 https://a…
Using LLMs for Multilingual Clinical Entity Linking to ICD-10
Sylvia Vassileva, Ivan Koychev, Svetla Boytcheva
https://arxiv.org/abs/2509.04868 https://arx…