
2025-08-19 11:38:30
Breaking Language Barriers: Equitable Performance in Multilingual Language Models
Tanay Nagar, Grigorii Khvatskii, Anna Sokol, Nitesh V. Chawla
https://arxiv.org/abs/2508.12662 …
Breaking Language Barriers: Equitable Performance in Multilingual Language Models
Tanay Nagar, Grigorii Khvatskii, Anna Sokol, Nitesh V. Chawla
https://arxiv.org/abs/2508.12662 …
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
Audio Flamingo Sound-CoT Technical Report: Improving Chain-of-Thought Reasoning in Sound Understanding
Zhifeng Kong, Arushi Goel, Joao Felipe Santos, Sreyan Ghosh, Rafael Valle, Wei Ping, Bryan Catanzaro
https://arxiv.org/abs/2508.11818
Qualifications for holding a conversation:
* A genuine interest in what your conversation partner has to say (Otherwise it’s a monologue).
There is no #2. Not even sharing a common language, believe it or not.
RAS-Eval: A Comprehensive Benchmark for Security Evaluation of LLM Agents in Real-World Environments
Yuchuan Fu, Xiaohan Yuan, Dongxia Wang
https://arxiv.org/abs/2506.15253
Using AI for User Representation: An Analysis of 83 Persona Prompts
Joni Salminen, Danial Amin, Bernard Jansen
https://arxiv.org/abs/2508.13047 https://arx…
Speaking of AI, CrowdStrike came out with two big announcements today on its big shift to agentic AI.
My latest piece for CSO.
CrowdStrike bets big on agentic AI with new offerings after $290M Onum buy
https://www.csoonline.com/article/405…
Accelerating Edge Inference for Distributed MoE Models with Latency-Optimized Expert Placement
Tian Wu, Liming Wang, Zijian Wen, Xiaoxi Zhang, Jingpu Duan, Xianwei Zhang, Jinhang Zuo
https://arxiv.org/abs/2508.12851
EgoCross: Benchmarking Multimodal Large Language Models for Cross-Domain Egocentric Video Question Answering
Yanjun Li, Yuqian Fu, Tianwen Qian, Qi'ao Xu, Silong Dai, Danda Pani Paudel, Luc Van Gool, Xiaoling Wang
https://arxiv.org/abs/2508.10729
Mind the Gap: A Closer Look at Tokenization for Multiple-Choice Question Answering with LLMs
Mario Sanz-Guerrero, Minh Duc Bui, Katharina von der Wense
https://arxiv.org/abs/2509.15020
#DH2025 Cha's group trying to come up with generalisable workflows. Working responsibly to apply this new technology. Looking for tasks that researchers have in common, working with LLM-friendly data. Rather than using very large language models, small models do amazing work on very specific tasks.
ViScratch: Using Large Language Models and Gameplay Videos for Automated Feedback in Scratch
Yuan Si, Daming Li, Hanyuan Shi, Jialu Zhang
https://arxiv.org/abs/2509.11065 https:…
Towards a Common Framework for Autoformalization
Agnieszka Mensfelt, David Tena Cucala, Santiago Franco, Angeliki Koutsoukou-Argyraki, Vince Trencsenyi, Kostas Stathis
https://arxiv.org/abs/2509.09810 …
Humans are more gullible than LLMs in believing common psychological myths
Bevan Koopman, Guido Zuccon
https://arxiv.org/abs/2507.12296 https://
Collapse of Irrelevant Representations (CIR) Ensures Robust and Non-Disruptive LLM Unlearning
Filip Sondej, Yushi Yang
https://arxiv.org/abs/2509.11816 https://
Personalized Distractor Generation via MCTS-Guided Reasoning Reconstruction
Tao Wu, Jingyuan Chen, Wang Lin, Jian Zhan, Mengze Li, Kun Kuang, Fei Wu
https://arxiv.org/abs/2508.11184
ICE memo outlines plan to deport migrants to countries where they are not citizens
Federal immigration officers may deport immigrants with as little as six hours’ notice to countries other than their own -- even if officials have not provided any assurances that the new arrivals will be safe from persecution or torture, a top official said in a memo this week.
Todd M. Lyons, the acting director of U.S. Immigration and Customs Enforcement, wrote in a memo to the ICE workforce We…
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
SafeConstellations: Steering LLM Safety to Reduce Over-Refusals Through Task-Specific Trajectory
Utsav Maskey, Sumit Yadav, Mark Dras, Usman Naseem
https://arxiv.org/abs/2508.11290
Pyrosome: Verified Compilation for Modular Metatheory
Dustin Jamner, Gabriel Kammer, Ritam Nag, Adam Chlipala
https://arxiv.org/abs/2507.06360 https://
VisFinEval: A Scenario-Driven Chinese Multimodal Benchmark for Holistic Financial Understanding
Zhaowei Liu, Xin Guo, Haotian Xia, Lingfeng Zeng, Fangqi Lou, Jinyi Niu, Mengping Li, Qi Qi, Jiahuan Li, Wei Zhang, Yinglong Wang, Weige Cai, Weining Shen, Liwen Zhang
https://arxiv.org/abs/2508.09641
Compartmentalised Agentic Reasoning for Clinical NLI
Ma\"el Jullien, Lei Xu, Marco Valentino, Andr\'e Freitas
https://arxiv.org/abs/2509.10222 https://
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
Interleaving Logic and Counting
Johan van Benthem, Thomas Icard
https://arxiv.org/abs/2507.05219 https://arxiv.org/pdf/2507.05219
OpenNav: Open-World Navigation with Multimodal Large Language Models
Mingfeng Yuan, Letian Wang, Steven L. Waslander
https://arxiv.org/abs/2507.18033 https://
Finding Common Ground: Using Large Language Models to Detect Agreement in Multi-Agent Decision Conferences
Selina Heller, Mohamed Ibrahim, David Antony Selby, Sebastian Vollmer
https://arxiv.org/abs/2507.08440
MedReasoner: Reinforcement Learning Drives Reasoning Grounding from Clinical Thought to Pixel-Level Precision
Zhonghao Yan, Muxi Diao, Yuxuan Yang, Jiayuan Xu, Kaizhou Zhang, Ruoyan Jing, Lele Yang, Yanxi Liu, Kongming Liang, Zhanyu Ma
https://arxiv.org/abs/2508.08177
Fine-Tuning Code Language Models to Detect Cross-Language Bugs
Zengyang Li, Yimeng Li, Binbin Huang, Peng Liang, Ran Mo, Hui Liu, Yutao Ma
https://arxiv.org/abs/2507.21954 https…
Can Common VLMs Rival Medical VLMs? Evaluation and Strategic Insights
Yuan Zhong, Ruinan Jin, Xiaoxiao Li, Qi Dou
https://arxiv.org/abs/2506.17337 https://…
Long Context Automated Essay Scoring with Language Models
Christopher Ormerod, Gitit Kehat
https://arxiv.org/abs/2509.10417 https://arxiv.org/pdf/2509.1041…
MSU-Bench: Towards Understanding the Conversational Multi-talker Scenarios
Shuai Wang, Zhaokai Sun, Zhennan Lin, Chengyou Wang, Zhou Pan, Lei Xie
https://arxiv.org/abs/2508.08155
HierMoE: Accelerating MoE Training with Hierarchical Token Deduplication and Expert Swap
Wenxiang Lin, Xinglin Pan, Lin Zhang, Shaohuai Shi, Xuan Wang, Xiaowen Chu
https://arxiv.org/abs/2508.09591
UDA: Unsupervised Debiasing Alignment for Pair-wise LLM-as-a-Judge
Yang Zhang, Cunxiang Wang, Lindong Wu, Wenbo Yu, Yidong Wang, Guangsheng Bao, Jie Tang
https://arxiv.org/abs/2508.09724
From Nodes to Narratives: Explaining Graph Neural Networks with LLMs and Graph Context
Peyman Baghershahi, Gregoire Fournier, Pranav Nyati, Sourav Medya
https://arxiv.org/abs/2508.07117
CORONA: A Coarse-to-Fine Framework for Graph-based Recommendation with Large Language Models
Junze Chen, Xinjie Yang, Cheng Yang, Junfei Bao, Zeyuan Guo, Yawen Li, Chuan Shi
https://arxiv.org/abs/2506.17281
Enhancing COBOL Code Explanations: A Multi-Agents Approach Using Large Language Models
Fangjian Lei, Jiawen Liu, Shayan Noei, Ying Zou, Derek Truong, William Alexander
https://arxiv.org/abs/2507.02182
LLMHoney: A Real-Time SSH Honeypot with Large Language Model-Driven Dynamic Response Generation
Pranjay Malhotra
https://arxiv.org/abs/2509.01463 https://a…
Evaluating the Effectiveness of Large Language Models in Solving Simple Programming Tasks: A User-Centered Study
Kai Deng
https://arxiv.org/abs/2507.04043 …
SocialNav-SUB: Benchmarking VLMs for Scene Understanding in Social Robot Navigation
Michael J. Munje, Chen Tang, Shuijing Liu, Zichao Hu, Yifeng Zhu, Jiaxun Cui, Garrett Warnell, Joydeep Biswas, Peter Stone
https://arxiv.org/abs/2509.08757
Evaluating LLMs on Chinese Idiom Translation
Cai Yang, Yao Dou, David Heineman, Xiaofeng Wu, Wei Xu
https://arxiv.org/abs/2508.10421 https://arxiv.org/pdf/…
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
CLIPin: A Non-contrastive Plug-in to CLIP for Multimodal Semantic Alignment
Shengzhu Yang, Jiawei Du, Shuai Lu, Weihang Zhang, Ningli Wang, Huiqi Li
https://arxiv.org/abs/2508.06434
The role of large language models in UI/UX design: A systematic literature review
Ammar Ahmed, Ali Shariq Imran
https://arxiv.org/abs/2507.04469 https://…
Dr.Copilot: A Multi-Agent Prompt Optimized Assistant for Improving Patient-Doctor Communication in Romanian
Andrei Niculae, Adrian Cosma, Cosmin Dumitrache, Emilian R\v{a}doi
https://arxiv.org/abs/2507.11299
The Impact of Critique on LLM-Based Model Generation from Natural Language: The Case of Activity Diagrams
Parham Khamsepour, Mark Cole, Ish Ashraf, Sandeep Puri, Mehrdad Sabetzadeh, Shiva Nejati
https://arxiv.org/abs/2509.03463
Anyone working in #hospice care in the UK, this might be of interest: pilot courses for teams of 6 to 12 people, on how to use "Clean Language" in conversations about end-of-life care.
1 day in-person, or 2 x half a day if online. CPD certificate.
"Clean Language" is about how you can talk with people and listen to people without accidentally bringing in your own baggage when you didn't mean to.
This skill can help you reach clearer/deeper understanding of other people's worlds. And it can mean you're less likely to set off unnecessary disagreements, so that conversations go more smoothly.
Pilot aimed at South West, Yorkshire & North East. I asked Judy why those areas in particular, and she said those are areas where they're building up "clusters of expertise", but actually it could be any UK end-of-life group.
(I slightly know Judy from having been to a little community event of hers about Clean Language in healthcare. And we have colleagues in common.)
#SouthWestEngland #Yorkshire #NorthEastEngland #healthcare #HospiceCare #death #dying #EndOfLife #communication #CleanLanguage #England #UK
CAD2DMD-SET: Synthetic Generation Tool of Digital Measurement Device CAD Model Datasets for fine-tuning Large Vision-Language Models
Jo\~ao Valente, Atabak Dehban, Rodrigo Ventura
https://arxiv.org/abs/2508.21732
LDI: Localized Data Imputation
Soroush Omidvartehrani, Davood Rafiei
https://arxiv.org/abs/2506.16616 https://arxiv.org/pdf/2506.1661…
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
An Investigation Into Secondary School Students' Debugging Behaviour in Python
Laurie Gale, Sue Sentance
https://arxiv.org/abs/2508.14833 https://arxiv…
Structural Code Search using Natural Language Queries
Ben Limpanukorn, Yanjun Wang, Zach Patterson, Pranav Garg, Murali Krishna Ramanathan, Xiaofei Ma, Anoop Deoras, Miryung Kim
https://arxiv.org/abs/2507.02107
TAGAL: Tabular Data Generation using Agentic LLM Methods
Beno\^it Ronval, Pierre Dupont, Siegfried Nijssen
https://arxiv.org/abs/2509.04152 https://arxiv.o…
Building High-Quality Datasets for Portuguese LLMs: From Common Crawl Snapshots to Industrial-Grade Corpora
Thales Sales Almeida, Rodrigo Nogueira, Helio Pedrini
https://arxiv.org/abs/2509.08824
Learning the non-Markovian features of subsystem dynamics
Michele Coppola, Mari Carmen Ba\~nuls, Zala Lenar\v{c}i\v{c}
https://arxiv.org/abs/2507.14133 htt…
ASP-FZN: A Translation-based Constraint Answer Set Solver
Thomas Eiter, Tobias Geibinger, Tobias Kaminski, Nysret Musliu, Johannes Oetsch
https://arxiv.org/abs/2507.22774 https:…
From CVE Entries to Verifiable Exploits: An Automated Multi-Agent Framework for Reproducing CVEs
Saad Ullah, Praneeth Balasubramanian, Wenbo Guo, Amanda Burnett, Hammond Pearce, Christopher Kruegel, Giovanni Vigna, Gianluca Stringhini
https://arxiv.org/abs/2509.01835
Less is More: Token-Efficient Video-QA via Adaptive Frame-Pruning and Semantic Graph Integration
Shaoguang Wang (The Hong Kong University of Science and Technology), Jianxiang He (The Hong Kong University of Science and Technology), Yijie Xu (The Hong Kong University of Science and Technology), Ziyang Chen (The Hong Kong University of Science and Technology), Weiyu Guo (The Hong Kong University of Science and Technology), Hui Xiong (The Hong Kong University of Science and Technology)
Multimodal Information Retrieval for Open World with Edit Distance Weak Supervision
KMA Solaiman, Bharat Bhargava
https://arxiv.org/abs/2506.20070 https://…
An Agentic Model Context Protocol Framework for Medical Concept Standardization
Jaerong Ahn, Andrew Wen, Nan Wang, Heling Jia, Zhiyi Yue, Sunyang Fu, Hongfang Liu
https://arxiv.org/abs/2509.03828
LLMxCPG: Context-Aware Vulnerability Detection Through Code Property Graph-Guided Large Language Models
Ahmed Lekssays, Hamza Mouhcine, Khang Tran, Ting Yu, Issa Khalil
https://arxiv.org/abs/2507.16585
AI Agents for Web Testing: A Case Study in the Wild
Naimeng Ye, Xiao Yu, Ruize Xu, Tianyi Peng, Zhou Yu
https://arxiv.org/abs/2509.05197 https://arxiv.org/…
Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla
Md Sazzadul Islam Ridoy, Sumi Akter, Md. Aminur Rahman
https://arxiv.org/abs/2507.01931
Who's Sorry Now: User Preferences Among Rote, Empathic, and Explanatory Apologies from LLM Chatbots
Zahra Ashktorab, Alessandra Buccella, Jason D'Cruz, Zoe Fowler, Andrew Gill, Kei Yan Leung, P. D. Magnus, John Richards, Kush R. Varshney
https://arxiv.org/abs/2507.02745
Data Diversification Methods In Alignment Enhance Math Performance In LLMs
Berkan Dokmeci, Qingyang Wu, Ben Athiwaratkun, Ce Zhang, Shuaiwen Leon Song, James Zou
https://arxiv.org/abs/2507.02173
Measuring How (Not Just Whether) VLMs Build Common Ground
Saki Imai, Mert \.Inan, Anthony Sicilia, Malihe Alikhani
https://arxiv.org/abs/2509.03805 https://
Cross-lingual Data Selection Using Clip-level Acoustic Similarity for Enhancing Low-resource Automatic Speech Recognition
Shunsuke Mitsumori, Sara Kashiwagi, Keitaro Tanaka, Shigeo Morishima
https://arxiv.org/abs/2506.22194
Architecture is All You Need: Improving LLM Recommenders by Dropping the Text
Kevin Foley, Shaghayegh Agah, Kavya Priyanka Kakinada
https://arxiv.org/abs/2506.15833
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
LLaSO: A Foundational Framework for Reproducible Research in Large Language and Speech Model
Yirong Sun, Yizhong Geng, Peidong Wei, Yanjun Chen, Jinghan Yang, Rongfei Chen, Wei Zhang, Xiaoyu Shen
https://arxiv.org/abs/2508.15418
Benchmarking LLMs for Unit Test Generation from Real-World Functions
Dong Huang, Jie M. Zhang, Mark Harman, Qianru Zhang, Mingzhe Du, See-Kiong Ng
https://arxiv.org/abs/2508.00408
Finance Language Model Evaluation (FLaME)
Glenn Matlin, Mika Okamoto, Huzaifa Pardawala, Yang Yang, Sudheer Chava
https://arxiv.org/abs/2506.15846 https://…
Evaluating LLMs for Visualization Generation and Understanding
Saadiq Rauf Khan, Vinit Chandak, Sougata Mukherjea
https://arxiv.org/abs/2507.22890 https://…
Beyond Two-Stage Training: Cooperative SFT and RL for LLM Reasoning
Liang Chen, Xueting Han, Li Shen, Jing Bai, Kam-Fai Wong
https://arxiv.org/abs/2509.06948 https://
Towards Recommending Usability Improvements with Multimodal Large Language Models
Sebastian Lubos, Alexander Felfernig, Gerhard Leitner, Julian Schwazer
https://arxiv.org/abs/2508.16165
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
The Anatomy of a Personal Health Agent
A. Ali Heydari, Ken Gu, Vidya Srinivas, Hong Yu, Zhihan Zhang, Yuwei Zhang, Akshay Paruchuri, Qian He, Hamid Palangi, Nova Hammerquist, Ahmed A. Metwally, Brent Winslow, Yubin Kim, Kumar Ayush, Yuzhe Yang, Girish Narayanswamy, Maxwell A. Xu, Jake Garrison, Amy Aremnto Lee, Jenny Vafeiadou, Ben Graef, Isaac R. Galatzer-Levy, Erik Schenck, Andrew Barakat, Javier Perez, Jacqueline Shreibati, John Hernandez, Anthony Z. Faranesh, Javier L. Prieto, Conn…
Time to Split: Exploring Data Splitting Strategies for Offline Evaluation of Sequential Recommenders
Danil Gusak, Anna Volodkevich, Anton Klenitskiy, Alexey Vasilev, Evgeny Frolov
https://arxiv.org/abs/2507.16289
Boosting Pathology Foundation Models via Few-shot Prompt-tuning for Rare Cancer Subtyping
Dexuan He, Xiao Zhou, Wenbin Guan, Liyuan Zhang, Xiaoman Zhang, Sinuo Xu, Ge Wang, Lifeng Wang, Xiaojun Yuan, Xin Sun, Yanfeng Wang, Kun Sun, Ya Zhang, Weidi Xie
https://arxiv.org/abs/2508.15904
Decision-oriented Text Evaluation
Yu-Shiang Huang, Chuan-Ju Wang, Chung-Chi Chen
https://arxiv.org/abs/2507.01923 https://arxiv.org/p…
LLMREI: Automating Requirements Elicitation Interviews with LLMs
Alexander Korn, Samuel Gorsch, Andreas Vogelsang
https://arxiv.org/abs/2507.02564 https://…
adjnoun: Word adjacencies of David Copperfield
A network of word adjacencies of common adjectives and nouns in the novel "David Copperfield" by Charles Dickens.
This network has 112 nodes and 425 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/adjnoun
NodeShield: Runtime Enforcement of Security-Enhanced SBOMs for Node.js
Eric Cornelissen, Musard Balliu
https://arxiv.org/abs/2508.13750 https://arxiv.org/p…
DaiFu: In-Situ Crash Recovery for Deep Learning Systems
Zilong He, Pengfei Chen, Hongyu Zhang, Xiaoyun Li, Guangba Yu, Hongyang Chen, Zibin Zheng
https://arxiv.org/abs/2507.01628 …
Insights into User Interface Innovations from a Design Thinking Workshop at deRSE25
Maximilian Frank, Simon Lund
https://arxiv.org/abs/2508.18784 https://a…
Interpretable Mnemonic Generation for Kanji Learning via Expectation-Maximization
Jaewook Lee, Alexander Scarlatos, Andrew Lan
https://arxiv.org/abs/2507.05137
LLM-as-classifier: Semi-Supervised, Iterative Framework for Hierarchical Text Classification using Large Language Models
Doohee You, Andy Parisi, Zach Vander Velden, Lara Dantas Inojosa
https://arxiv.org/abs/2508.16478
PRIMMDebug: A Debugging Teaching Aid For Secondary Students
Laurie Gale, Sue Sentance
https://arxiv.org/abs/2508.18875 https://arxiv.org/pdf/2508.18875
The Digital Sous Chef -- A Comparative Study on Fine-Tuning Language Models for Recipe Generation
Shubham Pundhir, Ganesh Bagler
https://arxiv.org/abs/2508.14718 https://…
Generalizing Verifiable Instruction Following
Valentina Pyatkin, Saumya Malik, Victoria Graf, Hamish Ivison, Shengyi Huang, Pradeep Dasigi, Nathan Lambert, Hannaneh Hajishirzi
https://arxiv.org/abs/2507.02833
Revisiting Active Learning under (Human) Label Variation
Cornelia Gruber, Helen Alber, Bernd Bischl, G\"oran Kauermann, Barbara Plank, Matthias A{\ss}enmacher
https://arxiv.org/abs/2507.02593
Agentic AI for Software: thoughts from Software Engineering community
Abhik Roychoudhury
https://arxiv.org/abs/2508.17343 https://arxiv.org/pdf/2508.17343
Going over Fine Web with a Fine-Tooth Comb: Technical Report of Indexing Fine Web for Problematic Content Search and Retrieval
In\'es Altemir Marinas, Anastasiia Kucherenko, Andrei Kucharavy
https://arxiv.org/abs/2508.21788
Middo: Model-Informed Dynamic Data Optimization for Enhanced LLM Fine-Tuning via Closed-Loop Learning
Zinan Tang, Xin Gao, Qizhi Pei, Zhuoshi Pan, Mengzhang Cai, Jiang Wu, Conghui He, Lijun Wu
https://arxiv.org/abs/2508.21589
Towards Better Requirements from the Crowd: Developer Engagement with Feature Requests in Open Source Software
Pragyan K C, Rambod Ghandiparsi, Thomas Herron, John Heaps, Mitra Bokaei Hosseini
https://arxiv.org/abs/2507.13553
Re-Representation in Sentential Relation Extraction with Sequence Routing Algorithm
Ramazan Ali Bahrami, Ramin Yahyapour
https://arxiv.org/abs/2508.21049 https://
Perspectives in Play: A Multi-Perspective Approach for More Inclusive NLP Systems
Benedetta Muscato, Lucia Passaro, Gizem Gezici, Fosca Giannotti
https://arxiv.org/abs/2506.20209 …
When Life Gives You Samples: The Benefits of Scaling up Inference Compute for Multilingual LLMs
Ammar Khairi, Daniel D'souza, Ye Shen, Julia Kreutzer, Sara Hooker
https://arxiv.org/abs/2506.20544
Evolutionary Feature-wise Thresholding for Binary Representation of NLP Embeddings
Soumen Sinha, Shahryar Rahnamayan, Azam Asilian Bidgoli
https://arxiv.org/abs/2507.17025
NVIDIA Nemotron Nano 2: An Accurate and Efficient Hybrid Mamba-Transformer Reasoning Model
NVIDIA, :, Aarti Basant, Abhijit Khairnar, Abhijit Paithankar, Abhinav Khattar, Adi Renduchintala, Adithya Renduchintala, Aditya Malte, Akhiad Bercovich, Akshay Hazare, Alejandra Rico, Aleksander Ficek, Alex Kondratenko, Alex Shaposhnikov, Ali Taghibakhshi, Amelia Barton, Ameya Sunil Mahabaleshwarkar, Amy Shen, Andrew Tao, Ann Guan, Anna Shors, Anubhav Mandarwal, Arham Mehta, Arun Venkatesan, As…
Prediction is not Explanation: Revisiting the Explanatory Capacity of Mapping Embeddings
Hanna Herasimchyk, Alhassan Abdelhalim, S\"oren Laue, Michaela Regneri
https://arxiv.org/abs/2508.13729