"The overall life expectancy of a programming language has dwindled in the past 56 years. A COBOL developer in the 1960s most probably retired in the 2000s, still writing COBOL. As a former professional VBScript, then C#, then Objective-C, later Swift, and finally Go developer, I can only see this trend accelerating. We should expect our favorite programming language to be replaced and removed from the market in a relatively shorter time every decade."
Proceedings of the 19th International Workshop on Logical and Semantic Frameworks, with Applications
Cynthia Kop (Radboud Universiteit Nijmegen), Helida Salles Santos (Universidade Federal do Rio Grande)
https://arxiv.org/abs/2506.05219
Tone recognition in low-resource languages of North-East India: peeling the layers of SSL-based speech models
Parismita Gogoi, Sishir Kalita, Wendy Lalhminghlui, Viyazonuo Terhiija, Moakala Tzudir, Priyankoo Sarmah, S. R. M. Prasanna
https://arxiv.org/abs/2506.03606
from my link log —
SKIM: The implementation of functional languages using custom hardware.
https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-81.html
saved 2025-03-22
Great news. Canada is doing offering government services (commercial driver’s license tests) in Ojibwe/Anishinaabemowin.
It’s a crime and a tragedy that indigenous languages are in danger of dying and everything that can be done to fight that is for the better, especially by governments using them.
"Professor Cain uses various programming languages to explain these concepts: he starts with C (roughly from the second lecture to the 8th), then Assembly (lectures 9 to 11), C (lectures 12 to 18), Scheme (lectures 19 to 23), and Python (from 24 to 26). The last lecture (27) dives into some other functional programming languages like ML, Miranda, and even Haskell, as well as some advanced type design concepts, to round up your general programming knowledge."
A look at India's push to compete in the global AI race, as the country's vast linguistic diversity poses a core challenge to building foundational AI models (Shadma Shaikh/MIT Technology Review)
https://www.technologyreview.com/2025/07/0
from my link log —
Vectorized interpreters: mass rapid transit for programming languages.
http://venge.net/graydon/talks/VectorizedInterpretersTalk-2023-05-12.pdf
saved 2025-06-05
[2025-06-06 Fri (UTC), 2 new articles found for cs.FL Formal Languages and Automata Theory]
#toXiv_bot_toot
Facts are Harder Than Opinions -- A Multilingual, Comparative Analysis of LLM-Based Fact-Checking Reliability
Lorraine Saju, Arnim Bleier, Jana Lasser, Claudia Wagner
https://arxiv.org/abs/2506.03655

Facts are Harder Than Opinions -- A Multilingual, Comparative Analysis of LLM-Based Fact-Checking Reliability
The proliferation of misinformation necessitates scalable, automated fact-checking solutions. Yet, current benchmarks often overlook multilingual and topical diversity. This paper introduces a novel, dynamically extensible data set that includes 61,514 claims in multiple languages and topics, extending existing datasets up to 2024. Through a comprehensive evaluation of five prominent Large Language Models (LLMs), including GPT-4o, GPT-3.5 Turbo, LLaMA 3.1, and Mixtral 8x7B, we identify signific…
Read it in Two Steps: Translating Extremely Low-Resource Languages with Code-Augmented Grammar Books
Chen Zhang, Jiuheng Lin, Xiao Liu, Zekai Zhang, Yansong Feng
https://arxiv.org/abs/2506.01796
hdl2v: A Code Translation Dataset for Enhanced LLM Verilog Generation
Charles Hong, Brendan Roberts, Huijae An, Alex Um, Advay Ratan, Yakun Sophia Shao
https://arxiv.org/abs/2506.04544
As I'm learning Dutch, I'm reminded that the idea that there are people who believe that the bible is to be taken literally. The idea that a several hundred year old translation of a collection of texts in multiple languages, that were themselves translated multiple times between languages, before the whole thing was translated to Latin, then being translated to English, could somehow perfectly reflect the original text... Yeah, it's only possible to believe that if you have no idea how languages work and have never learned another language.
Like, just from linguistic drift alone if the bible were written in King James English you're losing *so* much context. But Hebrew, Aramaic, and Greek translated to Latin, then to English, then to English again?
There are so many things that erg can't be translated, even as a beginner. Dutch and English are two of the closest languages that exist, they're both Germanic languages and they're the closest to each other (other than Friesian). You can't really be much closer, and yet, there are so many things you can't mutually represent. Hebrew and Latin, Aramaic and Latin, Latin and English, Greek and English, these aren't even the same families at all... They're extremely distant. There's absolutely no way to represent concepts from one to another without another book's worth of explanation.
And that ignores all the cultural context, which is mostly lost and a library and decade of education to get the stuff that we *do* know.
Only monolingual Americans could come up with an idea so incredibly asinine.
[2025-06-05 Thu (UTC), no new articles found for cs.PL Programming Languages]
#toXiv_bot_toot
MERIT: Multilingual Semantic Retrieval with Interleaved Multi-Condition Query
Wei Chow, Yuan Gao, Linfeng Li, Xian Wang, Qi Xu, Hang Song, Lingdong Kong, Ran Zhou, Yi Zeng, Yidong Cai, Botian Jiang, Shilin Xu, Jiajun Zhang, Minghui Qiu, Xiangtai Li, Tianshu Yang, Siliang Tang, Juncheng Li
https://arxiv.org/abs/2506.03144
Hubert-Félix Thiéfaine and Nick Cave are two impersonators of the same djinn singing in two different languages at once
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
[2025-06-05 Thu (UTC), no new articles found for cs.FL Formal Languages and Automata Theory]
#toXiv_bot_toot
[2025-06-06 Fri (UTC), no new articles found for cs.PL Programming Languages]
#toXiv_bot_toot
Excellent keynote in the #SemDH2025 workshop by Laura Hollink on Cultural Bias in Linked Open Data. Laura is addressing all bias related aspects in cultural heritage items itself, in the data representing it, the data schemata, vocabularies, and ontologies on which the data are based, as well as in the knowledge representation languages used to create the schemata.
How Morgan Stanley is using its DevGen.AI tool, built in-house on OpenAI's GPT models, to translate legacy code into modern coding languages (Isabelle Bousquette/Wall Street Journal)
https://www.wsj.com/article…
The Trump administration has accomplished something that Hitler, Stalin, Mao, and other dictators desired.
-- It destroyed the Voice of America.
Until mid-March, VOA had been on the air continuously for 83 years.
Starting in 1942 with shortwave broadcasts in German to counter Nazi propaganda,
America’s external voice had expanded to nearly 50 languages,
with a weekly combined audience of more than 350 million people worldwide, watching on TV, listening on radi…
Language learning has been part of me since high school. I'm solid in 2 non-English languages, crappy but survivable in 2 others. I've played with & started learning others many times.
I'm real busy rn, but language learning could be a fun thing to do for myself & make me feel like I'm still me.
But I'm stumped about my language picks. I learnt the obvious European languages in school; later tried key Asian languages. What do I want to do now?
African languages? I won't be getting a chance to use them much in Aus, & I'm unlikely to get to a stage where I can read literature.
I tried Slovenian/Slovene on a whim & really love it, but I'll never go there. Is the practical but unfun answer grind out more kanji/hanzi? Or is whimsically learning a language spoken by only 2.5 million people reasonable? I will continue struggling through with Ukrainian, 'cause I think it's important.
#LanguageLearning
Across Programming Language Silos: A Study on Cross-Lingual Retrieval-augmented Code Generation
Qiming Zhu, Jialun Cao, Xuanang Chen, Yaojie Lu, Hongyu Lin, Xianpei Han, Le Sun, Shing-Chi Cheung
https://arxiv.org/abs/2506.03535
Replaced article(s) found for cs.FL. https://arxiv.org/list/cs.FL/new
[1/1]:
- Dynamic Membership for Regular Tree Languages
Antoine Amarilli, Corentin Barloy, Louis Jachiet, Charles Paperman
Dhvani: A Weakly-supervised Phonemic Error Detection and Personalized Feedback System for Hindi
Arnav Rustagi, Satvik Bajpai, Nimrat Kaur, Siddharth Siddharth
https://arxiv.org/abs/2506.02166
"It's this brutal fragility of vector stacks — which are used by most modern computer languages — which makes software people so wary of fully exploiting the beauty and power of recursion, and I really think that's a shame"
#Lisp
Breaking the Barriers of Text-Hungry and Audio-Deficient AI
Hamidou Tembine, Issa Bamia, Massa NDong, Bakary Coulibaly, Oumar Issiaka Traore, Moussa Traore, Moussa Sanogo, Mamadou Eric Sangare, Salif Kante, Daryl Noupa Yongueng, Hafiz Tiomoko Ali, Malik Tiomoko, Frejus Laleye, Boualem Djehiche, Wesmanegda Elisee Dipama, Idris Baba Saje, Hammid Mohammed Ibrahim, Moumini Sanogo, Marie Coursel Nininahazwe, Abdul-Latif Siita, Haine Mhlongo, Teddy Nelvy Dieu Merci Kouka, Mariam Serine Jerid…
Decision algorithms for fragments of real analysis. III: A theory of differentiable functions with (semi-)open intervals
G. Buriola, D. Cantone, G. Cincotti, E. G. Omodeo, G. T. Spart\`a
https://arxiv.org/abs/2507.02742
Eka-Eval : A Comprehensive Evaluation Framework for Large Language Models in Indian Languages
Samridhi Raj Sinha, Rajvee Sheth, Abhishek Upperwal, Mayank Singh
https://arxiv.org/abs/2507.01853
Replaced article(s) found for cs.PL. https://arxiv.org/list/cs.PL/new
[1/1]:
- A Lightweight Method for Generating Multi-Tier JIT Compilation Virtual Machine in a Meta-Tracing ...
Yusuke Izawa, Hidehiko Masuhara, Carl Friedrich Bolz-Tereick
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
Replaced article(s) found for cs.FL. https://arxiv.org/list/cs.FL/new
[1/1]:
- Universality Frontier for Asynchronous Cellular Automata
Ivan Baburin, Matthew Cook, Florian Gr\"otschla, Andreas Plesner, Roger Wattenhofer
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[2/3]:
- Traveling Across Languages: Benchmarking Cross-Lingual Consistency in Multimodal LLMs
Hao Wang, Pinzhi Huang, Jihan Yang, Saining Xie, Daisuke Kawahara
[2025-06-04 Wed (UTC), 2 new articles found for cs.PL Programming Languages]
#toXiv_bot_toot
[2025-07-04 Fri (UTC), 1 new article found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
from my link log —
HAFLANG: hardware acceleration of functional languages.
https://haflang.github.io/
saved 2025-03-22 https://dotat.at…
San Diego-based Clearspeed, which offers AI-driven voice-based risk assessment tech for 60 languages, raised a $60M Series D, taking its total funding to $110M (Duncan Riley/SiliconANGLE)
https://siliconangle.com/2025/06/26/cl
[2025-06-04 Wed (UTC), no new articles found for cs.FL Formal Languages and Automata Theory]
#toXiv_bot_toot
[2025-06-03 Tue (UTC), 2 new articles found for cs.PL Programming Languages]
#toXiv_bot_toot
Programmable Co-Transcriptional Splicing: Realizing Regular Languages via Hairpin Deletion
Da-Jung Cho, Szil\'ard Zsolt Fazekas, Shinnosuke Seki, Max Wiedenh\"oft
https://arxiv.org/abs/2506.23384
Flow2Code: Evaluating Large Language Models for Flowchart-based Code Generation Capability
Mengliang He, Jiayi Zeng, Yankai Jiang, Wei Zhang, Zeming Liu, Xiaoming Shi, Aimin Zhou
https://arxiv.org/abs/2506.02073
Adaptability of ASR Models on Low-Resource Language: A Comparative Study of Whisper and Wav2Vec-BERT on Bangla
Md Sazzadul Islam Ridoy, Sumi Akter, Md. Aminur Rahman
https://arxiv.org/abs/2507.01931
[2025-07-03 Thu (UTC), 1 new article found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
Replaced article(s) found for cs.PL. https://arxiv.org/list/cs.PL/new
[1/1]:
- The Cyan Language
Jos\'e de Oliveira Guimar\~aes
https://
[2025-06-03 Tue (UTC), 1 new article found for cs.FL Formal Languages and Automata Theory]
#toXiv_bot_toot
[2025-06-02 Mon (UTC), 1 new article found for cs.PL Programming Languages]
#toXiv_bot_toot
Replaced article(s) found for cs.FL. https://arxiv.org/list/cs.FL/new
[1/1]:
- Computing Threshold Budgets in Discrete-Bidding Games
Guy Avni, Suman Sadhukhan
MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining
Zhixun Chen, Ping Guo, Wenhan Han, Yifan Zhang, Binbin Liu, Haobin Lin, Fengze Liu, Yan Zhao, Bingni Zhang, Taifeng Wang, Yin Zheng, Meng Fang
https://arxiv.org/abs/2507.01785
Efficient Multilingual ASR Finetuning via LoRA Language Experts
Jiahong Li, Yiwen Shao, Jianheng Zhuo, Chenda Li, Liliang Tang, Dong Yu, Yanmin Qian
https://arxiv.org/abs/2506.21555
Replaced article(s) found for cs.FL. https://arxiv.org/list/cs.FL/new
[1/1]:
- Bridging Chaos Game Representations and $k$-mer Frequencies of DNA Sequences
Haoze He, Lila Kari, Pablo Millan Arias
Transferable Modeling Strategies for Low-Resource LLM Tasks: A Prompt and Alignment-Based
Shuangquan Lyu, Yingnan Deng, Guiran Liu, Zhen Qi, Ruotong Wang
https://arxiv.org/abs/2507.00601
[2025-07-02 Wed (UTC), 1 new article found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
Contrasting Cognitive Styles in Vision-Language Models: Holistic Attention in Japanese Versus Analytical Focus in English
Ahmed Sabir, Azinovi\v{c} Gasper, Mengsay Loem, Rajesh Sharma
https://arxiv.org/abs/2507.00700
[2025-06-02 Mon (UTC), no new articles found for cs.FL Formal Languages and Automata Theory]
#toXiv_bot_toot