
2025-06-18 08:16:09
Measure-Theoretic Aspects of Star-Free and Group Languages
Ryoma Sin'ya, Takao Yuyama
https://arxiv.org/abs/2506.14134 https://ar…
Measure-Theoretic Aspects of Star-Free and Group Languages
Ryoma Sin'ya, Takao Yuyama
https://arxiv.org/abs/2506.14134 https://ar…
word_adjacency: Word Adjacency Networks
Directed Networks of word adjacency in texts of several languages including English, French, Spanish and Japanese.
This network has 7381 nodes and 46281 edges.
Tags: Informational, Language, Unweighted
https://networks.skewed.de/net/word_ad
Optimized Execution of FreeCHR
Sascha Rechenberger, Thom Fr\"uhwirth
https://arxiv.org/abs/2506.14485 https://arxiv.org/pdf/2506…
The Teacher's Dilemma: Balancing Trade-Offs in Programming Education for Emergent Bilingual Students
Emma R. Dodoo, Tamara Nelson-Fromm, Mark Guzdial
https://arxiv.org/abs/2506.14147
Evolving music theory for emerging musical languages
Emmanuel Deruty
https://arxiv.org/abs/2506.14504 https://arxiv.org/pdf/2506.1450…
A Rigorous Evaluation of LLM Data Generation Strategies for Low-Resource Languages
Tatiana Ankinina, Jan Cegin, Jakub Simko, Simon Ostermann
https://arxiv.org/abs/2506.12158
Forty years ago I moved to Scotland and tried to read the classic “A Scots Quair” by Lewis Grassic Gibbon. It’s written in a variant of Scots that aimed to be accessible to English readers. I had the Scots dictionary by my side and was looking up words constantly until I gave up. A few years later (with some more experience of hearing Scots spoken) I read it without the dictionary and, while I was occasionally guessing at words from context, I went with the flow and loved it.
Like mos…
We have translators who work on commission for the journal I edit (it's a bilingual publication - articles are in either English or French with abstracts and author bios for every article in both languages). The translators tend to be pretty good, most often only having trouble with technical terms (which is understandable - this happens when you're not a specialist). Still, on occasion, they manage to produce something spectacularly bad. No, the French for "He is" is not &…
I've been testing "vibe coding" lately. Takeaways:
* It works
* Gemini is my favorite
* They all make mistakes, but if you tell them what broke they will try to fix the issue
* Important: I already know how to program in multiple languages but not to this level
https://github.com/DennisF…
Group then Scale: Dynamic Mixture-of-Experts Multilingual Language Model
Chong Li, Yingzhuo Deng, Jiajun Zhang, Chengqing Zong
https://arxiv.org/abs/2506.12388
Replaced article(s) found for cs.PL. https://arxiv.org/list/cs.PL/new
[1/1]:
A Performance Model for Warp Specialization Kernels
https://
Atys: An Efficient Profiling Framework for Identifying Hotspot Functions in Large-scale Cloud Microservices
Jiaqi Sun, Dingyu Yang, Shiyou Qian, Jian Cao, Guangtao Xue
https://arxiv.org/abs/2506.15523
from my link log —
Implementing dependent types in pi-forall.
https://arxiv.org/abs/2207.02129
saved 2025-06-06 https://dotat.at/…
Replaced article(s) found for cs.PL. https://arxiv.org/list/cs.PL/new
[1/1]:
- A refined operational semantics for FreeCHR
Sascha Rechenberger, Thom Fr\"uhwirth
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
Empirical Evaluation of Large Language Models in Automated Program Repair
Jiajun Sun, Fengjie Li, Xinzhu Qi, Hongyu Zhang, Jiajun Jiang
https://arxiv.org/abs/2506.13186
Towards Safety and Security Testing of Cyberphysical Power Systems by Shape Validation
Alexander Geiger, Immanuel Hacker, \"Omer Sen, Andreas Ulbig
https://arxiv.org/abs/2506.12466
[2025-06-19 Thu (UTC), 1 new article found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
Harvard releases Institutional Books 1.0, a dataset for AI researchers with 242B tokens, from 394M scanned pages and 983K public domain books in 254 languages (Matt O'Brien/Associated Press)
https://apnews.com/article/ai-chatbot-
Identifying and Investigating Global News Coverage of Critical Events Such as Disasters and Terrorist Attacks
Erica Cai, Xi Chen, Reagan Grey Keeney, Ethan Zuckerman, Brendan O'Connor, Przemyslaw A. Grabowicz
https://arxiv.org/abs/2506.12925
I have time to experiment with different programming languages and while I'm a big fan of functional or functional style programming, my recent obsession is with #Go
It is a tremendously simple language, without surprises or elaborate mechanisms, procedural and totally boring.. and I love it.
Most satisfying thing is life reload with air and it's usually already compiled and…
{tesseract} allows you to read text from images https://docs.ropensci.org/tesseract/ it can also be combined with {magick} https://
Nominal Equational Rewriting and Narrowing
Mauricio Ayala-Rinc\'on (University of Bras\'ilia, Brazil), Maribel Fern\'andez (King's College London, UK), Daniele Nantes-Sobrinho (University of Bras\'ilia, Brazil,Imperial College London, UK), Daniella Santaguida (University of Bras\'ilia, Brazil)
https://arx…
[2025-06-18 Wed (UTC), 2 new articles found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
[2025-06-18 Wed (UTC), 1 new article found for cs.PL Programming Languages]
toXiv_bot_toot
GLAP: General contrastive audio-text pretraining across domains and languages
Heinrich Dinkel, Zhiyong Yan, Tianzi Wang, Yongqing Wang, Xingwei Sun, Yadong Niu, Jizhong Liu, Gang Li, Junbo Zhang, Jian Luan
https://arxiv.org/abs/2506.11350
[2025-06-19 Thu (UTC), 2 new articles found for cs.PL Programming Languages]
toXiv_bot_toot
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
Can we "seamlessly" divide a polygon?
Byungchang So
https://arxiv.org/abs/2506.11742 https://arxiv.org/pdf/2506.11742
Magnetoencephalography (MEG) Based Non-Invasive Chinese Speech Decoding
Zhihong Jia, Hongbin Wang, Yuanzhong Shen, Feng Hu, Jiayu An, Kai Shu, Dongrui Wu
https://arxiv.org/abs/2506.12817
Looks like github copilot PR review now supports all the languages in public preview. This will be useful for me as I commit #fsharp code a lot. In fact I had a PR today that it reviewed, found a few decent suggestions actually.
SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists
Lynn Khellaf, Ipek Baris Schlicht, Tilman Mirass, Julia Bayer, Tilman Wagner, Ruben Bouwmeester
https://arxiv.org/abs/2506.13188
https://youtube.com/watch?v=wfpjNdhpMzg
You know when a foreigner teaches you more about your own country than you know? This guy is such a good subject matter expert at languages, that by accident he knows things about Australian languages and culture that I guarantee you won…
Language Surgery in Multilingual Large Language Models
Joanito Agili Lopo, Muhammad Ravi Shulthan Habibi, Tack Hwa Wong, Muhammad Ilham Ghozali, Fajri Koto, Genta Indra Winata, Peerat Limkonchotiwat, Alham Fikri Aji, Samuel Cahyawijaya
https://arxiv.org/abs/2506.12450
Understanding nature's choice of genetic languages
Apoorva D. Patel
https://arxiv.org/abs/2505.06718 https://arxiv.org/pdf/2505.0…
ProtocolLLM: RTL Benchmark for SystemVerilog Generation of Communication Protocols
Arnav Sheth, Ivaxi Sheth, Mario Fritz
https://arxiv.org/abs/2506.07945 h…
Multimodal Zero-Shot Framework for Deepfake Hate Speech Detection in Low-Resource Languages
Rishabh Ranjan, Likhith Ayinala, Mayank Vatsa, Richa Singh
https://arxiv.org/abs/2506.08372
“The UK must spend more on defence or be ready to speak Russian”
I think Mark Rutte is grossly overestimating the British ability to learn foreign languages 😂😂
https://www.standard.co.uk/news/politics/nato-chief-m…
A former employee says fewer than 10,000 people use Ola Krutrim's LLM chatbot, which supports 10 Indian languages, and that over 60% of them are random testers (Swathi Moorthy/The Economic Times)
https://
Replaced article(s) found for cs.FL. https://arxiv.org/list/cs.FL/new
[1/1]:
Regular Grammars for Sets of Graphs of Tree-Width 2
https://
I think strong and weak typing in programming languages is actually a spectrum rather than a binary classification.
See terraform for example:
> All values have a type, which dictates where that value can be used and what transformations can be applied to it.
https://developer.hashicorp…
Training-free LLM Merging for Multi-task Learning
Zichuan Fu, Xian Wu, Yejing Wang, Wanyu Wang, Shanshan Ye, Hongzhi Yin, Yi Chang, Yefeng Zheng, Xiangyu Zhao
https://arxiv.org/abs/2506.12379
(1/4) “When the day of Pentecost had come, they were all together in 1 place. And suddenly from heaven there came a sound like the rush of a violent wind, & it filled the entire house where they were sitting. Divided tongues, as of fire, appeared among them, & a tongue rested on each of them. All of them were filled with the Holy Spirit & began to speak in other languages, as the Spirit gave them ability. Now there were devout Jews from every nation under heaven living in Jerusal…
YouTube rolls out a tool to let some creators upload different thumbnails for each video dubbed into a different language, to help expand their global audience (Dan Whateley/Business Insider)
https://www.businessinsider.com/youtube-te
TIL: »Specific typographic rules have been developed for each language for centuries, but in recent decades, especially due to globalisation and the unification of software tools, they have been disregarded. The international project of a typographic proofreader for European languages will preserve these rules as an expression of European cultural diversity for future generations.«
Source:
Replaced article(s) found for cs.PL. https://arxiv.org/list/cs.PL/new
[1/1]:
Opportunistically Parallel Lambda Calculus
https://
Proceedings of the 19th International Workshop on Logical and Semantic Frameworks, with Applications
Cynthia Kop (Radboud Universiteit Nijmegen), Helida Salles Santos (Universidade Federal do Rio Grande)
https://arxiv.org/abs/2506.05219
"We cannot preclude developers from “vibe coding” their way into a working application; but we can teach them how to properly integrate the very likely spaghetti mess produced by those bullshit machines, how to understand it, and how to make it work with today’s compilers, which, let us be honest: are the best we have ever had, and it would be a shame to ignore them completely."
wikipedia_link: Wikipedia links (2016)
Networks of hyperlinks among articles on Wikipedia, for all available languages. A directed edge (i,j) indicates that article i hyperlinks to j.
This network has 8758 nodes and 335267 edges.
Tags: Informational, Web graph, Unweighted
https://networks.skewed.de/net…
Hubert-Félix Thiéfaine and Nick Cave are two impersonators of the same djinn singing in two different languages at once
from my link log —
Alan Kay did not invent object-oriented programming.
https://www.hillelwayne.com/post/alan-kay/
saved 2025-05-11 https://
Large Language Models for Toxic Language Detection in Low-Resource Balkan Languages
Amel Muminovic, Amela Kadric Muminovic
https://arxiv.org/abs/2506.09992
Textual-Based vs. Thinging Machines Conceptual Modeling
Sabah Al-Fedaghi
https://arxiv.org/abs/2506.02646 https://arxiv.org/pdf/2506.…
[2025-06-17 Tue (UTC), 1 new article found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
[2025-06-17 Tue (UTC), 3 new articles found for cs.PL Programming Languages]
toXiv_bot_toot
Tone recognition in low-resource languages of North-East India: peeling the layers of SSL-based speech models
Parismita Gogoi, Sishir Kalita, Wendy Lalhminghlui, Viyazonuo Terhiija, Moakala Tzudir, Priyankoo Sarmah, S. R. M. Prasanna
https://arxiv.org/abs/2506.03606
Apple announces a new live translation feature across Messages, FaceTime, and Phone apps, but has not yet said how many languages will be supported (Rebecca Bellan/TechCrunch)
https://techcrunch.com/2025/06/09/appl…
Saturation Problems for Families of Automata
Le\'on Bohn, Yong Li, Christof L\"oding, Sven Schewe
https://arxiv.org/abs/2506.13197 https://…
from my link log —
EBCDIC is incompatible with GDPR.
https://shkspr.mobi/blog/2021/10/ebcdic-is-incompatible-with-gdpr/
saved 2024-10-28
Notes on applicative matching logic
Laurentiu Leustean
https://arxiv.org/abs/2506.10088 https://arxiv.org/pdf/2506.10088
Replaced article(s) found for cs.PL. https://arxiv.org/list/cs.PL/new
[1/1]:
QPanda3: A High-Performance Software-Hardware Collaborative Framework for Large-Scale Quantum-Cla...
Positive Varieties of Lattice Languages
Yusuke Inoue, Yuji Komatsu
https://arxiv.org/abs/2506.05824 https://arxiv.org/pdf/2506.05824
StacKAT: Infinite State Network Verification
Jules Jacobs, Nate Foster, Tobias Kapp\'e, Dexter Kozen, Lily Saada, Alexandra Silva, Jana Wagemaker
https://arxiv.org/abs/2506.13383
wikipedia_link: Wikipedia links (2016)
Networks of hyperlinks among articles on Wikipedia, for all available languages. A directed edge (i,j) indicates that article i hyperlinks to j.
This network has 83330 nodes and 2095962 edges.
Tags: Informational, Web graph, Unweighted
https://networks.skewed.de/n…
Replaced article(s) found for cs.FL. https://arxiv.org/list/cs.FL/new
[1/1]:
A complete formalization of Fermat's Last Theorem for regular primes in Lean
S2ST-Omni: An Efficient and Scalable Multilingual Speech-to-Speech Translation Framework via Seamlessly Speech-Text Alignment and Streaming Speech Decoder
Yu Pan, Yuguang Yang, Yanni Hu, Jianhao Ye, Xiang Zhang, Hongbin Zhou, Lei Ma, Jianjun Zhao
https://arxiv.org/abs/2506.11160
Read it in Two Steps: Translating Extremely Low-Resource Languages with Code-Augmented Grammar Books
Chen Zhang, Jiuheng Lin, Xiao Liu, Zekai Zhang, Yansong Feng
https://arxiv.org/abs/2506.01796
from my link log —
How to take the inverse of a type.
https://2022.ecoop.org/details/ecoop-2022-papers/6/How-to-Take-the-Inverse-of-a-Type
saved 2025-06-03
Beyond C/C : Probabilistic and LLM Methods for Next-Generation Software Reverse Engineering
Zhuo Zhuo, Xiangyu Zhang
https://arxiv.org/abs/2506.03504 http…
YouTube rolls out a tool to let some creators upload different thumbnails for each video dubbed into a different language, to help expand their global audience (Dan Whateley/Business Insider)
https://www.businessinsider.com/youtube-te
Syntactic Effectful Realizability in Higher-Order Logic
Liron Cohen (BGU), Ariel Grunfeld (BGU), Dominik Kirst (PICUBE), \'Etienne Miquey (I2M)
https://arxiv.org/abs/2506.09458
[2025-06-16 Mon (UTC), 3 new articles found for cs.PL Programming Languages]
toXiv_bot_toot
Minimality and computability of languages of G-shifts
Djamel Eddine Amir, Benjamin Hellouin de Menibus
https://arxiv.org/abs/2506.10610 https://
[2025-06-16 Mon (UTC), 1 new article found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
How Morgan Stanley is using its DevGen.AI tool, built in-house on OpenAI's GPT models, to translate legacy code into modern coding languages (Isabelle Bousquette/Wall Street Journal)
https://www.wsj.com/article…
This https://arxiv.org/abs/2503.19217 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
from my link log —
SKIM: The implementation of functional languages using custom hardware.
https://www.cl.cam.ac.uk/techreports/UCAM-CL-TR-81.html
saved 2025-03-22
The Emergence of Abstract Thought in Large Language Models Beyond Any Language
Yuxin Chen, Yiran Zhao, Yang Zhang, An Zhang, Kenji Kawaguchi, Shafiq Joty, Junnan Li, Tat-Seng Chua, Michael Qizhe Shieh, Wenxuan Zhang
https://arxiv.org/abs/2506.09890
Proceedings of the 23rd International Overture Workshop
Hugo Daniel Macedo, Ken Pierce
https://arxiv.org/abs/2506.08680 https://arxiv…
from my link log —
Wasm SpecTec has been adopted.
https://webassembly.org/news/2025-03-27-spectec/
saved 2025-03-28 https://
This https://arxiv.org/abs/2410.05460 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csPL_…
This https://arxiv.org/abs/2506.02943 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSE_…
[2025-06-13 Fri (UTC), 4 new articles found for cs.PL Programming Languages]
toXiv_bot_toot
Hazel Deriver: A Live Editor for Constructing Rule-Based Derivations
Zhiyao Zhong, Cyrus Omar
https://arxiv.org/abs/2506.10781 https://
Large Language Models for Multilingual Vulnerability Detection: How Far Are We?
Honglin Shu, Michael Fu, Junji Yu, Dong Wang, Chakkrit Tantithamthavorn, Junjie Chen, Yasutaka Kamei
https://arxiv.org/abs/2506.07503
[2025-06-13 Fri (UTC), 2 new articles found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
Choreographic Quick Changes: First-Class Location (Set) Polymorphism
Ashley Samuelson, Andrew K. Hirsch, Ethan Cecchetti
https://arxiv.org/abs/2506.10913 h…
[2025-06-12 Thu (UTC), no new articles found for cs.PL Programming Languages]
toXiv_bot_toot
[2025-06-12 Thu (UTC), no new articles found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
Using Code Snippets to Teach Programming Languages
Joshua Akingbade, Jianhua Yang, Mir Seyedebrahimi
https://arxiv.org/abs/2506.00404 https://
[2025-06-11 Wed (UTC), no new articles found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
[2025-06-11 Wed (UTC), 3 new articles found for cs.PL Programming Languages]
toXiv_bot_toot
Gradual Metaprogramming
Tianyu Chen, Darshal Shetty, Jeremy G. Siek, Chao-Hong Chen, Weixi Ma, Arnaud Venet, Rocky Liu
https://arxiv.org/abs/2506.09043 htt…
[2025-06-10 Tue (UTC), no new articles found for cs.FL Formal Languages and Automata Theory]
toXiv_bot_toot
Verification of the Release-Acquire Semantics
Parosh Abdulla, Elli Anastasiadi, Mohamed Faouzi Atig, Samuel Grahn
https://arxiv.org/abs/2506.08238 https://…
[2025-06-10 Tue (UTC), 3 new articles found for cs.PL Programming Languages]
toXiv_bot_toot
This https://arxiv.org/abs/2505.16764 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csPL_…
[2025-06-09 Mon (UTC), 3 new articles found for cs.PL Programming Languages]
#toXiv_bot_toot #toXiv_bot_new_article_summary_toot