
2025-10-19 01:43:01
U 00A0: NON-BINARY SPACE
#Unicode
U 00A0: NON-BINARY SPACE
#Unicode
Breaking the Baryon Density$\unicode{x2013}$Hubble Constant Degeneracy in Fast Radio Burst Applications with Associated Gravitational Waves
Joscha N. Jahns-Schindler, Laura G. Spitler
https://arxiv.org/abs/2508.14434
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
Landmark-Assisted Monte Carlo Planning
David H. Chan, Mark Roberts, Dana S. Nau
https://arxiv.org/abs/2508.11493 https://arxiv.org/pdf/2508.11493
Do you want to stop putting dots or some random stuff in forms to skip required fields? The website is too smart and rejects all #Unicode spaces?
You can always use a U 200E Left-To-Right Mark.
Why Data Anonymization Has Not Taken Off
Matthew J. Schneider, James Bailie, Dawn Iacobucci
https://arxiv.org/abs/2509.10165 https://arxiv.org/pdf/2509.101…
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
Interesting… 🤔 "ICU4X - Solving i18n for client-side and resource-constrained environments" https://icu4x.unicode.org/
> Why ICU4X?
> Small and fast
> ICU4X floats like a butterfly and stings like a bee
😅🦋🐝
New probes of nuclear gluon dynamics through photoproduction of charm in inelastic ultra-peripheral Pb$\unicode{x2014}$Pb collisions with ALICE
Sigurd Nese (for the ALICE Collaboration)
https://arxiv.org/abs/2509.11814
From virtual Z gates to virtual Z pulses
Christopher K. Long, Crispin H. W. Barnes
https://arxiv.org/abs/2509.13453 https://arxiv.org/pdf/2509.13453…
This list of Unicode character name errata feels like back-matter for some SF novel
https://www.unicode.org/notes/tn27/
Via @…
the unicode standard was a little slow to catch on to the dangers of overlong UTF-8
https://lobste.rs/s/mlbsfi/utf_8_is_brilliant_design#c_eg9xzn
Three small announcements:
1. RFC 9839, a guide to which Unicode characters you should never use: https://www.rfc-editor.org/rfc/rfc9839.html
2. Blog piece with background and context, “RFC 9839 and Bad Unicode”:
All that structure matches does not glitter
Maya M. Martirossyan, Thomas Egg, Philipp Hoellmer, George Karypis, Mark Transtrum, Adrian Roitberg, Mingjie Liu, Richard G. Hennig, Ellad B. Tadmor, Stefano Martiniani
https://arxiv.org/abs/2509.12178
A few days back I tweeted and blogged about the brand-new RFC 9839, and also published the first draft of tiny Go library to help enforce the subsets defined in the RFC. Got lots of useful input on the library and have progressed it enough to do a v0.8.0 release: https://github.com/timbray/rfc9839…
Honestly, #emoji and icons in #Unicode are a true horror.
Yeah, sure. It's great that you don't have to use <img/> anymore and you can just paste a random Unicode character. You can get graphics into fields where only text was originally intended (like bug summaries). Even better, you can now easily get cool colorful icons on terminal with almost no effort.
However, it is an #accessibility nightmare. People are now encoding *information* in random graphical symbols. Symbols that require huge fonts to render, or huge character tables to describe.
Yeah, a bare <img/> carrying information sucks. However, you can add a *meaningful* alt-text to the image, and accessibility tools can use that text to provide meaningful context. Like "bug fix".
However, emojis and icons are symbolic. The best you can get is some description like "hammer and wrench", so people can kinda figure out that it's probably a "bug fix". Or maybe it was a "maintenance task"? Or you'll get a "unknown character 0x1F6E0". And I'm sure people will surely enjoy cross-referencing a "legend" of such "unknown characters".
from my link log —
libu8ident: Unicode security guidelines for programming language identifiers.
https://github.com/rurban/libu8ident
saved 2025-02-13
Unicode folks: why are there emoji for "spoon" and combined "fork knife" but no standalone fork and knife?
Non-programmers Assessing AI-Generated Code: A Case Study of Business Users Analyzing Data
Yuvraj Virk, Dongyu Liu
https://arxiv.org/abs/2508.06484 https://
Computable Bounds for Strong Approximations with Applications
Haoyu Ye, Morgane Austern
https://arxiv.org/abs/2508.03833 https://arxiv.org/pdf/2508.03833…
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
Taylor$\unicode{x2013}$Aris dispersion of active particles in oscillatory channel flow
Bohan Wang, Weiquan Jiang, Li Zeng, Zi Wu, Ping Wang
https://arxiv.org/abs/2507.18241 http…
Evidence of Titanate Clouds in the Day-side Atmosphere of the Ultra-Hot Jupiter WASP-121b
Suman Saha, James S. Jenkins
https://arxiv.org/abs/2508.20022 https://
Inverse Optimal Feedback and Gain Margins for Unicycle Stabilization
Kwang Hak Kim, Velimir Todorovski, Miroslav Krsti\'c
https://arxiv.org/abs/2509.25563 https://
"Wie willst du das Herz haben? <3 oder Anders?"
"Unicode!"
"Was ist das?"
#wampleaks
The Five Safes as a Privacy Context
James Bailie, Ruobin Gong
https://arxiv.org/abs/2510.05803 https://arxiv.org/pdf/2510.05803
Imperceptible Jailbreaking against Large Language Models
Kuofeng Gao, Yiming Li, Chao Du, Xin Wang, Xingjun Ma, Shu-Tao Xia, Tianyu Pang
https://arxiv.org/abs/2510.05025 https:/…
Gamma-Convergence of Convex Functions, Conjugates, and Subdifferentials
Rafael Correa, Pedro P\'erez-Aros, Jos\'e Pablo Santander
https://arxiv.org/abs/2509.21863 https:…
The charge-singlet measurement toolbox
Abhijit Chakraborty, Randy Lewis, Christine A. Muschik
https://arxiv.org/abs/2510.08718 https://arxiv.org/pdf/2510.0…
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
And… the bundle size for JS emoji pickers will increase for every site… every year… 📈🤷♂️
https://mastodon.world/@emojipedia/115174595873838133
Thermodynamics in a split Hilbert space: Quantum impurity at the edge of a one-dimensional superconductor
Pradip Kattel, Abay Zhakenov, Natan Andrei
https://arxiv.org/abs/2508.19330
When I highlight text in #Confluence and copy it, my clipboard manager warns me that the clipping is in the double digit MBs (yes, with an M! 42 in this case.).
`osascript-e 'clipboard info’`
«class weba», 42079409, «class RTF », 650, «class HTML», 867, «class utf8», 244, «class ut16», 490, string, 244, Unicode text, 488
The heck is it copying text in a weba forma…
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
The Unicode character 🗿 (U 1F5FF) is named "Moyai" which I thought it was typo mistake for "Moai" which are the stone statues in Easter Island, Chile.
Turns "Moyai" are statues in Niijima, Japan which were inspired on the ones from Easter Island.
This makes me a little disappointed but it makes me very happy that the Japanese like our statues.
#chile
Euclidean Distance Deflation Under High-Dimensional Heteroskedastic Noise
Keyi Li, Yuval Kluger, Boris Landa
https://arxiv.org/abs/2507.18520 https://arxiv…
Optimizing sparse quantum state preparation with measurement and feedforward
Yao-Cheng Lu, Han-Hsuan Lin
https://arxiv.org/abs/2508.21346 https://arxiv.org…
Unveiling Unicode's Unseen Underpinnings in Undermining Authorship Attribution
Robert Dilworth
https://arxiv.org/abs/2508.15840 https://arxiv.org/pdf/2…
from my link log —
The modern text rendering pipeline: unicode, bidi, segmentation, shaping, …
https://www.newroadoldway.com/text1.html
saved 2025-06-24
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
Replaced article(s) found for math.MG. https://arxiv.org/list/math.MG/new
[1/1]:
- A note on Erd\H{o}s matrices and Marcus\unicode{x2013}Ree inequality
Aman Kushwaha, Raghavendra Tripathi
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
The Double-edged Sword of LLM-based Data Reconstruction: Understanding and Mitigating Contextual Vulnerability in Word-level Differential Privacy Text Sanitization
Stephen Meisenbacher, Alexandra Klymenko, Andreea-Elena Bodea, Florian Matthes
https://arxiv.org/abs/2508.18976
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted
unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted