Tootfinder

Opt-in global Mastodon full text search. Join the index!

@veit@mastodon.social
2025-12-21 14:25:20

I took a look at the changes coming with Python 3.15 – and I can’t wait to put them to productive use. I’ve already updated our tutorials:
• utf-8 as the default encoding: python-basics-tutorial.readthe

@netzschleuder@social.skewed.de
2026-02-20 22:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@timbray@cosocial.ca
2026-01-14 23:53:49

In which I build Unicode character tables and fail to serialize large automata, “Losing 1½ Million Lines of Go“:
tbray.org/ongoing/When/202x/20
(And in which I find myself sliding into the

Confession: My title is clickbait-y, this is really about building on the Unicode Character Database to support character-property regexp features in Quamina. Just halfway there, I’d already got to 775K lines of generated code so I abandoned that particular approach. Thus, this is about (among other things) avoiding those 1½M lines. And really only of interest to people whose pedantry includes some combination of Unicode, Go programming, and automaton wrangling. Oh, and GenAI, which (*gasp*) I …
@netzschleuder@social.skewed.de
2026-01-22 11:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@fanf@mendeddrum.org
2026-02-16 09:42:02

from my link log —
Unicode TOFU: detect changes to strings over time that might be spoofing.
github.com/begriffs/utofu
saved 2019-03-17

@keithp@fosstodon.org
2025-12-20 04:22:46

I received a bug report about picolibc's iconv implementation. So I added a test case. Which pushed the CI runtime up over *six hours* at which point github gave up and killed it.
I spent a bit of time yesterday and today re-implementing the JIS to Unicode conversion function to make it faster. Lines changed: 5125 additions & 7495 deletions. Runtime is back to normal now. And I have a deeper understanding of these encodings.

@netzschleuder@social.skewed.de
2025-12-20 20:00:03

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@whitequark@mastodon.social
2025-12-12 01:31:51

Go / Unicode folks: any idea what character set does the IDNA2008 implementation in Go with StrictDomainName=false actually accepts?
github.com/golang/go/issues/76
I think this is actually somewhat security-relevant

@frankstohl@mastodon.social
2026-01-09 07:42:18

Wir die Gurke die Aubergine ersetzen? #emojie mobiflip.de/diese-emojis-solle

@GroupNebula563@mastodon.social
2025-11-24 16:09:46

@… #AskFedi I need a regex (for Discord’s AutoMod) that prevents any text in all Unicode “fonts” from being sent. By “Unicode fonts” I mean things like these:

@eana@s.1a23.studio
2025-11-25 19:30:00

New toy: Font coverage analyzer (alpha)
blueset.github.io/font-coverage/

Screenshot of Font Coverage Analyzer, on font CLFN 24x CN Regular, showing coverage of various Unicode Blocks
A list of coverage checks:

Unicode Blocks























Unicode Script Extensions














China Encodings



China Foundries



China Standards










Japan Encodings




Japan JIS




Japan Kyoiku Kanji






Japan Kanken












Japan Joyo Kanji


Taiwan Encodings






Taiwan Standards




Hong Kong









Korean Encodings


Adobe Latin





Adobe Greek


Adobe Cyrillic



Adobe CMAP





GF Latin







GF Greek



GF Cyrillic




GF Arabic


GF Phonetics



GF Tran…
@fell@ma.fellr.net
2025-11-26 11:15:55

RE: #Unicode is a wonderful thing.

@netzschleuder@social.skewed.de
2026-01-17 18:00:05

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@detondev@social.linux.pizza
2025-12-04 12:20:15

𓁿 𓉩 𓊲 𓀬 𓇽 𓅽 𓎩 𓇇
when i was 12 i fantasized abt making a roguelike using hieroglyph unicode instead of ascii, thinking itd lean into some sorta farming lifesim thing, or go all in on depicting a journey thru death as close to the mythology as wasnt boring, or put the two together for a transition after u die. i havent been super tapped into the roguelike scene like i used to be, does anything remotely like this exist?
𓈋𓈋𓈋𓈏𓈋𓈋
𓈖𓈄𓈏𓇾𓉐𓈋
𓈖𓊛𓀛𓇿𓇿𓇿
𓈖𓈘𓈘𓈘𓈘𓈘
𓈄𓇇𓇿𓈋𓈋𓈏

@fanf@mendeddrum.org
2026-01-08 09:42:04

from my link log —
Why the trans flag emoji uses a sequence of 5 unicode codepoints.
hecate.pink/blog/2026/trans-fl
saved 2026-01-07

@netzschleuder@social.skewed.de
2025-12-14 22:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@imaginaryrobots@social.linux.pizza
2025-12-05 02:04:34

I'm writing a #befunge interpreter in #rust (for fun) , and I've decided to make it unicode compliant. That means that emojis will be fair game in the funge source code, and I think those programs are going to be amazing to look at!

@netzschleuder@social.skewed.de
2026-02-14 15:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@fanf@mendeddrum.org
2025-12-01 21:42:01

from my link log —
Locales, encodings, Unicode, and C .
open-std.org/jtc1/sc22/wg21/do
saved 2022-04-18

@netzschleuder@social.skewed.de
2025-12-11 02:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@netzschleuder@social.skewed.de
2025-12-06 09:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@netzschleuder@social.skewed.de
2026-01-04 09:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@netzschleuder@social.skewed.de
2026-01-30 11:00:05

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@netzschleuder@social.skewed.de
2025-12-28 11:00:04

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang
@netzschleuder@social.skewed.de
2026-01-24 00:00:05

unicodelang: Languages spoken by country (2015)
A bipartite network of languages and the countries in which they are spoken, as estimated by Unicode. Edges are weighted by the proportion of the given country's population that is literate in a particular language.
This network has 868 nodes and 1255 edges.
Tags: Informational, Relatedness, Weighted

unicodelang: Languages spoken by country (2015). 868 nodes, 1255 edges. https://networks.skewed.de/net/unicodelang