Tootfinder

Opt-in global Mastodon full text search. Join the index!

@mgorny@social.treehouse.systems
2026-03-29 02:24:49

Finally a package where I can't suspect the author of vibecoding!
"""
Version: 2026.3.32
Fixed segfault.
Version: 2026.3.31
Fixed bug again.
Version: 2026.3.30
Fixed bug.
"""
github.com/mrabarnett/mrab-reg
(It's also a horribly bad package, with tons of unmaintainable code, heavily relying on CPython internals.)

@kubikpixel@chaos.social
2026-04-27 06:20:13

Vidar Malware Conceals Payloads in JPEG, TXT Files to Evade Detection
Vidar has evolved from a basic Arkei-based credential stealer into a multi-stage, stealth-focused infostealer that now hides second‑stage payloads within JPEG and TXT files to evade modern defenses.
🔓 gbhackers.com/vidar-malware-co

@jackie@social.linux.pizza
2026-03-28 03:23:54

I'm now blocking AI scrapers on my server using robots.txt and user agent rules
microblog.jackiejude.me/posts/

@danyork@mastodon.social
2026-05-27 11:12:35

@DNSresolver
paris.now.weather.dyn.bortzmeyer.fr TXT

@aredridel@kolektiva.social
2026-04-26 21:37:23

One thing I don't see anyone talking about that we probably should is the proliferation of captcha-busting anubis-busting browser-as-a-service services.
It's not that the big model companies are scraping the web and ignoring robots.txt. (Some are, almost certainly, but there are datasets to train on already and they're not scraping random sites so much)
It's that agent _users_ and the people serving them have a very large demand to access information with semi-automated systems. And they're building whole armies of ways around blocking.

@danyork@mastodon.social
2026-05-27 10:43:18

@DNSresolver
montreal.now.weather.dyn.bortzmeyer.fr TXT

If I was as bad at my job as Lindsey Graham is – I would be fired and children would die.
I’m Dr. Annie Andrews – a mom of 3, a pediatrician, and I’m running for US Senate in South Carolina to send far-right Trump apologist Lindsey Graham packing.
If you’re with me, chip in here: secure.actblue.c…

@mxp@mastodon.acm.org
2026-03-17 20:55:09

I’ve been using idioms like
find . -name \*.foo | while read f; do mv $f ${f%%.foo}.bar; done
for decades. Now I’m preparing an exercise where students are to rename all files that do not end in .txt; so something like this should do the trick:
find . -not -name \*.txt | while read f; do mv $f ${f%%.*}.txt; done
Except... ${f%%.*} is empty! ${f%.*} works, but why!?
Finding the obvious took me an hour… And why do I only stumble upon such things when preparing exe…

‪@mxp@mastodon.acm.org‬
2026-03-17 20:55:09

I’ve been using idioms like
find . -name \*.foo | while read f; do mv $f ${f%%.foo}.bar; done
for decades. Now I’m preparing an exercise where students are to rename all files that do not end in .txt; so something like this should do the trick:
find . -not -name \*.txt | while read f; do mv $f ${f%%.*}.txt; done
Except... ${f%%.*} is empty! ${f%.*} works, but why!?
Finding the obvious took me an hour… And why do I only stumble upon such things when preparing exe…

@mxp@mastodon.acm.org‬
2026-03-17 20:55:09

I’ve been using idioms like
find . -name \*.foo | while read f; do mv $f ${f%%.foo}.bar; done
for decades. Now I’m preparing an exercise where students are to rename all files that do not end in .txt; so something like this should do the trick:
find . -not -name \*.txt | while read f; do mv $f ${f%%.*}.txt; done
Except... ${f%%.*} is empty! ${f%.*} works, but why!?
Finding the obvious took me an hour… And why do I only stumble upon such things when preparing exe…

@danyork@mastodon.social
2026-05-27 11:18:10

@…
montreal.now.weather.dyn.bortzmeyer.fr TXT

@usul@piaille.fr
2026-03-18 10:39:40

Too bad these didn't make it market
m88k.com/orig/misc/next88.txt

@ruario@vivaldi.net
2026-04-16 07:05:46

@… The last security updates to Slackware 15 were released 2 days ago.
ftp.osuosl.org/pub/slackware/s

@inthehands@hachyderm.io
2026-05-20 16:19:34

OK, a •lot• of replies need this reponse:
Yes, of •course• they will start ignoring robots.txt etc as soon as they think it hurts their business. Of course.
It is important to •force that fight•, rather than just capitulating in advance.

@vosje62@mastodon.nl
2026-03-21 10:53:59

Ik kom het niet vaak tegen in mn mail, maar zo'n melding is een van de redenen dat ik ooit voor FairEmail koos.
De email zelf (foto 2, txt only, want ik wil geen ellende) ziet er onschuldig genoeg uit.
(En ik weet t van het bedrijf. Groot en heel ver doorgeautomatiseerd, dus niet te vermijden...)
#FairEmail

@bammerlaan@mastodon.nl
2026-03-22 21:26:37

@… First up is changing the SD partition table to #GPT. It's more modern for a reason, I say. I dislike using anything called "msdos" in 2026.
So here I am breaking open the img file, copying the contents over and updating cmdline.txt and fstab. 🙈 Here's hoping it'll boot.

@acka47@openbiblio.social
2026-05-20 15:19:57

Im Vortrag "Umarme deinen Feind: Datenbestände von Kulturerbeeinrichtungen für AI-Algorithmen öffnen, ohne die Kontrolle zu verlieren" sagt @… , das System der robots.txt sei tot und für die aktuellen Herausforderungen im Hinblick auf massenhafte Abfragen durch KI-Bots gescheitert.
Das stimmt wohl leider, und der einzige Grund dafür i…

@danyork@mastodon.social
2026-05-26 12:08:41

@DNSresolver
montreal.now.weather.dyn.bortzmeyer.fr TXT

@joe@toot.works
2026-04-18 15:34:19

I saw the “Turning "cat readme.txt" into arbitrary code execution in iTerm2” article and my first thought was that I am glad that I switched to Prompt 3. 😏

@aral@mastodon.ar.al
2026-05-06 09:21:33

Scientifically speaking, what’s the furthest you could be told to fuck right off? @… thereisno.computer/@yhancik/st

@grahamperrin@bsd.cafe
2026-05-04 17:40:01

RE: chaos.social/@txt_file/1165037
@…
NB the issue with FreeBSD-base (currently pinned at <

@inthehands@hachyderm.io
2026-05-20 15:29:52

Going with meta noindex for now. My thinking is that this actively tells Google to yank already-crawled content from their index, whereas they might take a robots.txt entry to mean “do not update, but keep showing last fetched.”

@kineticdiplomacy@infosec.exchange
2026-04-22 09:45:03

Sadly, the time has come to retire the security.txt from my website. It is only used by l33t haxxors who think they can use ai tools to find amazing bugs in my site, but only if i have a bug bounty program. Yet another net positive is turned into a net negative thanks to LLM's.
#personalwebsite #smallweb

@cosmos4u@scicomm.xyz
2026-03-04 23:25:41

So the new #Kreutz #comet #MAPS is *still* following the constant rapid rise in brightness it has shown since discovery: a dumb extrapolation - cobs.si/analysis/?comet=2688&f - has it get 10,000-times brighter than the Sun at its extremely close perihelion which makes so sense at all, of course, physically.
"It must therefore be assumed that this increase in activity will level off significantly in the near future," writes fg-kometen.vdsastro.de/koj_202: "More likely are parameters m m0=12.0 mag / n=4 (or even lower), which would still result in a (very short-term) maximum brightness of about –9 mag (but this would probably still be significantly too bright) – always assuming that the comet survives its perihelion passage unscathed."
For other views see cbat.eps.harvard.edu/iau/cbet/ and arxiv.org/abs/2602.17626 and facebook.com/photo?fbid=102365 and cometografia.es/cometa-kreutz- - and the actual brightness is tracked at cobs.si/obs_list?id=2688 where it has reached ~11.5 mag. now.

@Adam@social.lein.us
2026-05-15 16:39:52

@… Are any hosting providers in your hall of fame located in North America? Or do I have to click each one to try to figure it out? internet.nl/halloffame/hosters

@lil5@social.linux.pizza
2026-05-04 16:15:39

isitagentready.com/
“Is your site agent ready?”
Looks like I need to add some more to my robot.txt
contentsignals.org/

@inthehands@hachyderm.io
2026-05-20 14:52:50

Quick strategy discussion, for those who understand Google indexing and SEO:
If I want to yank a web site out of Google’s now-fully-extractive search, should I (1) disallow googlebot in robots.txt or (2) add `<meta name="googlebot" content="noindex">` to all the page headers?
The goal here is not just to remove my contributions to the commons from Google’s results, but to •make Google aware• that sites are pulling consent. What will best do that?
2/2

@ruario@vivaldi.net
2026-05-16 06:36:47

@… Yeah I see he updated 15.0 as well so I should probably upgrade… again. 😆
ftp.osuosl.org/pub/slackware/s

@cdamian@rls.social
2026-05-04 16:01:07

Yesterday I got an alert that my disk space on my Mastodon instance server was running out.
All the media seemed to suddenly be refreshed and at a very fast rate.
It took a while to figure out it was the Meta AI scraper hitting the instance quite a lot.
I finally managed to block it in Caddy abs and robots.txt together with some other AI scraper.
#mastodon

graph of data on disk growth by time going from 2.5 MB/s growth to below 200 K/s.
@waidler@bayerwald.social
2026-05-06 05:56:12

Mein Dialog mit der KI:
Es ist nicht wirklich wichtig. aber ich wollte es verstehen. Ich verstehe es aber nicht.
07:52
Claude hat geantwortet:
Ich auch nicht — und das ist die ehrliche Antwort. Wir haben jeden Layer systematisch ausgeschlossen:
nginx ✅ identisch
Ghost ✅ identisch
Firewall ✅ kein IP-Blocking
robots.txt ✅ identisch
og:image Tags ✅ korrekt und abrufbar
Security-Header ✅ identisch
Ghost-Logs …

@dawid@social.craftknight.com
2026-05-06 06:29:58
@… Chciałbym zobaczyć jak budujesz nowoczesną platforme ecommerce czy elerningową i sprzedajesz ją klientowi na zwykłym pliku txt ;)

Ja wiem, że swojego bloga, to i w gemini mogę pisać, ale reszta świata nie podziela tego minimalizmu.