
2025-06-11 20:42:03
from my link log —
ai.robots.txt: A list of AI agents and robots to block.
https://github.com/ai-robots-txt/ai.robots.txt
saved 2025-01-16
Writing code that ignores robots.txt is a professional ethics violation.
This is a toot about #AI
Is there anything I can put in robots.txt that will stop Scrapy?
Failing that, let’s take the ship up and nuke the site from orbit. It’s the only way to be sure.
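A rough sketch of what robots.txt can and cannot do here, assuming the crawler still sends Scrapy's default user agent (which starts with `Scrapy/`) and has been configured to obey robots.txt (Scrapy's `ROBOTSTXT_OBEY` setting). The rules and the `is_allowed` helper below are illustrative, and the check uses Python's standard-library parser rather than Scrapy's own, so treat it as an approximation of how a compliant crawler would read the file:

```python
from urllib.robotparser import RobotFileParser

# Hypothetical robots.txt: turn away anything identifying as Scrapy,
# allow everyone else.
ROBOTS_TXT = """\
User-agent: Scrapy
Disallow: /

User-agent: *
Allow: /
"""


def is_allowed(user_agent: str, url: str, robots_txt: str = ROBOTS_TXT) -> bool:
    """Return True if a crawler that honours robots_txt may fetch url."""
    parser = RobotFileParser()
    parser.parse(robots_txt.splitlines())
    return parser.can_fetch(user_agent, url)


if __name__ == "__main__":
    # Default-style Scrapy user agent versus an ordinary browser string.
    print(is_allowed("Scrapy/2.11 (+https://scrapy.org)", "https://example.com/page"))
    print(is_allowed("Mozilla/5.0", "https://example.com/page"))
```

This only helps against crawlers that choose to comply. Anyone who disables the robots.txt check or changes the user agent string passes straight through, so real enforcement has to happen server-side.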
Fsck GMail!
@ IN TXT "v=spf1 all"
I just discovered TXT... feels all kinds of uplifting for a Monday morning :)
#TomorrowXTogether #KPop
From HOSTS.TXT to Modern Internet Infrastructure
🌐 #hoststxt
There is now also a CBET about the new interstellar #comet 3I/ATLAS: http://www.cbat.eps.harvard.edu/iau/cbet/005500/CBET005578.txt - it comes with an even more precise orbit based on astrometry back to 5 June and predicts 13th magnitude with 60° elongation after perihelion in November. The current magnitude is about 17.7.
Control How Your Content Is Used for AI Training With Cloudflare (Cloudflare Blog, 1 July 2024)
#MediaLit
🥱
The day SHALL start.
Regards not given,
RFC2119
https://ietf.org/rfc/rfc2119.txt
Scrapers selectively respect robots.txt directives: evidence from a large-scale empirical study
Taein Kim, Karstan Bock, Claire Luo, Amanda Liswood, Emily Wenger
https://arxiv.org/abs/2505.21733
📝🗃️ 𝗿𝗱𝗼𝗰𝗱𝘂𝗺𝗽: Dump ‘R’ Package Source, Documentation, and Vignettes into One File for use in LLMs #rstats #LLM is on CRAN https://www.ekotov.pro/rdocdum…
"DOS users will run into trouble otherwise, so please keep file names to a combination of 8 uppercase letters, _, and digits (8.3 format)." -- README~1.TXT
Today, I got notified about spamhaus not responding anymore to requests from our mailserver due to using an "open resolver".
Huh?
I found the command `dig +short test.openresolver.com TXT @<ip_of_dns_server_to_test>` to test whether my DNS server is deemed an open resolver. And yes, the mailserver uses a DNS server that got recognized as an open resolver.
Out of curiosity, I tried the same in my local network where I have a dnsmasq serving DHCP and DNS for my cli…
Been designing distributed counters for NATS. Pretty happy with this.
50k/second unoptimised and on a single counter - but we will support aggregation of regional to global etc.
Hard dist sys problems made trivial to use and operate 💪💪
https://gist.github.com/ripienaar/d95d