2026-02-14 11:02:44
about the war against the Internet Archive:
🔁 https://www.techdirt.com/2026/02/13/news-publishers-are-now-blocking-the-internet-archive-and-we-may-all-regret-it/
The Internet Archive's (@…) interlibrary loan works
https://archivalia.hypotheses.org/252272
A look at the Aadam Jacobs Collection, which is adding its 10,000 concert recordings to the Internet Archive; only one or two artists have requested takedowns (Christopher Weber/Associated Press)
https://apnews.com/article/aadam-jacob
News publishers limit Internet Archive access due to AI scraping concerns | Nieman Journalism Lab https://www.niemanlab.org/2026/01/news-publishers-limit-internet-archive-access-due-to-ai-scraping-concerns/
Originality AI: 23 major news websites and Reddit block the Internet Archive's crawler; journalists and advocacy groups sign a letter supporting the Archive (Kate Knibbs/Wired)
https://www.wired.com/story/the-internets-most-powerful-archiving-to…
The "Handbook of Upper Canadian Chronology" is simultaneously the most boringly titled book - though accurate, it's a super boring book - and one of the most useful for my current project. It's basically just a list of officeholders, regulations, laws, geographical divisions, etc., etc. But gloriously helpful. Thank you to my forebear, the historian Frederick H. Armstrong, for making the lives of the handful of us who work on early 19th-century Ontario so much easier.
Aadam Jacobs has secretly recorded over 10,000 local concerts since 1989.
Now, they are cleaned up and ready to listen to for free online:
https://archive.org/details/@aadam_jacobs_collection
Originality AI: 23 major news websites and Reddit currently block the Internet Archive's crawler; journalists and advocacy groups sign a letter backing the IA (Kate Knibbs/Wired)
https://www.wired.com/story/the-internets-most-powerful-archiving-…
Does anyone ever feel nostalgia regarding PDAs? As a kid, I first got an old Palm and then a Pocket Loox 720 (with Windows Mobile 2003SE) as a hand-me-down from my dad.
I have very fond memories of browsing through the depths of the Internet trying to find some cool new program. Most notably though, there was a port of Android you could boot from a CF or SD card, and it was the coolest thing ever:
Here, for once, is some positive news about online archives from Switzerland, but once again also about AI:
(The term AI is a broad label for automation)
"Internet Archive gets a location in Switzerland:
Internet Archive Switzerland began operations on May 5, 2026. The foundation aims to preserve Swiss content, archive AI models, and offer endangered documents a digital refuge."
🗃️
If you didn't know about this collection of shows, check it out: the Aadam Jacobs collection at the Internet Archive. Seeing that he was Chicago-based, I took the chance to search and sure enough, he has a few Troubled Hubble shows. Amazing. https://archive.org/details/@aadam_jac
Internet Archive: Aadam Jacobs Collection
No tapes left behind
#bootlegme
https://archive.org/details/@aadam_jacobs_collection
I donated to the Internet Archive today.
They do valuable work and are a huge resource for humanity.
https://archive.org/
"The Internet Archive, a non-profit, is building a digital library of Internet sites and other cultural artifacts in digital form. Like a paper library, we provide free access to rese…
The AI Hard Drive Shortage Is Making It More Expensive and Harder to Archive the Internet https://www.404media.co/the-ai-hard-drive-shortage-is-making-it-more-expensive-and-harder-to-archive-the-internet/
Claude Code is good at doing research!
In this case, helping find the disposition of 126,000 digitized US Supreme Court dockets (cert denied or full opinion), and then reporting why in an archive.org review (using it like a Wikipedia discussion page).
This took real hand-holding and QA, to be sure, but it is super helpful. It looked at CourtListener, the Supreme Court site (and the old one via the Wayback Machine!), and the Caselaw Access Project at Harvard. So good.
from my link log —
Apple Scorpius CPU architectural specification. (1989)
https://archive.org/details/scorpius_architecture
saved 2019-12-29 https:/…
Does anyone remember this?
Little Computer People, by Activision (1985).
#gaming
What LLMs and the Turing test¹ tell us: most of us not only are² stochastic parrots³ but are also fine with that – otherwise we would not happily use LLMs to produce all the output we communicate to others.
One could frame this as “insult to humanity” but I prefer to call it telling.
__
¹the Turing test does _not_ measure “intelligence”. I recommend reading the original paper:
Did you miss one of the Web414 meetings in 2007? Looks like we recorded them!
#mke
As so often, the Internet Archive has come to save the day.
At its newly opened Aadam Jacobs Archive, you can now listen to nearly 2,500 of the concert recordings that volunteers have digitized and uploaded so far.
In that more than a terabyte of files, you’ll find concerts by Nirvana, Phish, Tracy Chapman, Depeche Mode, Flaming Lips, Stereolab, Liz Phair, Sonic Youth, Nick Cave and the Bad Seeds, Björk, They Might Be Giants (recorded four times in 1988 alone), and the Mekons, a…
us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agency websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…
gentlemen of a certain age will perhaps appreciate the first issue of dirt, launched in 1992 as sassy magazine's teen boy-aimed spinoff. you know, with crispin glover on the cover. https://archive.org/details/dirt-issue-1-spring-1992/
EFF Tells Publishers: Blocking the Internet Archive Won't Stop AI, But It Will Erase The Historical Record - Slashdot
https://yro.slashdot.org/story/26/03/21/0649247/eff-tells-publishers-blocking-the-internet-archive-wont-stop-ai-but-it-will-erase-the-historical-record
Back when iX was still a special publication of c't: #vintagecomputing
The fact that so many, including Wikipedia, rely on a shady website that (unsurprisingly?) has been doing some really shady stuff recently shines a light on major shortcomings of newspaper websites: they want your data to share it with 900 "partners", are nevertheless full of ads, and every redesign breaks deep links to their articles.
Like Napster/Limewire before, archive.today provides a service that solves an actual need of many Internet users.
Just released a small but useful update for my little Internet Archive Plugin for Craft CMS. 🏛️
You can now opt individual entries out of automatic archiving via a lightswitch field – and manually send any entry to the Wayback Machine straight from the Craft CP entries index. ✨
https://github.com/matthiasott/cr…
Aadam Jacobs took a recorder to a #Nirvana concert in 1989. Since then he has well and truly earned his nickname "Taper Guy". By now he has taped more than 10,000 concerts. Now the recordings are being digitized and at …
This is an absolute gold mine for music fans, especially us #genx folks. After listening to pre-Grohl Nirvana, I'm now listening to Tracy Chapman from 1988. Thank you Aadam Jacobs and the @… !!
I really like Google Search’s stated mission to “organize the world's information and make it universally accessible and useful” but I’m curious what organizations do y’all think are most ideally invested in that mission?
Wikipedia seems like an obvious one. Internet Archive seems like another. Any others?
What in the hell am I looking at 🧐
https://toot.community/@internetarchiveeurope/116340086494523511
Web archiving is troubled: The Internet Archive is under attack constantly, and archive.is/archive.today seems to be on the way towards turning itself and its users into personae non gratae on the web.
Is it now time to turn web archiving into a distributed affair where obtaining the snapshot happens while browsing, and produces a witness statement a la "at this point in time, I observed these responses to those requests", and storing/accessing those becomes decoupled from ob…
An analysis of Internet Archive data finds that by mid-2025, ~35% of new websites published since ChatGPT's launch in late 2022 were AI-generated or AI-assisted (Matthew Gault/404 Media)
https://www.404media.co/study-finds-a-third-of-new-websites-are-ai-gene…
Always great to see how internet corporations protect our data …
Homeland Security Wants Social Media Sites to Expose Anti-ICE Accounts
https://archive.is/mqXsZ
Apple only has keynotes/events from 2007 onwards archived on their website (as video podcast episodes).
Is there an ideally complete and high quality source for keynotes/events from 1998–2006?
(Yes, I know some are on the Internet Archive and YouTube, but they’re incomplete and often extremely low quality.)
It looks like the Internet Archive @… is down at the moment.
#InternetArchive
A voyage towards the South Pole performed in the years 1822–24. Containing an examination of the Antarctic Sea, to the seventy-fourth degree of latitude; and a visit to Tierra del Fuego, with a particular account of the inhabitants. To which is added, much useful information on the coasting navigation of Cape Horn, and the adjacent lands.
htt…
An analysis of Internet Archive data: by mid-2025, ~35% of newly published websites since ChatGPT's launch in November 2022 were AI-generated or AI-assisted (Matthew Gault/404 Media)
https://www.404media.co/study-finds-a-third-of-new-websites-are-ai-generate…
"The art that teaches us to treat the foodstuffs which nature lavishly offers us in such a way that a healthy and tasty dish is prepared from them is by no means easy; it can never be achieved without thought, care, and effort."
From: Sophie Wilhelmine Scheibler, Allgemeines Deutsches Kochbuch für alle Stände, Leipzig, 1887. #kochen
A woman calling out the patriarchal system and rape culture... in 1771 London.
This was a great episode of #WhatsHerName podcast, and Catherine Jemmat's memoir is so old that you can read the full scanned version on the Internet Archive from the New York Library.
The Removed DOGE Deposition Videos Have Already Been Backed Up Across the Internet https://www.404media.co/the-removed-doge-deposition-videos-have-already-been-backed-up-across-the-internet/
I should check to make sure this Turnip executive order has been archived in the Wayback Machine...
"Saved 641 times between February 5, 2025 and March 25, 2026."
*nods* Good job, Internet Archive.
#InternetArchive #WaybackMachine