Tootfinder

Opt-in global Mastodon full text search. Join the index!

@brewsterkahle@mastodon.archive.org
2026-05-02 20:37:00

Linking data with the help of AI.
There are old medical PhD theses coming online (yea!).
Found Wikipedia articles about the doctors, then edited the Wikipedia articles that would be improved by pointing to them. Also added a review to the Internet Archive items to point to Wikipedia.

@karlauerbach@sfba.social
2026-06-02 19:52:46

Time to mention my "First Law of the Internet" again...
# The First Law of the Internet
Every person shall be free to use the Internet in any way that is privately beneficial without being publicly detrimental.
* The burden of demonstrating public detriment shall be on those who wish to prevent the private use.
* Such a demonstration shall require clear and convincing evidence of public detriment.
* The public detriment must be of such degree and e…

@sherold@mastodon.online
2026-05-04 09:22:51

Does anyone remember this?
Little Computer People, by Activision (1985).
#gaming

@privacity@social.linux.pizza
2026-06-03 16:27:30

The Internet's most powerful archiving tool is in danger
While major news outlets block the Wayback Machine, journalists and advocacy groups are mobilizing to protect the Internet Archive's vast collection of web pages.

@matthiasott@mastodon.social
2026-04-04 09:31:28

Just released a small but useful update for my little Internet Archive Plugin for Craft CMS. 🏛️
You can now opt individual entries out of automatic archiving via a lightswitch field – and manually send any entry to the Wayback Machine straight from the Craft CP entries index. ✨
github.com/matthiasott/cr…

@fennek@cyberplace.social
2026-04-03 10:27:13

What in the hell am I looking at 🧐
toot.community/@internetarchiv

@brewsterkahle@mastodon.archive.org
2026-04-29 14:50:22

Claude Code is good at doing research!
In this case helping find the disposition of 126,000 digitized US Supreme Court dockets (cert denied or full opinion), and then reporting why in an archive.org review (using it like a Wikipedia discussion).
This took real hand-holding and QA to be sure, but it is super helpful. Looked at CourtListener, the Supreme Court site (and the old one via the Wayback Machine!), and the Caselaw Access Project at Harvard. So good.

@zachleat@zachleat.com
2026-05-01 15:56:45

I really like Google Search’s stated mission to “organize the world's information and make it universally accessible and useful” but I’m curious what organizations do y’all think are most ideally invested in that mission?
Wikipedia seems like an obvious one. Internet Archive seems like another. Any others?

@heiseonline@social.heise.de
2026-05-08 16:00:00

Memory of humanity: the Internet Archive puts down roots in Switzerland
With a foundation in St. Gallen, the Internet Archive aims not only to rescue endangered archives but also to preserve the AI era. But resistance is growing.

@brewsterkahle@mastodon.archive.org
2026-05-28 00:01:14

The Internet Archive working with the NOAA Library to make much of its collection digital and as accessible as possible is wonderful, and a wonderful model: let's flip more libraries digital.
archive.org/details/noaa

noaa

History repeats itself more than it should.
If you think we cannot get out from under these ghouls, then all I can say is we did it several times before.
We should be getting good at it by now.
The closest Trump's gonna get to heaven is on an airplane.
Go figure, cause I can't
Keep protests peaceful.
Don't kill anyone.
They DO make a difference.
Here are some resistance related guides from around the world:
🇺🇸 Fundamentals …

@Techmeme@techhub.social
2026-04-28 06:10:50

An analysis of Internet Archive data finds that by mid-2025, ~35% of new websites published since ChatGPT's launch in late 2022 were AI-generated or AI-assisted (Matthew Gault/404 Media)
404media.co/study-finds-a-thir

@Mediagazer@mstdn.social
2026-04-29 04:30:45

An analysis of Internet Archive data: by mid-2025, ~35% of newly published websites since ChatGPT's launch in November 2022 were AI-generated or AI-assisted (Matthew Gault/404 Media)
404media.co/study-finds-a-thir

@digitalnaiv@mastodon.social
2026-05-27 06:23:00

241 news portals already block the Internet Archive's web crawlers, including the Guardian and the NYT. Of all things, media houses that benefit from the open web's chain of evidence are sawing away at it. Carla Siepmann @CarlaSiepmann on @netzpolitik_feed with an important piece. #LinkRot #InternetArchive

@servelan@newsie.social
2026-03-22 00:23:21

EFF Tells Publishers: Blocking the Internet Archive Won't Stop AI, But It Will Erase The Historical Record - Slashdot
yro.slashdot.org/story/26/03/2

@vform@openbiblio.social
2026-03-12 18:11:02

The Internet Archive's (@…) interlibrary loan works
archivalia.hypotheses.org/2522

@kubikpixel@chaos.social
2026-05-06 10:30:19

Here's some positive news about online archives from Switzerland, but also against AI:
(The term AI is a broad catch-all expression for automation)
«Internet Archive gets a location in Switzerland:
Internet Archive Switzerland began operations on 5 May 2026. The foundation aims to safeguard Swiss content, archive AI models, and offer endangered documents a digital refuge.»
🗃️

@kctipton@mas.to
2026-04-14 02:43:40

News publishers limit Internet Archive access due to AI scraping concerns | Nieman Journalism Lab niemanlab.org/2026/01/news-pub

@qbi@freie-re.de
2026-04-14 18:13:53

Aadam Jacobs took a recorder to a #Nirvana concert in 1989. Since then he has truly earned his nickname "Taper Guy". By now he has recorded more than 10,000 concerts on tape. Now the recordings are being digitized and at …

@Xavier@infosec.exchange
2026-04-15 15:56:21

This is an absolute gold mine for music fans, especially us #genx folks. After listening to pre-Grohl Nirvana, I'm now listening to Tracy Chapman from 1988. Thank you Aadam Jacobs and the @… !!

@bourgwick@heads.social
2026-03-16 13:32:13

of interest to a few certain strains of music heads: huge stash of NME back issues now dropping on @…, 300 issues, 1969-1983 (so far).

@brewsterkahle@mastodon.archive.org
2026-05-19 01:12:28

Connie Chan rocks. Thank you, Nancy Pelosi; good pick to endorse for your seat.
Connie Chan represents the Internet Archive district in San Francisco, and got unanimous support for: Digital Library Rights, and Internet Archive Day resolution in 2025, and got the Internet Archive Hero award in 2023!

Connie Chan

@chrysn@chaos.social
2026-05-22 12:44:44

RE: mastodon.archive.org/@internet
The web shutting out the Wayback Machine is bad. But I'd turn this around: If someone wants to move their website into the darknet, so be it. But I won't link to thin…

@avalon@jazztodon.com
2026-04-08 20:39:57

Internet Archive: Aadam Jacobs Collection
No tapes left behind
#bootlegme
archive.org/details/@aadam_jac

@stiefkind@mastodon.social
2026-03-23 12:58:53

"The art that teaches us to treat the foodstuffs that nature lavishly offers us in such a way that a healthy and tasty dish is prepared from them is not easy at all; it can never be achieved without thought, care, and effort."
From: Sophie Wilhelmine Scheibler – Allgemeines Deutsches Kochbuch für alle Stände, Leipzig, 1887. #kochen

As so often, the Internet Archive has come to save the day.
At its newly opened Aadam Jacobs Archive, you can now listen to nearly 2,500 of the concert recordings that volunteers have digitized and uploaded so far.
In that more than a terabyte of files, you’ll find concerts by Nirvana, Phish, Tracy Chapman, Depeche Mode, Flaming Lips, Stereolab, Liz Phair, Sonic Youth, Nick Cave and the Bad Seeds, Björk, They Might Be Giants (recorded four times in 1988 alone), and the Mekons, a…

@curiouscat@fosstodon.org
2026-05-04 19:55:30

I donated to the Internet Archive today.
They do valuable work and are a huge resource for humanity.
archive.org/
"The Internet Archive, a non-profit, is building a digital library of Internet sites and other cultural artifacts in digital form. Like a paper library, we provide free access to rese…

@niqdanger@social.linux.pizza
2026-04-09 21:56:56

If you didn't know about this collection of shows, check it out. Aadam Jacobs collection at the Internet Archive. Seeing he was Chicago based I took the chance to search and sure enough, he has a few Troubled Hubble shows. Amazing. archive.org/details/@aadam_jac

@esoriano@social.linux.pizza
2026-04-11 12:59:58

Aadam Jacobs has secretly recorded over 10,000 local concerts since 1989.
Now, they are cleaned up and ready to listen to for free online:
archive.org/details/@aadam_jac

@wraithe@mastodon.social
2026-05-15 22:25:20

Thread worth reading!
bsky.app/profile/did:plc:73dpz

@newsie@darktundra.xyz
2026-05-05 13:26:40

The AI Hard Drive Shortage Is Making It More Expensive and Harder to Archive the Internet 404media.co/the-ai-hard-drive-

@scottmiller42@mstdn.social
2026-03-25 17:13:26

It looks like the Internet Archive @… is down at the moment.
#InternetArchive

@Mediagazer@mstdn.social
2026-04-13 08:05:34

A look at the Aadam Jacobs Collection, which is adding its 10,000 concert recordings to the Internet Archive; only one or two artists have requested takedowns (Christopher Weber/Associated Press)
apnews.com/article/aadam-jacob

@ubuntourist@mastodon.social
2026-04-14 01:57:15

The Wayback Machine of the Internet Archive is in peril
#WaybackMachine

@Techmeme@techhub.social
2026-04-13 11:51:34

Originality AI: 23 major news websites and Reddit block the Internet Archive's crawler; journalists and advocacy groups sign a letter supporting the Archive (Kate Knibbs/Wired)
wired.com/story/the-internets-

@Life_is@no-pony.farm
2026-04-24 05:41:16

Humans have a memory. A short-term memory and a long-term memory. An episodic memory and other forms of memory. If one of these memories fails, the human no longer functions properly either.

Humanity, too, has a memory. It consists of the Google Cache, the Internet Archive, Archive Today, Anna's Archive, Wikipedia, Grokipedia.

The Google Cache no longer exists. Archive Today has itself comp…

@brian_gettler@mas.to
2026-05-12 15:44:55

The "Handbook of Upper Canadian Chronology" is simultaneously the most boringly titled book - though accurate, it's a super boring book - and one of the most useful for my current project. It's basically just a list of officeholders, regulations, laws, geographical divisions, etc., etc. But gloriously helpful. Thank you to my forebear, the historian Frederick H. Armstrong, for making the lives of the handful of us who work on early 19th-century Ontario so much easier.

@thomasfuchs@hachyderm.io
2026-03-17 04:30:38

Apple only has keynotes/events from 2007 onwards archived on their website (as video podcast episodes).
Is there an ideally complete and high quality source for keynotes/events from 1998–2006?
(Yes, I know some are on the Internet Archive and YouTube, but they’re incomplete and often extremely low quality.)

@hikingdude@mastodon.social
2026-05-17 13:43:31

"Doom" soundtrack becomes a cultural asset in the USA
I just found it in the Internet Archive and I can tell: that is NOT how Doom sounded on my Sound Blaster 16. 🤔
heise.de/en/news/Doom-soundtra

@Mediagazer@mstdn.social
2026-05-21 06:25:49

Analysis: 382 news sites, including 342 local outlets, are blocking Internet Archive's crawlers amid AI concerns, an increase from 241 sites in January 2026 (Nieman Lab)
niemanlab.org/2026/05/more-tha

@katrinakatrinka@infosec.exchange
2026-03-18 20:36:22

A woman calling out the patriarchal system and rape culture... in 1771 London.
This was a great episode of #WhatsHerName podcast, and Catherine Jemmat's memoir is so old that you can read the full scanned version on the Internet Archive from the New York Library.

@compfu@mograph.social
2026-05-14 21:39:28

RE: mastodon.social/@kohnzn/116571
This vintage Commodore TV commercial is legit!
There's a whole collection on the Internet Archive with more of them. Ah, the 80s...

@brewsterkahle@mastodon.archive.org
2026-04-06 16:47:01

"Do you need a liberal education? We say that it is unpatriotic not to read these books."
"The death of democracy is not likely to be an assassination from ambush. It will be a slow extinction from apathy, indifference, and undernourishment."
Robert M Hutchins (of Great Books fame)

A voyage towards the South Pole performed in the years 1822–24. Containing an examination of the Antarctic Sea, to the seventy-fourth degree of latitude; and a visit to Tierra del Fuego, with a particular account of the inhabitants. To which is added, much useful information on the coasting navigation of Cape Horn, and the adjacent lands.

@scottmiller42@mstdn.social
2026-03-26 14:50:28

I should check to make sure this Turnip executive order has been archived in the Wayback Machine...
"Saved 641 times between February 5, 2025 and March 25, 2026."
*nods* Good job, Internet Archive.
#InternetArchive #WaybackMachine

@newsie@darktundra.xyz
2026-03-14 16:01:25

The Removed DOGE Deposition Videos Have Already Been Backed Up Across the Internet 404media.co/the-removed-doge-d

@darius@social.linux.pizza
2026-03-12 23:36:17

Does anyone ever feel nostalgia for PDAs? As a kid, I first got an old Palm and then a Pocket Loox 720 (with Windows Mobile 2003SE) as a hand-me-down from my dad.
I have very fond memories of browsing through the depths of the Internet trying to find some cool new program. Most notably though, there was a port of Android you could boot from a CF or SD card, and it was the coolest thing ever:

@Mediagazer@mstdn.social
2026-04-14 06:05:38

Originality AI: 23 major news websites and Reddit currently block the Internet Archive's crawler; journalists and advocacy groups sign a letter backing the IA (Kate Knibbs/Wired)
wired.com/story/the-internets-

@netzschleuder@social.skewed.de
2026-03-10 06:00:05

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 561 nodes, 2949 edges. https://networks.skewed.de/net/us_agencies#arkansas

@stev3yd@social.linux.pizza
2026-05-13 14:39:53

I was able to pull up my old Dragon Ball Z website from the Internet Archive. I made this website 25 years ago on Angelfire. I was 12 at the time. Time flies by!
#memories #webdev

Early 2000s Dragon Ball Z website

@Mediagazer@mstdn.social
2026-05-20 12:40:49

A look at Disney's 10-year mismanagement of FiveThirtyEight as Nate Silver says Disney is refusing to negotiate with him about restoring the site's archive (Nate Silver/Silver Bulletin)
natesilver.net/p/disney-erased

@netzschleuder@social.skewed.de
2026-03-04 23:00:05

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 561 nodes, 2949 edges. https://networks.skewed.de/net/us_agencies#arkansas
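
The us_agencies description says a node is an entire website and a directed edge (i,j) exists when any page of website i links to some page of website j. A minimal sketch of that collapsing step follows, under stated assumptions: the function names, the use of the host name as the site identity, the exclusion of within-site links, and the example URLs are all hypothetical and are not taken from the dataset or its crawler.

```python
from collections import defaultdict
from urllib.parse import urlsplit


def site_of(url: str) -> str:
    """Return the lowercase host name of a URL (assumed here to identify a site)."""
    return urlsplit(url).hostname or ""


def build_site_graph(page_links):
    """Collapse page-level hyperlinks into a site-level directed graph.

    page_links: iterable of (source_page_url, target_page_url) pairs.
    Returns (edges, pages_per_site): edges is a set of (site_i, site_j)
    pairs; pages_per_site maps each site to its number of distinct pages seen.
    """
    edges = set()
    pages = defaultdict(set)
    for src, dst in page_links:
        i, j = site_of(src), site_of(dst)
        pages[i].add(src)
        pages[j].add(dst)
        if i != j:  # assumption: links inside one site do not form an edge
            edges.add((i, j))  # a set keeps one edge per ordered site pair
    return edges, {site: len(urls) for site, urls in pages.items()}


# Made-up example URLs, not taken from the dataset
links = [
    ("https://health.example.gov/a", "https://roads.example.gov/x"),
    ("https://health.example.gov/b", "https://roads.example.gov/y"),
    ("https://roads.example.gov/x", "https://health.example.gov/a"),
    ("https://health.example.gov/a", "https://health.example.gov/b"),
]
edges, pages_per_site = build_site_graph(links)
```

Storing edges in a set mirrors the "existence of a hyperlink" wording: many page-level links between the same two sites still yield a single directed edge, while the per-site page count corresponds to the node annotation the description mentions.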