Tootfinder

@brewsterkahle@mastodon.archive.org
2025-11-29 16:14:33

Please Donate to the Internet Archive. $25 helps.... a lot.
Useful to Journalists,
Useful to Students,
Useful to more than 2 million people a day.
Collections growing at 150TBytes/day
@…

@Mediagazer@mstdn.social
2026-01-29 17:41:12

The Guardian, FT, NYT, USA Today Co., and others are blocking or limiting the Internet Archive's crawlers to prevent AI crawlers from using IA as a backdoor (Nieman Lab)
https://www.niemanlab.org/2026/01/news-publishers-limit-i…

News publishers limit Internet Archive access due to AI scraping concerns
Outlets like The Guardian and The New York Times are scrutinizing digital archives as potential backdoors for AI crawlers.

@bourgwick@heads.social
2026-01-30 01:14:40

*screams in historian/normal person who uses the wayback machine* https://mstdn.social/@Mediagazer/115979614480582544

Mediagazer (@Mediagazer@mstdn.social)
The Guardian, FT, NYT, USA Today Co., and others are blocking or limiting the Internet Archive's crawlers to prevent AI crawlers from using IA as a backdoor (Nieman Lab) https://www.niemanlab.org/2026/01/news-publishers-limit-internet-archive-access-due-to-ai-scraping-concerns/ http://mediagazer.com/260129/p9#a260129p9

@detondev@social.linux.pizza
2026-01-28 15:49:29

For the last few days i was avoiding the online as much as i could on a suicidal episode, self isolating from the noise of the world. Yet paradoxically to some, the best thing to happen during this time and the main reason im back so soon was listening to almost nothing but cassettes from the NOISE-ARCH archive. tap in.
https://archive.org…

@markhburton@mstdn.social
2025-11-27 08:52:22

The Internet Archive and Wayback Machine is a valuable resource, but only 1/1000 users donate to its running costs.
You can join them.
Donate to the Internet Archive.
https://archive.org/donate/

@stiefkind@mastodon.social
2025-11-29 09:28:35

ORWO Rezepte (ORWO recipes). No, this is nothing to cook or eat; it is about the "processing of photographic materials". Think chemistry set: https://archive.org/details/orwo-rezepte-1972/mode/2up

ORWO Rezepte Ausgabe 1972 : VEB Filmfabrik Wolfen : Free Download, Borrow, and Streaming : Internet Archive
ORWO Rezepte: Vorschriften zur Behandlung fotografischer Materialien. This book contains instructions for processing all common ORWO photographic materials. It...

@jorgecandeias@mastodon.social
2025-11-25 16:27:12

I guess the Internet Archive is now a for-Proffitt...
https://mastodon.archive.org/@internetarchive/115611174274779543

internetarchive (@internetarchive@mastodon.archive.org)
Attached: 1 image The Internet Archive welcomes Merrilee Proffitt as director of Democracy’s Library, US! With decades in digital libraries & open knowledge, she’ll expand free, online access to government research, supporting transparency, equity, and democratic engagement. Learn more at our blog ⤵️ https://blog.archive.org/2025/11/18/meet-merrilee-proffitt-director-of-democracys-library-us/ @internetarchive

@pre@boing.world
2025-12-29 20:52:35

It’s about this time of year I like to check my backups and download my archives.
One archive I download is the archive of my Mastodon posts. Pretty much the only one now I’ve left the corporate web really.
I also like to copy the contents of my public fediverse posts into my own diary within my vimwiki.
Keep it all in one place for easy and local search.
Here’s the script I use, it’s very short and just copies the content of every post in the archive into a new diary entry in the vimwiki diary.
If it finds something already there, it appends.
It checks whether it has already written a post into the diary, to avoid duplicating it when you run it over and over again every month or year or whatever.
Paste it into a new text-file called toVimWiki.php, download and unzip your mastodon archive, and run the script with php, passing it the path to the archive’s outbox.json and the root diary directory.
My diary is honestly mostly just public posts these days. Ain’t much in it I won’t blab about on the internet for likes and lols.
#archive #mastodon #vimwiki #endOfYear
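
The steps described in the post above (read outbox.json, copy each post into the diary entry for its date, append if the entry already exists, skip posts already copied) can be sketched roughly as follows. This is not the author's toVimWiki.php, which is not reproduced here; it is a minimal Python sketch under stated assumptions. Assumed, not confirmed by the post: that outbox.json lists posts under `orderedItems` as `Create` activities whose `object` carries `content` and `published` fields, and that diary entries are files named YYYY-MM-DD.wiki. The function name `copy_posts_to_diary` is hypothetical.

```python
import json
import sys
from pathlib import Path


def copy_posts_to_diary(outbox_path, diary_dir):
    """Append each post's content to the diary file for its date.

    Returns the number of posts written on this run.
    """
    outbox = json.loads(Path(outbox_path).read_text(encoding="utf-8"))
    diary = Path(diary_dir)
    diary.mkdir(parents=True, exist_ok=True)
    written = 0
    # Assumption: posts live under "orderedItems" in the archive's outbox.json.
    for activity in outbox.get("orderedItems", []):
        obj = activity.get("object")
        # Assumption: boosts carry only a URL string as their object; skip them.
        if activity.get("type") != "Create" or not isinstance(obj, dict):
            continue
        content = obj.get("content", "")
        published = obj.get("published", "")
        if not content or len(published) < 10:
            continue
        # Assumption: one diary file per day, named YYYY-MM-DD.wiki.
        entry = diary / f"{published[:10]}.wiki"
        existing = entry.read_text(encoding="utf-8") if entry.exists() else ""
        if content in existing:
            continue  # already copied on an earlier run
        with entry.open("a", encoding="utf-8") as fh:
            fh.write(content + "\n\n")
        written += 1
    return written


if __name__ == "__main__":
    if len(sys.argv) == 3:
        print(copy_posts_to_diary(sys.argv[1], sys.argv[2]))
```

The duplicate check here is a plain substring test on the post's content, which keeps repeated runs from writing the same post twice but would also skip a post whose text is identical to an earlier one on the same day.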

@brewsterkahle@mastodon.archive.org
2025-12-23 18:14:32

Horse Bots ! -- but not what you think-- but I couldn't resist.
Thank you, Department of Ag leaflets, this one from 1973.
https://archive.org/details/horsebotshowtoco450unit_2/page/n1/mode/2up
(lots of leaflets:

Horse bots: how to control them : United States. Agricultural Research Service. Northeastern Region : Free Download, Borrow, and Streaming : Internet Archive
7 p. 23 cm

@netzschleuder@social.skewed.de
2026-01-31 05:00:05

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 1161 nodes, 9977 edges. https://networks.skewed.de/net/us_agencies#missouri

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …

@fanf@mendeddrum.org
2026-01-20 18:42:03

from my link log —
Engineering and operations at the Internet Archive.
https://hackernoon.com/the-long-now-of-the-web-inside-the-internet-archives-fight-against-forgetting
saved 2026-01-20

The Long Now of the Web: Inside the Internet Archive’s Fight Against Forgetting | HackerNoon
A deep dive into the Internet Archive's custom tech stack.

@bourgwick@heads.social
2026-01-27 04:05:48

evening peace: brian eno & j. peter schwalm perform "4-D music" inside a volcano in spain in 2001, studio-quality radio broadcast. https://archive.org/details/01.-part-i

Brian Eno & J. Peter Schwalm - 2001-10-13 - Volcán del Cuervo, Lanzarote, Spain : Brian Eno & J. Peter Schwalm : Free Download, Borrow, and Streaming : Internet Archive
BRIAN ENO & J. PETER SCHWALM BETWEEN US & IT A performed installation of 4-D MUSIC 13th October, 2001. Volcán del Cuervo, Lanzarote. digital radio > wav >...

@brewsterkahle@mastodon.archive.org
2025-12-23 06:47:08

"Rabbit Recipes," US dept of Ag 1930. (We are looking for things to cook to celebrate public domain day Jan 21st 2026 re: 1930!)
Not so sure about Rabbit Recipes, but nice layout. Thank you US Dept of Ag for digitizing this (I love my job).
What will you cook to celebrate public domain day?
http…

Rabbit recipes : Yeatman, Fanny Walker, b. 1876 : Free Download, Borrow, and Streaming : Internet Archive
Cover title

@oligneisti@social.linux.pizza
2026-01-28 10:09:49

There is someone who has taken a lot of old films and used AI to colorize them badly and "boosted" the frame rate from 24 to 60fps and then uploaded this trash to the Internet Archive.
I hate this. Films shot in b&w have an aesthetic that is destroyed by colorizing them. Video games might look better with more frames per second but movies don't. We didn't accidentally settle on 24fps, it was trial and error. A film shot in 60fps looks bad. A movie that has been ar…

@netzschleuder@social.skewed.de
2025-12-30 17:00:04

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 1385 nodes, 7439 edges. https://networks.skewed.de/net/us_agencies#indiana

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …

@Techmeme@techhub.social
2025-11-09 14:01:25

An interview with Internet Archive founder Brewster Kahle on copyright lawsuits that threatened to bankrupt the nonprofit, fair use, IA's future, AI, and more (Ashley Belanger/Ars Technica)
https://arstechnica.com/tech-policy/20

Internet Archive’s legal fights are over, but its founder mourns what was lost
“We survived, but it wiped out the library,” Internet Archive’s founder says.

@nohillside@smnn.ch
2025-12-23 13:14:47

Pulled 60 Minutes segment on CECOT : CBS : Free Download, Borrow, and Streaming : Internet Archive https://archive.org/details/60minutes-cecotsegment

@cosmos4u@scicomm.xyz
2025-12-23 01:12:10

RE: #CECOT piece is everywhere on the web - even the Internet Archive: #60Minutes ... a #StreisandEffect writ large: https://en.wikipedia.org/wiki/Streisand_effect

@cdarwin@c.im
2025-11-03 20:40:43

The Internet Archive might sound like a thriving organization,
but it only recently emerged from years of bruising copyright battles that threatened to bankrupt the beloved library project.
In the end, the fight led to more than 500,000 books being removed from the Archive’s “Open Library.”
“We survived,” Internet Archive founder Brewster Kahle told Ars.
“But it wiped out the Library.”
An Internet Archive spokesperson confirmed to Ars that the archive currently…

Internet Archive’s legal fights are over, but its founder mourns what was lost
“We survived, but it wiped out the library,” Internet Archive’s founder says.

@floheinstein@chaos.social
2025-12-22 08:34:27

Holy moly, Anna's Archive has scraped and archived Spotify: ~300 TiB
https://annas-archive.org/blog/backing-up-spotify.html
The database with all the metadata is already 200 GiB on its own and is being merrily shared via torrent

Scene from Asterix "Der Seher" (Asterix and the Soothsayer). Centurion Gaius Ausgus is talking to a legionary. In the speech bubble he tells him: "Go onto the internet and report to the users. Tell them: All of Spotify has been downloaded." Then they will ask you: "All of it?" and you will answer them "All of it!" and they will understand

@makeratschool@kanoa.de
2026-01-12 17:47:51

Of course you all know the Grateful Dead collection at the Internet Archive. No?
Then set aside a bit of time; there are just under 18,000 recordings of various concerts there.
https://archive.org/details/GratefulDead?tab=collection

@jorgecandeias@mastodon.social
2025-11-23 15:00:57

Of all the lies of the internet, the worst is probably "the internet is forever".
Anyone who's been here for a while can name hundreds of things on the internet that are completely gone.
(yes, I know about the Internet Archive. I'm also aware of the many blind spots and broken links the Archive has. And of the fact that it isn't really searchable)

@StephenRees@mas.to
2025-11-13 22:16:14

The link leads to a video. Generally speaking I prefer to read rather than watch - but this is an exception
"While the early web promised connection and creativity, today’s internet is increasingly fragmented, paywalled, and dominated by a few powerful platforms.
“The truth is paywalled, and the lies are free”

@gevoel@mastodon.green
2026-01-23 06:51:17

Jarl: What began in a shadowy corner of the internet is now everywhere, from Davos to the Binnenhof | de Volkskrant
https://archive.ph/ueF7e

@brewsterkahle@mastodon.archive.org
2026-01-21 20:12:33

Public Domain Day short film contest by @… on NPR!
https://www.npr.org/2026/01/21/nx-s1-56777

@teledyn@mstdn.ca
2025-11-17 23:44:34

the Cornell University Library is now online, for free, no registration or login, just there, 76,474 books now tucked in at the Internet Archive
https://archive.org/details/cornell

@brewsterkahle@mastodon.archive.org
2025-12-16 19:38:03

new way to see the breadth and depth of the web, in this case Dutch websites.
fun.
https://display.archive.org/nl
congratulations @…

@rmdes@mstdn.social
2026-01-21 19:18:16

Currently retrieving my old blog.rmendes.net data from the internet archive 2020-2023 with the intent to migrate it to this place !

@fgraver@hcommons.social
2025-12-02 17:18:40

I bought my first modem at a post-Christmas sale in December 1992, and spent a few short months discovering BBSes and the text-based net before getting Mosaic and discovering the WWW in 1993. It opened up a whole new world! https://mastodon.archive.org/@internetarchive/11565105…

internetarchive (@internetarchive@mastodon.archive.org)
Attached: 1 image Mosaic was the first web browser to hit the mainstream in 1993, built by NCSA at Illinois. 🌐 It integrated text, images, data, audio & video, sparking a web boom. Not the first browser, but the one that made the web usable for millions. Its legacy? Every browser since. Visit its old website using your modern browser using the #WaybackMachine ⤵️ https://web.archive.org/web/19961220041605/http://www.ncsa.uiuc.edu/SDG/Software/Mosaic/NCSAMosaicHome.html #Wayback1T #In…

@Mediagazer@mstdn.social
2025-11-09 12:15:34

An interview with Internet Archive founder Brewster Kahle on copyright lawsuits that threatened to bankrupt the nonprofit, fair use, IA's future, AI, and more (Ashley Belanger/Ars Technica)
https://arstechnica.com/tech-policy/20

Internet Archive’s legal fights are over, but its founder mourns what was lost
“We survived, but it wiped out the library,” Internet Archive’s founder says.

@UP8@mastodon.social
2025-11-12 17:51:53

🗾 The second life of Japan's net cafes
https://www.japantimes.co.jp/business/2025/11/03/companies/internet-cafe-tokyo-kaiketsu-club/
🆓 …

The second life of Japan's net cafes
Once symbols of urban solitude, these spaces are finding new purpose as coworking hubs for a changing, wired generation.

@jorgecandeias@mastodon.social
2025-12-22 18:38:59

Yeah. I miss this internet. I really do.
https://mastodon.archive.org/@internetarchive/115764640377949472

@netzschleuder@social.skewed.de
2026-01-28 23:00:05

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 2452 nodes, 17736 edges. https://networks.skewed.de/net/us_agencies#newyork

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …

@thomasfuchs@hachyderm.io
2025-11-13 17:10:05

Gotta love the Internet Archive posting AI slop into my timeline

@servelan@newsie.social
2025-11-03 16:47:57

Internet Archive’s legal fights are over, but its founder mourns what was lost - Ars Technica
https://arstechnica.com/tech-policy/2025/11/the-internet-archive-survived-major-copyright-losses-whats-next/

@simon_lucy@mastodon.social
2025-11-21 00:26:55

@… @…
We rely upon the Internet Archive, original copies and printed material.

@brewsterkahle@mastodon.archive.org
2025-12-18 16:33:46

"The Richmond is Home to 1 Trillion Web Pages
A Friday afternoon tour of the Internet Archive, which recently celebrated its 29th anniversary".
Nice article about the free tours at the
@…
every friday at 1pm.

@davej@dice.camp
2026-01-03 17:17:03

Content warning: CW: uspol, Venezuela.

If anyone needs me, I’ll be right here rereading Butler’s “War is a Racket”(https://archive.org/details/WarIsARacket/) with Creedence’s “Fortunate Son” (https://

War Is A Racket : Major General Smedley Butler : Free Download, Borrow, and Streaming : Internet Archive
Famous booklet by the ex high ranking Marine

@cdarwin@c.im
2025-12-13 06:11:11

93 photos released from the Jeffrey Epstein estate (Dec. 12, 2025)
Usage: Public Domain Mark 1.0
Topics: Jeffrey Epstein
Item Size: 63.6M
https://archive.org/details/house-oversight-034614
Photos originally hosted here:

93 photos released from the Jeffrey Epstein estate (Dec. 12, 2025) : Free Download, Borrow, and Streaming : Internet Archive
Photos originally hosted here:...

@mela@zusammenkunft.net
2025-12-15 21:15:41

Mela, why is Jellyfin wheezing like that on the home server?
Hm, maybe I have discovered that the Internet Archive has an extensive repository of ancient filk tapes.

@mia@hcommons.social
2026-01-14 12:16:57

I love how everything I write now (or edit after reviews) involves extra steps in looking up Internet Archive links for British Library web pages and blog posts so that links in footnotes actually work.
* I lied, I don't love it.

@nic@geno.social
2025-11-06 22:16:48

“What the coming of the computer did, "just in time," was to make it unnecessary to create social inventions, to change the system in any way. So in that sense, the computer has acted as fundamentally a conservative force, which kept power or even solidified power where it already existed.” Joseph Weizenbaum, 1985

Weizenbaum examines computers and society - The Tech
An article from the Tuesday, April 9, 1985 issue of The Tech - MIT's oldest and largest newspaper and the first newspaper published on the Internet.

@brewsterkahle@mastodon.archive.org
2025-12-16 06:13:29

"The very first question to be considered is the applicability of the Copyright Law to the Moon. "
Thinking ahead (in 1952) about interplanetary copyrights :). if aliens have "Two Heads, Two Authors?"
really fun, worth reading.
ht…

American Library Association. ALA Bulletin 1953-01: Vol 47 Iss 1 : Free Download, Borrow, and Streaming : Internet Archive
American Library Association. ALA Bulletin 1953-01: Volume 47, Issue 1. Digitized from IA1514523-07. Previous issue:...

@nelson@tech.lgbt
2025-12-06 23:58:03

Recently reminded of Straight to Hell, the 1975—20?? zine featuring porny gay stories that were often real life narratives. Transgressive and important documentation. The Internet Archive has PDF scans of 5 edited book collections!
Harvard man, Harvard man,
blond for no reason. I'd
sure as shit quit the Quad
for him, scratch every dance
from my card for him; kiss
sunshine goodbye to open
his fly; trade clothes
for sheets, miss other meets
for him, for him

@brewsterkahle@mastodon.archive.org
2025-12-14 01:16:00

project idea (possibly using AI): record a take of songs from old sheet music books
Start with something like the Grange book of sheet music below and record them ... Apparently the music can be converted to MIDI via Play Score 2, and other programs could sing the lyrics.
or better yet, if anyone wants to perform the songs and upload them to the archive, you could link it into a review of the book.

Grange melodies : National Grange : Free Download, Borrow, and Streaming : Internet Archive
Don Yoder Collection of American Hymnody

@brewsterkahle@mastodon.archive.org
2026-01-12 18:39:50

media conglomerates are leading the way against open internet law by fining cloudflare:
Italy's "shadowy cabal of European media elites" ... "scheme to censor the Internet. The scheme, which even the EU has called concerning, required us within a mere 30 minutes of notification to fully censor from the Internet any sites a shadowy cabal of European media elites deemed against their interests. No judicial oversight. No due process. No appeal. No transparency. "

Matthew Prince 🌥 (@eastdakota) on X
Yesterday a quasi-judicial body in Italy fined @Cloudflare $17 million for failing to go along with their scheme to censor the Internet. The scheme, which even the EU has called concerning, required us within a mere 30 minutes of notification to fully censor from the Internet any

@stiefkind@mastodon.social
2026-01-10 15:31:57

"On the publication of the second part of the History of Pomerania, the author must first acknowledge his duty of gratitude to the memory of the King, now resting in God, whose gracious support granted the researcher the leisure and cheerfulness for his arduous work."
THAT is how you begin a preface 🙂
Source: F. W. Barthold, Geschichte von Rügen und Pommern (1840)

Geschichte von Rügen und Pommern : Barthold, Friedrich Wilhelm : Free Download, Borrow, and Streaming : Internet Archive
Part 1. From the earliest times to the fall of paganism -- Part 2. From the conversion of Pomerania to Christianity to the death of Barnim I in 1278....

@thomasfuchs@hachyderm.io
2025-12-10 14:24:44

Sure would be cool if one of the big tech companies would give the Internet Archive money to make a full-text searchable index of the Wayback Machine.

@stiefkind@mastodon.social
2025-11-02 20:23:43

Chance find: the @… holds all 49 annual volumes of the satirical magazine "Simplicissimus" (1896-1944). Leafing through them is simply wonderful: https://

Simplicissimus magazine : Langen, Albert, 1869-1909 : Free Download, Borrow, and Streaming : Internet Archive
Simplicissimus was a literary, illustrated satirical German weekly magazine started by Albert Langen in April 1896. The last issue appeared on September 13,...

@Mediagazer@mstdn.social
2025-11-05 08:25:53

A profile of nonprofit Common Crawl, which has scraped billions of webpages since 2013, including paywalled ones, to build an archive used by OpenAI and others (Alex Reisner/The Atlantic)
https://www.theatlantic.com/technology/202…

Common Crawl Is Doing the AI Industry’s Dirty Work
“You shouldn’t have put your content on the internet if you didn’t want it to be on the internet,” Common Crawl’s executive director says.

@netzschleuder@social.skewed.de
2025-12-20 18:00:04

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 446 nodes, 4920 edges. https://networks.skewed.de/net/us_agencies#delaware

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …

@netzschleuder@social.skewed.de
2025-11-14 21:00:04

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 560 nodes, 6460 edges. https://networks.skewed.de/net/us_agencies#virginia

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …

@netzschleuder@social.skewed.de
2025-11-09 14:00:04

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 1133 nodes, 8161 edges. https://networks.skewed.de/net/us_agencies#illinois

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …

@netzschleuder@social.skewed.de
2025-12-10 11:00:04

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 999 nodes, 11825 edges. https://networks.skewed.de/net/us_agencies#washington

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …

@netzschleuder@social.skewed.de
2026-01-01 21:00:03

us_agencies: U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and U…

us_agencies: U.S. government agency websites (2018). 515 nodes, 3607 edges. https://networks.skewed.de/net/us_agencies#northdakota

us_agencies — U.S. government agency websites (2018)
50 networks, one for each U.S. state, representing the web-based links between their associated government agencies websites. A node is an entire agency website and a directed edge (i,j) represents the existence of a hyperlink from any webpage in website i to some webpage in website j. Data was collected with a crawler. Nodes are annotated with the number of webpages per website, website name (related to its government function) and URL, and year the website was indexed by the Internet Archive …
