Tootfinder

Opt-in global Mastodon full text search. Join the index!

@aral@mastodon.ar.al
2025-11-17 09:19:45

“TABS [by Mozilla] pulls exactly the data you need—from HTML to Markdown to JSON—using the fastest, most efficient method for each page. It adapts to the structure and complexity of the site, staying stealthy and reliable so your [AI] agents always get what they need without friction.”
Ethical Stealthy AI Scraping (tm) by Mozilla.
#Mozilla

@lysander07@sigmoid.social
2025-10-14 09:44:11

Today, I'm at Bundesarchiv in Koblenz for the Strategy & Planning meeting of our project "Wiedergutmachung". Our task in this project is to develop efficient information extraction from historical case files of the German Postwar recompensation process of nationalsocialist injustice.
@… @…

Promo postcard for the Wiedergutmachung project. Title: The German Wiedergutmachung. The theme depicted on the card is a blue tinted historical bw photo showing a woman (right) looking through files that are piled on a desk. Opposite there are two men sitting. One is wearing some traditional foreign hat. In the lower right corner, the logo of the German ministry of finances is depicted next to the Wiedergutmachung logo. There is also a small British flag (Union Jack) depicted on the right side,…
@simon_brooke@mastodon.scot
2025-10-12 22:35:42

This evening I have been listening to one of @… 's podcasts and thinking about my failure in trying to lead the village's planning working group, and about the cognitive dissonance underlying my Tricycle project. I suspect this essay will be a grim read; it's not well formed in my mind as I sit down to write.

@Techmeme@techhub.social
2025-10-11 04:46:04

Google plans to invest $10B to build a new 1GW data center cluster near Visakhapatnam in Andhra Pradesh, India, with operations expected to begin by July 2028 (Shashank Pathak/Entrackr)
entrackr.com/snippets/google-t

@karlauerbach@sfba.social
2025-12-08 19:24:48

Back when I worked at SDC in Santa Monica (roughly 1971 through 1980) one of our department's projects was an early AI project.
We had an entire IBM 370 mainframe running CP/67 (which became IBM's VM) running LISP based AI code.
I think that the group may have been using it for continuous speech recognition - which we knew our "national security' customer was planning to use to automate wiretaps.
(I used the speech recognition project's soundproof room …

@arXiv_csRO_bot@mastoxiv.page
2025-10-07 11:20:42

SITCOM: Scaling Inference-Time COMpute for VLAs
Ayudh Saxena, Harsh Shah, Sandeep Routray, Rishi Rajesh Shah, Esha Pahwa
arxiv.org/abs/2510.04041

@arXiv_physicsaoph_bot@mastoxiv.page
2025-10-07 08:22:12

Score-based generative emulation of impact-relevant Earth system model outputs
Shahine Bouabid, Andre Nogueira Souza, Raffaele Ferrari
arxiv.org/abs/2510.04358