Tootfinder

Opt-in global Mastodon full text search. Join the index!

@bbhorne@kolektiva.social
2024-04-05 23:48:36

I just watched the Antisocial Network. I found it questionable that they let Kirtaner appear to take credit for the Parler scrape of video data about January 6. But I did appreciate that they got his character arc generally right, about how Kirtaner fell squarely on the "troll" side of the Anonymous schism, and not the "activist" side of the schism, originally... He only tried to take up the "activist" nomenclature and symbols, after coming out of retirement, to fight this battle against QAnon.

@philip@mastodon.mallegolhansen.com
2024-03-05 16:19:49

@… Not saying this alone is good enough, but a starting point:
If you’re writing a scraper, make sure you actually respect the damn robots.txt, it’s there for a reason.
If someone took the time and effort to explicitly indicate what you’re allowed to scrape, listen.

@Techmeme@techhub.social
2024-03-02 05:50:56

iiMedia Research: China's food delivery market reached $208B in 2023, 2.3x the size in 2020; Meituan and Ele.me employ over 10M gig workers combined (Takashi Kawakami/Nikkei Asia)
t.co/TXfYpEbLPr

@samerfarha@mastodon.social
2024-05-04 14:19:13

Had some leftover homemade pizza sauce (puréed canned crushed tomatoes, red pepper flakes, citric acid, and salt) so I made some “pizza spice”.
Poured it on a silpat, added some basil leaves, dehydrated at 150°F overnight (stopping once to scrape the tomato leather off and put it on the rack to dry out faster). Then blitz it all to a powder and sieve it.
Very tomatoey and a bit spicy/salty.

Dehydrated tomato sauce on a sheet pan
Powdered pizza spice in a container
@kexpmusicbot@mastodonapp.uk
2024-03-03 08:09:14

🔊 #NowPlaying on KEXP's #SeekAndDestroy
Midnight:
🎵 Gash Scrape
#Midnight

@samerfarha@mastodon.social
2024-05-04 14:19:13

Had some leftover homemade pizza sauce (puréed canned crushed tomatoes, red pepper flakes, citric acid, and salt) so I made some “pizza spice”.
Poured it on a silpat, added some basil leaves, dehydrated at 150°F overnight (stopping once to scrape the tomato leather off and put it on the rack to dry out faster). Then blitz it all to a powder and sieve it.
Very tomatoey and a bit spicy/salty.

Dehydrated tomato sauce on a sheet pan
Powdered pizza spice in a container
@askans@bonn.social
2024-03-19 20:31:30

I am tiny little bit proud that I could give back something to the #n8n community:
my workflow on web scraping has been published.
n…

@jake4480@c.im
2024-02-27 22:35:51

Yeah, you're really gonna see which companies are just gonna allow the AI to scrape all their stuff now. I'm a copyleft/creative commons kinda guy. But if you have art that you don't want stolen, the answer is simple.
MAKE YOUR OWN WEBSITE and put your art there (edit: and use that Glaze type of stuff on your art that wrecks AI, just to be sure)! Neocities is SO easy to set up! Or your own domain and hosting via porkbun, GoDaddy (non-WordPress) - anything at all other than …

@Adam@social.lein.us
2024-02-29 13:32:07

How does retention.com/ work to scrape email addresses from web visitors? Does it sniff out webmail cookies from the same browser?

@MamasPinkyToe@mastodon.world
2024-03-27 21:47:49

Three months? Six months? A year? How long after you've got the kid do you scrape off the "Baby On Board" sticker?

@Techmeme@techhub.social
2024-03-26 16:46:02

An interview with Adobe executives about training Firefly on content licensed specifically for AI training, the decision not to scrape the internet, and more (Melissa Heikkilä/MIT Technology Review)
technologyreview.com/2024/03/2

@tante@tldr.nettime.org
2024-03-09 13:50:55

"It turns out that generative AI companies don’t like it when you steal, sorry, scrape, images from them. Cue the world’s smallest violin."
In Moment of Unbelievable Irony, Midjourney Accuses Stability AI of Image Theft
themarysue.com/midjourney-accu

@cwensel@fosstodon.org
2024-04-20 21:14:53

Instead of Apps with maps, it should be a Map with apps/layers.
E.g. a map with a Find My layer and public transit layer. Or public transit with est arrivals and Yelp reviews.
Google kinda does this but it’s all Google or what they scrape.

@nuthatch@infosec.exchange
2024-04-15 17:54:32

Just footfalls and the occasional scrape of a chair against the floor, they’re pretty quiet, so quiet it takes me time to realize they’ve been away for days or weeks (they travel a lot.)

@adrianco@mastodon.social
2024-04-09 02:51:24

Decided I wanted to extract the transcripts from videos of talks I’ve given, and used ChatGPT to write Python code (I‘m not a Python programmer) to read the YouTube API (which I’ve not used before). First thing I found is that OAuth is needed so that Google knows who you are. Then found that I can only read the transcript for videos I personally posted. So now ChatGPT is helping me write code to scrape the web page that I can see the Transcript on. No way I could have done this on my own.

@simoncox@seocommunity.social
2024-04-11 15:55:32

Todays fun has been recovering content from 18 sites in 8 languages for some site migrations I am helping out on.
The pagination on the pages could not be crawled, and none of these pages were in xml sitemaps either, so the vast majority of the pages are not indexed in search engines. Had to do a lot of manual scraping to get a list of URLs to scrape the content from.
Time for a beverage. 🍷

@cybertailor@craba.cab
2024-02-08 06:28:09

"ЦИПСО крутит клоуна": Matrix Arc

Скриншот из Matrix:

Hi, is there a fork or a FAQ to install nitter with the data of a real account? I need a nitter instance to scrape twitter data and have <10k requests a month.

Под сообщением 8 реакций клоуна.
@arXiv_physicsplasmph_bot@mastoxiv.page
2024-03-28 07:34:34

Global fluid turbulence simulations in the SOL of a stellarator island divertor
Brendan Shanahan, David Bold, Ben Dudson
arxiv.org/abs/2403.18220

@johnhobbs@mstdn.ca
2024-04-18 13:06:54

4/7
Moreover, zinc has a starring role in wound healing and maintaining skin integrity. Whether it's a minor scrape or a surgical wound, zinc serves as a co-factor in collagen synthesis and inflammatory response reduction, speeding up the healing process. Its antibacterial properties also prevent wound infections, making it an all-encompassing ally for your skin’s health.

@adrianco@mastodon.social
2024-04-09 02:51:24

Decided I wanted to extract the transcripts from videos of talks I’ve given, and used ChatGPT to write Python code (I‘m not a Python programmer) to read the YouTube API (which I’ve not used before). First thing I found is that OAuth is needed so that Google knows who you are. Then found that I can only read the transcript for videos I personally posted. So now ChatGPT is helping me write code to scrape the web page that I can see the Transcript on. No way I could have done this on my own.

@philip@mastodon.mallegolhansen.com
2024-02-12 01:31:32

@… This sounds like it could be fairly low cost on the hosting end, if you were doing some kind of event driven architecture.
Once a day scrape the featured apps (or however often they update) -> lookup if any of those apps match a client you have in your db -> send out notifications.
With something like AWS Lambda I doubt this is a high cost so…

@arXiv_physicsplasmph_bot@mastoxiv.page
2024-03-18 07:19:37

Tokamak H-mode edge-SOL global turbulence simulations with an electromagnetic, transcollisional drift-fluid model
W. Zholobenko, K. Zhang, A. Stegmeir, J. Pfennig, K. Eder, C. Pitzal, P. Ulbl, M. Griener, L. Radovanovic, U. Plank, ASDEX Upgrade Team
arxiv.org/abs/2403.10113