Tootfinder

Opt-in global Mastodon full text search. Join the index!

@teledyn@mstdn.ca
2025-12-23 21:02:25

Still working the kinks out of transporting my computing home to a new laptop, and this one popped up today attempting to generate and view #Emacs #org as HTML. I recognize the words as English, but can anyone parse the meaning?
org-open-file: Please see Org News for version 9.0 about ‘org-file-apps’--Error: Deprecated usage of (browse-url file)
Needless to say, search engines were no help in finding Org News, for any version.

@newsie@darktundra.xyz
2025-10-21 15:18:45

Russia pressures Apple to make Russian search engines default on locally-sold iPhones therecord.media/russia-apple-s

@netzschleuder@social.skewed.de
2025-12-22 02:00:52

trec_web: TREC WT10g (2003)
A web graph network originally constructed in 2003 as a testbed for information-retrieval techniques, including web search engines. Distributed by University of Glasgow.
This network has 1601787 nodes and 8063026 edges.
Tags: Informational, Web graph, Unweighted
networks.skewed.de/net…

trec_web: TREC WT10g (2003). 1601787 nodes, 8063026 edges. https://networks.skewed.de/net/trec_web

@Techmeme@techhub.social
2025-10-14 17:50:47

Document: to avoid an EU fine, Google offered to tweak its search results to show vertical search engines in their own box on Search (Foo Yun Chee/Reuters)
reuters.com/legal/litigation/g

@tante@tldr.nettime.org
2025-10-28 09:20:05

When "AI" search engines look for sources, they go for fringe and not established quality sites. What could possibly go wrong?
(Original title: AI-powered search engines rely on “less popular” sources, researchers find)
arstechnica.com/a…

@ethanwhite@hachyderm.io
2025-10-17 13:08:13

One of the things I love about @… (and several other modern search engines) is the !w to go straight to the search results from Wikipedia. It's a useful way to go straight to a reliable source right from your search bar.
404media.co/wikipedia-says-ai-

@thomasfuchs@hachyderm.io
2025-11-19 01:50:10

Anyone using UTM/QEMU with a NT 3.51 VM?
I’m looking for correct network settings. It detects NE2000 ISA and assigns an IP but I can’t ping anything.
UTM mentions to set IRQ to 9 but that’s not an option I see (it’s set to 2).
Mainly I want to access a FTP server on the host computer to transfer files.
👉 I know how to use search engines, therefore please only make suggestions if you are personally familiar with this, thank you. 👈

@newsie@darktundra.xyz
2025-12-23 16:33:48

US disrupts multimillion-dollar bank account takeover operation targeting Americans therecord.media/us-disrupts-ba

@gedankenstuecke@scholar.social
2025-10-17 13:53:55

«younger generations are seeking information on social video platforms rather than the open web. This gradual shift is not unique to #Wikipedia. Many other publishers and content platforms are reporting similar shifts as users spend more time on search engines, AI chatbots, and social media to find information.»
All of those "alternatives" are facilitated through algorithmic recommendation engines designed to maximize profits, what could possibly go wrong there…
404media.co/wikipedia-says-ai-

@aral@mastodon.ar.al
2025-11-17 09:19:45

“TABS [by Mozilla] pulls exactly the data you need—from HTML to Markdown to JSON—using the fastest, most efficient method for each page. It adapts to the structure and complexity of the site, staying stealthy and reliable so your [AI] agents always get what they need without friction.”
Ethical Stealthy AI Scraping (tm) by Mozilla.
#Mozilla

@karlauerbach@sfba.social
2025-12-17 20:32:56

Bing's web crawler is a true putz of the Internet.
By the way, various other search engines use Bing as the underlying web crawler, so this is far from being a Microsoft issue.
Bing's web crawler refuses to index content that does not meet its standards.
And what are those standards? It is that pages contain a bunch of meta tags and such.
Well, some of my archival content was written around 1995, long before those meta tags were conceived.
Thus, from Bi…

@arXiv_csIR_bot@mastoxiv.page
2025-10-14 11:32:29

What Generative Search Engines Like and How to Optimize Web Content Cooperatively
Yujiang Wu, Shanshan Zhong, Yubin Kim, Chenyan Xiong
arxiv.org/abs/2510.11438

@Techmeme@techhub.social
2025-11-09 20:45:36

Zurich-based DeepJudge, which builds customized search indexes for law firms that plug into AI tools like ChatGPT, raised a $41M Series A at a $300M valuation (Alicia Park/Forbes)
forbes.com/sites/aliciapark/20

@arXiv_csLG_bot@mastoxiv.page
2025-10-08 11:01:09

Stratified GRPO: Handling Structural Heterogeneity in Reinforcement Learning of LLM Search Agents
Mingkang Zhu, Xi Chen, Bei Yu, Hengshuang Zhao, Jiaya Jia
arxiv.org/abs/2510.06214

@maxheadroom@hub.uckermark.social
2025-10-17 05:24:56

I won the internet today. Was searching for how to modify #Ollama environment variables in #homebrew on #macOS. The second entry in

Screenshot of an Internet search engines result for a question about environment variables for home-brew services. Second result is highlighted with a red frame to mark the domain it points to: falko.zurell.de.

@me@mastodon.peterjanes.ca
2025-12-18 20:01:46

Apparently the "Brother printer support scam" is still a thing. Great job, search engines, certificate providers, browser developers, and every other tech field that's failed to stanch this over the last decade or two.

@zydecopaws@pnw.zone
2025-11-19 16:59:43

And by going on with the rest of my day, it means I then have to go down the rabbit hole of “I wonder what else is being celebrated today” and off to the search engines I went.

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:17:59

TaoSR-SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance
Pengkun Jiao, Yiming Jin, Jianhui Yang, Chenhe Dong, Zerui Huang, Shaowei Yao, Xiaojiang Zhou, Dan Ou, Haihong Tang
arxiv.org/abs/2510.07972

@teledyn@mstdn.ca
2025-10-16 17:43:53

Search is Dead.
Really, this should not have taken two hours across oodles of prompts using four different search engines, but every one of those prompts was deconstructed into syllables to match anything even remotely thesaurused, regardless of details provided, regardless of quote marks, minus or plus signs. "Malcolm Strauss"? oh, you mean RICHARD! No, fuck you bot, and Diana Allen is neither Rita Hayworth nor Woody Allen and 1923 is not 1945 and a 2006 is unlikely to be SILENT, and this shouldn't take two hours before, finally, only Google would find this mention with the key phrase "presumed lost".
Silent Era : Progressive Silent Film List
silentera.com/PSFL/data/S/Salo

@kubikpixel@chaos.social
2025-12-04 06:05:26

Curlie - The Collector of URLs
Some search engines for the web are also based on @…, among other things. You can also enter your websites in this so that they can be found more easily when people search for keywords that relate to your pages.
🐿️

@arXiv_csCR_bot@mastoxiv.page
2025-10-09 09:42:11

Exposing Citation Vulnerabilities in Generative Engines
Riku Mochizuki, Shusuke Komatsu, Souta Noguchi, Kazuto Ataka
arxiv.org/abs/2510.06823

@netzschleuder@social.skewed.de
2025-12-15 12:00:54

trec_web: TREC WT10g (2003)
A web graph network originally constructed in 2003 as a testbed for information-retrieval techniques, including web search engines. Distributed by University of Glasgow.
This network has 1601787 nodes and 8063026 edges.
Tags: Informational, Web graph, Unweighted
networks.skewed.de/net…

trec_web: TREC WT10g (2003). 1601787 nodes, 8063026 edges. https://networks.skewed.de/net/trec_web

@thomasfuchs@hachyderm.io
2025-10-05 04:05:31

You can easily make text-based interfaces that don’t do this.
E.g. normal search engines don’t say “I found 37 results, that’s a really great search term, Thomas!”.
That would be weird, wouldn’t it?
Yet…

@dcm@social.sunet.se
2025-10-02 14:24:39

Cool little adapted tale on AI, by Alison Gopnik:
simons.berkeley.edu/news/stone

@arXiv_csHC_bot@mastoxiv.page
2025-09-25 09:34:32

Into the Void: Understanding Online Health Information in Low-Web Data Languages
Hellina Hailu Nigatu, Nuredin Ali Abdelkadir, Fiker Tewelde, Stevie Chancellor, Daricia Wilkinson
arxiv.org/abs/2509.20245

@arXiv_csIR_bot@mastoxiv.page
2025-10-02 09:49:01

Deep Learning-Based Approach for Improving Relational Aggregated Search
Sara Saad Soliman, Ahmed Younes, Islam Elkabani, Ashraf Elsayed
arxiv.org/abs/2510.00966

@philip@mastodon.mallegolhansen.com
2025-10-29 22:42:10

I just spent ~20 minutes trying to find information on how to do something in a Golang library on my client computer, before I decided it was worth pulling up @… on my personal computer to look it up instead.
Immediately found the answer I needed to get me moving.
We don’t need “vibe coding”, we need search engines that actually work.

@netzschleuder@social.skewed.de
2025-10-07 08:00:50

trec_web: TREC WT10g (2003)
A web graph network originally constructed in 2003 as a testbed for information-retrieval techniques, including web search engines. Distributed by University of Glasgow.
This network has 1601787 nodes and 8063026 edges.
Tags: Informational, Web graph, Unweighted
networks.skewed.de/net…

trec_web: TREC WT10g (2003). 1601787 nodes, 8063026 edges. https://networks.skewed.de/net/trec_web

@thomasfuchs@hachyderm.io
2025-10-05 16:02:11

Wanna feel old? Back in the now seemingly forever-ago days of 2021 we had an interlinked network of human-curated knowledge we called the world wide web and search engines that could pinpoint accurate information in fractions of a second

@arXiv_csCY_bot@mastoxiv.page
2025-09-25 07:59:12

DSA, AIA, and LLMs: Approaches to conceptualizing and auditing moderation in LLM-based chatbots across languages and interfaces in the electoral contexts
Natalia Stanusch, Raziye Buse Cetin, Salvatore Romano, Miazia Schueler, Meret Baumgartner, Bastian August, Alexandra Rosca
arxiv.org/abs/2509.19890

@arXiv_csCR_bot@mastoxiv.page
2025-10-07 08:41:42

Backdoor-Powered Prompt Injection Attacks Nullify Defense Methods
Yulin Chen, Haoran Li, Yuan Sui, Yangqiu Song, Bryan Hooi
arxiv.org/abs/2510.03705

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 09:44:21

Observation-Free Attacks on Online Learning to Rank
Sameep Chattopadhyay, Nikhil Karamchandani, Sharayu Mohair
arxiv.org/abs/2509.22855 arx…

@netzschleuder@social.skewed.de
2025-09-30 18:00:53

trec_web: TREC WT10g (2003)
A web graph network originally constructed in 2003 as a testbed for information-retrieval techniques, including web search engines. Distributed by University of Glasgow.
This network has 1601787 nodes and 8063026 edges.
Tags: Informational, Web graph, Unweighted
networks.skewed.de/net…

trec_web: TREC WT10g (2003). 1601787 nodes, 8063026 edges. https://networks.skewed.de/net/trec_web

@teledyn@mstdn.ca
2025-11-29 18:36:10

I figure even if it is someday possible for the transhumanists to upload themselves into a machine, it will be a nightmare attempting to later transplant themselves into the new improved updated wow machine that will someday replace it!
Folks who balk about Mastodon not letting you migrate your entire presence elsewhere likely have never tried to extricate their personal workflow from a 15-year-old laptop! 😅
That is, of course, AFTER previously spending an hour or so to weed out the about:config AI functions and add no-AI search engines to the fresh Firefox install 😞
Yes, I have contemplated installing Emacs and nothing else. Would seem the only sensible thing to do, but 'sensible' was never listed on my report card.

@arXiv_csIR_bot@mastoxiv.page
2025-09-30 07:58:14

How good are LLMs at Retrieving Documents in a Specific Domain?
Nafis Tanveer Islam, Zhiming Zhao
arxiv.org/abs/2509.22658 arxiv.org/pdf/25…

@netzschleuder@social.skewed.de
2025-09-30 01:00:54

trec_web: TREC WT10g (2003)
A web graph network originally constructed in 2003 as a testbed for information-retrieval techniques, including web search engines. Distributed by University of Glasgow.
This network has 1601787 nodes and 8063026 edges.
Tags: Informational, Web graph, Unweighted
networks.skewed.de/net…

trec_web: TREC WT10g (2003). 1601787 nodes, 8063026 edges. https://networks.skewed.de/net/trec_web

@netzschleuder@social.skewed.de
2025-10-26 04:00:48

trec_web: TREC WT10g (2003)
A web graph network originally constructed in 2003 as a testbed for information-retrieval techniques, including web search engines. Distributed by University of Glasgow.
This network has 1601787 nodes and 8063026 edges.
Tags: Informational, Web graph, Unweighted
networks.skewed.de/net…

trec_web: TREC WT10g (2003). 1601787 nodes, 8063026 edges. https://networks.skewed.de/net/trec_web