2026-04-05 14:42:03
from my link log —
Roogle: a Rust API search engine.
https://github.com/roogle-rs/roogle?tab=readme-ov-file
saved 2026-04-05 https://
from my link log —
Roogle: a Rust API search engine.
https://github.com/roogle-rs/roogle?tab=readme-ov-file
saved 2026-04-05 https://
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
it's high time that such tools were developed: #wikicommons is a pain and mostly doesn't yield results that are in any sense representative of what commons actuall…
"Ecosia’s Purpose-Driven Climate Investment Reaches 250M Trees"
#Ecosia #Trees #Environment
I don't fully understand why firefox provides option to have ecosia as the default search engine in some limited countries only. Anyway, I now set up a plugin to do the same on my main computer.
https://blog.ecosia.org/ecosia-firefox-switch/
RE: https://mastodon.ie/@raymaccarthy/116517490459071198
Google search engine with udm=14 preset
a.k.a. udm14 <
Using your AI chatbot as a search engine? Be careful what you believe https://theconversation.com/using-your-ai-chatbot-as-a-search-engine-be-careful-what-you-believe-277616
I use linkding for my bookmarks and I tried to do a search using the term "linkding" and the search engine assumed I wanted to search for "LinkedIn" and in other news Fuck You search engine!
As soon as users log into Perplexity’s home page, trackers are downloaded onto their devices, giving Meta and Google full access to the conversations between them and Perplexity’s AI Machine search engine.
https://www.bloomberg.com/news/articles/20
Baidu plans to let users access OpenClaw via its search app and integrate OpenClaw's capabilities into its e-commerce business and other services (Evelyn Cheng/CNBC)
https://www.cnbc.com/2026/02/13/baidu-openclaw-ai-search-app-integratio…
Heute ist #SaferInternetDay und das wäre doch der perfekte Tag für die #DIDay Kampagne, um sich von #Ecosia zu verabschieden.
Eine Suchmaschine, die mit 'grüner KI' wirbt: Ganz schle…
DHS Wants a Single Search Engine to Flag Faces and Fingerprints Across Agencies (Dell Cameron/Wired)
https://www.wired.com/story/dhs-wants-a-single-search-engine-to-flag-faces-and-fingerprints-across-agencies/
http://www.memeorandum.com/260220/p88#a260220p88
Not seen https://www.ecosia.org before. This is really good, seem to source the results from the usual places so results seem pretty good.
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
AI bros are just loving open source — loving it to death... maybe quite literally! (Godot being latest popular example[1])
More and more projects are impacted by floods of bogus AI pull requests and resulting discussions, stealing precious time and nerves away from their maintainers doing actual productive work. More buggy and insecure software (incl. commercial offerings) due to slopcoding, more websites getting attacked daily by AI crawlers in desperate search for any new bits (liter…
What really makes me irate about how LLMs are marketed and sold is that if these companies instead spend their time and money to make highly specialized versions for them we could have amazing and actually helpful tools without the ick.
This could both work much better for many use cases (for example for correlating documents and giving a list of results like a search engine instead of tedious palaver) and they wouldn't need to steal data (Professor Bender calls it succinctly "datasets too large to care")[1].
But they're pursuing "AGI" (which is provenly impossible to do with LLMs) and endless growth.
[1] https://dair-community.social/@emilymbender/116109627131276897
Google rolls out its Veo video model globally within Google Ads, allowing advertisers to create 10-second videos for YouTube from up to three static images (Anu Adegbola/Search Engine Land)
https://searchengineland.com/google-brings-its…
Filing: OpenAI petitions the UK CMA to include AI chatbots with search function in Google's mandated default search engine choice screen for Chrome and Android (James Titcomb/Telegraph)
https://www.telegraph.co.uk/business/2026/03/23…
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
If operating systems had, built in, a quick fuzzy single-word spelling suggestion factilty, search engine traffic would halve overnight.
What's the best search engine these days for avoiding tracking and AI?
I've used Qwant for some time but now they are heading the feed with an AI "Flash Summary". Ecosia is worse.
#NoToAI
Sometimes it makes sense to act smart rather than brute-force.
For example, when Intel makes another #MKL release and you get version like "2026.0.0", and you need to figure out the remaining "-n" suffix for the .deb packages. And you really don't want to start a Debian container to figure that out.
Well, you could just keep brute-forcing until you find the right number. Or you can figure out that the index URL is #Gentoo
I started using SearXNG recently, which is a metasearch engine you can install locally to keep all your search data away from search engine companies.
https://docs.searxng.org/
There are also a bunch of public instances you can try out.
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
from my link log —
jsongrep is faster than {jq, jmespath, jsonpath-rust, jql}
https://micahkepe.com/blog/jsongrep/
saved 2026-03-27 https://dotat.a…
Google rolls out its Veo video model globally within Google Ads, allowing advertisers to create 10-second videos for YouTube from up to three static images (Anu Adegbola/Search Engine Land)
https://searchengineland.com/google-brings-its…
Q&A with CEO Jim Lanzone on Yahoo being "very profitable", its new AI search engine, focusing on sports content, including original video and podcasts, and more (Nilay Patel/The Verge)
https://www.theverge.com/podcast/895221/yahoo-jim-…
In der Juristerei wird das Aufkommen der LLMs begeistert gefeiert. Man versucht sich zu profilieren. Man ist vielleicht besorgt, dass die Stundensätze herunter gehen könnten. Aber sonst? Und dann diese Studie, die zeigt, dass bei Benutzung von LLMs die cognitive Kapazität und damit auch die Qualität dauernd nach unten zeigt. Kurz: Ein LLM-Anwalt bietet teure 0815-Soße, die man auch ohne Anwalt haben kann.
As someone who has long toiled in the harsh fields of OSS user support mailing lists…
Do not try to be “helpful” to people seeking advice in an expert forum of any sort by feeding their plea into a digester & providing an answer from its excrement.
It insults the seeker, implying that they don’t know how to type their question into a LLM-infested search engine.
It insults the actual experts who put their time & thought into such fora as a way to give back value …
An alle auf der Instanz social.tchncs.de: #HolosDiscover (#Suchmaschine
Replaced article(s) found for cs.LG. https://arxiv.org/list/cs.LG/new
[1/6]:
- Towards Attributions of Input Variables in a Coalition
Xinhao Zheng, Huiqi Deng, Quanshi Zhang
https://arxiv.org/abs/2309.13411
- Knee or ROC
Veronica Wendt, Jacob Steiner, Byunggu Yu, Caleb Kelly, Justin Kim
https://arxiv.org/abs/2401.07390
- Rethinking Disentanglement under Dependent Factors of Variation
Antonio Almud\'evar, Alfonso Ortega
https://arxiv.org/abs/2408.07016 https://mastoxiv.page/@arXiv_csLG_bot/112959235461894530
- Minibatch Optimal Transport and Perplexity Bound Estimation in Discrete Flow Matching
Etrit Haxholli, Yeti Z. Gurbuz, Ogul Can, Eli Waxman
https://arxiv.org/abs/2411.00759 https://mastoxiv.page/@arXiv_csLG_bot/113423933393275133
- Predicting Subway Passenger Flows under Incident Situation with Causality
Xiannan Huang, Shuhan Qiu, Quan Yuan, Chao Yang
https://arxiv.org/abs/2412.06871 https://mastoxiv.page/@arXiv_csLG_bot/113632934357523592
- Characterizing LLM Inference Energy-Performance Tradeoffs across Workloads and GPU Scaling
Paul Joe Maliakel, Shashikant Ilager, Ivona Brandic
https://arxiv.org/abs/2501.08219 https://mastoxiv.page/@arXiv_csLG_bot/113831081884570770
- Universality of Benign Overfitting in Binary Linear Classification
Ichiro Hashimoto, Stanislav Volgushev, Piotr Zwiernik
https://arxiv.org/abs/2501.10538 https://mastoxiv.page/@arXiv_csLG_bot/113872351652969955
- Safe Reinforcement Learning for Real-World Engine Control
Julian Bedei, Lucas Koch, Kevin Badalian, Alexander Winkler, Patrick Schaber, Jakob Andert
https://arxiv.org/abs/2501.16613 https://mastoxiv.page/@arXiv_csLG_bot/113910356206562660
- A Statistical Learning Perspective on Semi-dual Adversarial Neural Optimal Transport Solvers
Roman Tarasov, Petr Mokrov, Milena Gazdieva, Evgeny Burnaev, Alexander Korotin
https://arxiv.org/abs/2502.01310
- Improving the Convergence of Private Shuffled Gradient Methods with Public Data
Shuli Jiang, Pranay Sharma, Zhiwei Steven Wu, Gauri Joshi
https://arxiv.org/abs/2502.03652 https://mastoxiv.page/@arXiv_csLG_bot/113961314098841096
- Using the Path of Least Resistance to Explain Deep Networks
Sina Salek, Joseph Enguehard
https://arxiv.org/abs/2502.12108 https://mastoxiv.page/@arXiv_csLG_bot/114023706252106865
- Distributional Vision-Language Alignment by Cauchy-Schwarz Divergence
Wenzhe Yin, Zehao Xiao, Pan Zhou, Shujian Yu, Jiayi Shen, Jan-Jakob Sonke, Efstratios Gavves
https://arxiv.org/abs/2502.17028 https://mastoxiv.page/@arXiv_csLG_bot/114063477202397951
- Armijo Line-search Can Make (Stochastic) Gradient Descent Provably Faster
Sharan Vaswani, Reza Babanezhad
https://arxiv.org/abs/2503.00229 https://mastoxiv.page/@arXiv_csLG_bot/114103018985567633
- Semantic Parallelism: Redefining Efficient MoE Inference via Model-Data Co-Scheduling
Yan Li, Zhenyu Zhang, Zhengang Wang, Pengfei Chen, Pengfei Zheng
https://arxiv.org/abs/2503.04398 https://mastoxiv.page/@arXiv_csLG_bot/114120014622063602
- A Survey on Federated Fine-tuning of Large Language Models
Wu, Tian, Li, Sun, Tam, Zhou, Liao, Xiong, Guo, Li, Xu
https://arxiv.org/abs/2503.12016 https://mastoxiv.page/@arXiv_csLG_bot/114182234054681647
- Towards Trustworthy GUI Agents: A Survey
Yucheng Shi, Wenhao Yu, Jingyuan Huang, Wenlin Yao, Wenhu Chen, Ninghao Liu
https://arxiv.org/abs/2503.23434 https://mastoxiv.page/@arXiv_csLG_bot/114263024618476521
- CONTINA: Confidence Interval for Traffic Demand Prediction with Coverage Guarantee
Chao Yang, Xiannan Huang, Shuhan Qiu, Yan Cheng
https://arxiv.org/abs/2504.13961 https://mastoxiv.page/@arXiv_csLG_bot/114380404041503229
- Regularity and Stability Properties of Selective SSMs with Discontinuous Gating
Nikola Zubi\'c, Davide Scaramuzza
https://arxiv.org/abs/2505.11602 https://mastoxiv.page/@arXiv_csLG_bot/114538965060456498
- RECON: Robust symmetry discovery via Explicit Canonical Orientation Normalization
Alonso Urbano, David W. Romero, Max Zimmer, Sebastian Pokutta
https://arxiv.org/abs/2505.13289 https://mastoxiv.page/@arXiv_csLG_bot/114539124884913788
- RefLoRA: Refactored Low-Rank Adaptation for Efficient Fine-Tuning of Large Models
Yilang Zhang, Bingcong Li, Georgios B. Giannakis
https://arxiv.org/abs/2505.18877 https://mastoxiv.page/@arXiv_csLG_bot/114578778213033886
- SuperMAN: Interpretable and Expressive Networks over Temporally Sparse Heterogeneous Data
Bechler-Speicher, Zerio, Huri, Vestergaard, Gilad-Bachrach, Jess, Bhatt, Sazonovs
https://arxiv.org/abs/2505.19193 https://mastoxiv.page/@arXiv_csLG_bot/114578790124778172
toXiv_bot_toot
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
I tried the Quant search engine and looked myself up and its AI bullshit showed me so much wrong information I up on it.
Yes, you can disable the AI stuff, but I like that Mojeek doesn't do the AI slop and focuses on search and results.
My opinion on chatbot is starting to shape into it's final form:
It's a search engine that gives you only average results, and that is incapable of citing the source.
It's also capable of mixing in multiple averages to make one giant brown color.
And this is a direct consequence of the design: it's trained to generate generic credible text and that's what it will do.
Qdrant, which develops an open-source vector search engine for production AI systems, raised a $50M Series B led by AVP (Tamara Djurickovic/Tech.eu)
https://tech.eu/2026/03/12/qdrant-closes-50m-series-b-to-expand-vector-search-infrastructure/
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
Q&A with CEO Jim Lanzone on Yahoo being "very profitable", its new AI search engine, focusing on sports content, including original video and podcasts, and more (Nilay Patel/The Verge)
https://www.theverge.com/podcast/895221/yahoo-jim-…
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
Got an AI search engine response to a question about the membership of the International Criminal Court which then mentioned it includes bodies like the "Asian Cricket Council". Ah, I think I see what's happened here.
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
> #WebinarTV, a company that bills itself as “a search engine for the best #webinars,” is secretly scanning the internet for #Zoom meeting links, recording the calls, and turning them into #AI-generated #podcasts for profit.¹
cc @… @… @… @…
¹ #404Media @… Emanuel Maiberg, Mar 24, 2026
#privacy #Datenschutz #TeamDatenschutz #DiDay
@…
Thank you for the follow!
I wouldn't mind a more advanced search engine indexing posts on the fediverse.
Tootfinder is one way but let's hope an even more advanced way is found!
Edit: I moved HolosDiscover hashtag to the front! :)
#HolosDiscover
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted
cora: CORA citations (1998)
Citations among papers indexed by CORA, from 1998, an early computer science research paper search engine. If a paper i cites a paper j also in this data set, then a directed edge connects i to j. (Papers not in the data set are excluded.) Self-loops may be present. The dates of these snapshots are uncertain.
This network has 23166 nodes and 91500 edges.
Tags: Informational, Citation, Unweighted