Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csDS_bot@mastoxiv.page
2026-02-10 11:10:06

Welfarist Formulations for Diverse Similarity Search
Siddharth Barman, Nirjhar Das, Shivam Gupta, Kirankumar Shiragur
arxiv.org/abs/2602.08742 arxiv.org/pdf/2602.08742 arxiv.org/html/2602.08742
arXiv:2602.08742v1 Announce Type: new
Abstract: Nearest Neighbor Search (NNS) is a fundamental problem in data structures with wide-ranging applications, such as web search, recommendation systems, and, more recently, retrieval-augmented generations (RAG). In such recent applications, in addition to the relevance (similarity) of the returned neighbors, diversity among the neighbors is a central requirement. In this paper, we develop principled welfare-based formulations in NNS for realizing diversity across attributes. Our formulations are based on welfare functions -- from mathematical economics -- that satisfy central diversity (fairness) and relevance (economic efficiency) axioms. With a particular focus on Nash social welfare, we note that our welfare-based formulations provide objective functions that adaptively balance relevance and diversity in a query-dependent manner. Notably, such a balance was not present in the prior constraint-based approach, which forced a fixed level of diversity and optimized for relevance. In addition, our formulation provides a parametric way to control the trade-off between relevance and diversity, providing practitioners with flexibility to tailor search results to task-specific requirements. We develop efficient nearest neighbor algorithms with provable guarantees for the welfare-based objectives. Notably, our algorithm can be applied on top of any standard ANN method (i.e., use standard ANN method as a subroutine) to efficiently find neighbors that approximately maximize our welfare-based objectives. Experimental results demonstrate that our approach is practical and substantially improves diversity while maintaining high relevance of the retrieved neighbors.
toXiv_bot_toot

@publicvoit@graz.social
2026-01-08 09:28:11

Im Geiste der aktuellen Bestrebungen, sich von ausbeuterischen Großkonzernen frei zu machen (#DID, #DUD), möchte ich auf meinen Artikel hinweisen, wo ich erkläre, weshalb man bei der Auswahl, wo man sich im Internet einbringt, genauer hinschauen soll.
Don't Contribute Anything Relevant in

@Techmeme@techhub.social
2026-01-09 10:20:57

How Craigslist has stayed relevant for users as a place to find jobs, housing, and personal connections without relying on algorithmic feeds or public profiles (Jennifer Swann/Wired)
wired.com/story/is-craigslist-

@tante@tldr.nettime.org
2026-03-10 08:26:41

Weil die "KI" Unternehmen alle Hardware kaufen, wird der "du bekommst nur noch ein dummes Terminal und musst alle relevante Infrastruktur mieten" Trend noch weiter angeheizt. Hardware wird nicht nur teuer, sie wird komplett unzugänglich. Da hilft uns dann auch keine Open Source Software mehr.

@chris@mstdn.chrisalemany.ca
2026-01-09 16:29:35

Tom Mulcair: "Trump doesn't get the joke about the ‘Donroe Doctrine’”
.... but does Tom get the joke about his relevance to anything at all?
#canPoli #CdnPoli
ctvnews.ca/politics/article/to

@der_raddler@dresden.network
2025-12-10 18:09:40

Der einzig relevante Jahresrückblick kommt natürlich von @…! 😁
Mein Podcast-Jahr 2025. #AntennaPodEcho
1. 11KM: der @…@…

@Mediagazer@mstdn.social
2026-01-09 20:06:05

How Craigslist has stayed relevant for users as a place to find jobs, housing, and personal connections without relying on algorithmic feeds or public profiles (Jennifer Swann/Wired)
wired.com/story/is-craigslist-

@andycarolan@social.lol
2026-03-09 09:26:51

At a time when SEO is less relevant than ever, I'm receiving more SEO spam than ever.

@benny@norden.social
2026-01-10 16:05:02

Viele gucken nur auf die Ladezeit einer Website oder eines Webshops auf Nutzer-Ebene — also wie schnell die Seite für den Nutzer lädt. Fehler! 🙀
Für SEO ist es aber viel relevanter, wie schnell der Webserver auf Anfragen von Google und anderen Crawlern 🕷️ reagiert. Braucht das zu lange, kostet es Google mehr Ressourcen und Geld.
Die Seite ist dann weniger interessant und verliert Sichtbarkeit und Klicks aus der Google Suche.
Google Search Console > Einstellungen > Cra…

@Techmeme@techhub.social
2026-02-10 13:35:59

Polymarket partners with Singapore-based Kaito AI to launch "attention markets", letting users bet on "mindshare" and "sentiment" metrics from social media (Alicia Park/Forbes)
forbes.com/sites/aliciapark/20