Welfarist Formulations for Diverse Similarity Search
Siddharth Barman, Nirjhar Das, Shivam Gupta, Kirankumar Shiragur
https://arxiv.org/abs/2602.08742 https://arxiv.org/pdf/2602.08742 https://arxiv.org/html/2602.08742
arXiv:2602.08742v1 Announce Type: new
Abstract: Nearest Neighbor Search (NNS) is a fundamental problem in data structures with wide-ranging applications, such as web search, recommendation systems, and, more recently, retrieval-augmented generations (RAG). In such recent applications, in addition to the relevance (similarity) of the returned neighbors, diversity among the neighbors is a central requirement. In this paper, we develop principled welfare-based formulations in NNS for realizing diversity across attributes. Our formulations are based on welfare functions -- from mathematical economics -- that satisfy central diversity (fairness) and relevance (economic efficiency) axioms. With a particular focus on Nash social welfare, we note that our welfare-based formulations provide objective functions that adaptively balance relevance and diversity in a query-dependent manner. Notably, such a balance was not present in the prior constraint-based approach, which forced a fixed level of diversity and optimized for relevance. In addition, our formulation provides a parametric way to control the trade-off between relevance and diversity, providing practitioners with flexibility to tailor search results to task-specific requirements. We develop efficient nearest neighbor algorithms with provable guarantees for the welfare-based objectives. Notably, our algorithm can be applied on top of any standard ANN method (i.e., use standard ANN method as a subroutine) to efficiently find neighbors that approximately maximize our welfare-based objectives. Experimental results demonstrate that our approach is practical and substantially improves diversity while maintaining high relevance of the retrieved neighbors.
toXiv_bot_toot
How Craigslist has stayed relevant for users as a place to find jobs, housing, and personal connections without relying on algorithmic feeds or public profiles (Jennifer Swann/Wired)
https://www.wired.com/story/is-craigslist-the-last-real-place-on-the-internet/…
Tom Mulcair: "Trump doesn't get the joke about the ‘Donroe Doctrine’”
.... but does Tom get the joke about his relevance to anything at all?
#canPoli #CdnPoli
https://www.ctvnews.ca/politics/article/tom-mulcair-trump-doesnt-get-the-joke-about-the-donroe-doctrine/?utm_source=flipboard&utm_medium=activitypub
Der einzig relevante Jahresrückblick kommt natürlich von @…! 😁
Mein Podcast-Jahr 2025. #AntennaPodEcho
1. 11KM: der @…@…
How Craigslist has stayed relevant for users as a place to find jobs, housing, and personal connections without relying on algorithmic feeds or public profiles (Jennifer Swann/Wired)
https://www.wired.com/story/is-craigslist-the-last-real-place-on-the-internet/…
At a time when SEO is less relevant than ever, I'm receiving more SEO spam than ever.
Viele gucken nur auf die Ladezeit einer Website oder eines Webshops auf Nutzer-Ebene — also wie schnell die Seite für den Nutzer lädt. Fehler! 🙀
Für SEO ist es aber viel relevanter, wie schnell der Webserver auf Anfragen von Google und anderen Crawlern 🕷️ reagiert. Braucht das zu lange, kostet es Google mehr Ressourcen und Geld.
Die Seite ist dann weniger interessant und verliert Sichtbarkeit und Klicks aus der Google Suche.
Google Search Console > Einstellungen > Cra…
Polymarket partners with Singapore-based Kaito AI to launch "attention markets", letting users bet on "mindshare" and "sentiment" metrics from social media (Alicia Park/Forbes)
https://www.forbes.com/sites/aliciapark/20