Tootfinder

@mlawton@mstdn.social
2026-03-17 14:46:52

This makes me so twitchy. This patient status page, served over HTTP and not HTTPS, has the credentials as query parameters. Such shocking op sec in a healthcare environment, both as a deployed solution and a commercial product.
We know the username, have a head start on the password (with a good idea of the encoding), and the presence of a “user privileges” tab [not pictured] suggests the account has more permissions than necessary.
Dear god. 🤦

A waiting room patient status page displaying a URL with a username and obscured password as query parameters.

@toxi@mastodon.thi.ng
2026-04-10 18:48:56

tl;dr Using https://thi.ng/column-store to accelerate tag intersection queries by a factor of 880x...
Working on the static website generator/export plugin for my personal knowledge tool has been one of the main projects this past month. A key part of this setup is tagging, not just simple flat keywords/cate…

In-memory column store database with customizable column types, extensible query engine, bitfield indexing for query acceleration, JSON serialization with optional RLE compression

@fanf@mendeddrum.org
2026-02-09 15:42:04

from my link log —
JSONata: a JSON query and transformation language.
https://jsonata.org/
saved 2026-02-09 https://dotat.at/:/CEJOA.html

JSONata
JSONata: A declarative open-source query and transformation language for JSON data.

@Marwe@troet.cafe
2026-03-09 01:04:56

Diese zweiteilige Doku enthält einige Knaller:
Putins Netzwerk in Europa
Das konkrete Truppenangebot in Divisionsstärke für den bewaffneten Kampf von Separatisten in Europa war mir neu.
https://mediathekviewweb.de/#query=Putins Netzwerk in Europa

@karlauerbach@sfba.social
2026-03-14 23:12:04

General query: are you anticipating the return of 1974 gas lines and perhaps even/odd gasoline filling days?
(I am.)
By-the-way, when that was happening I would go fill my car at midnight - a time when it was ambiguous whether the day was an even one or an odd one.

@keen456@infosec.exchange
2026-03-08 02:18:26

@… Have you seen this story? https://www.phoronix.com/news/ATI-R300-Occlusion-Query-Fix Developer in Czechia working on fixing up R300…

Old ATI R300 Open-Source Driver Sees Another New Fix In 2026
The Radeon R300 series turns 24 years old this year and thanks to the open-source ATI R300 Gallium3D driver that began via reverse engineering, it's still continuing to see the occasional random fixes from the open-source community.

@arXiv_csOS_bot@mastoxiv.page
2026-02-04 07:41:57

ProphetKV: User-Query-Driven Selective Recomputation for Efficient KV Cache Reuse in Retrieval-Augmented Generation
Shihao Wang, Jiahao Chen, Yanqi Pan, Hao Huang, Yichen Hao, Xiangyu Zou, Wen Xia, Wentao Zhang, Haitao Wang, Junhong Li, Chongyang Qiu, Pengfei Wang
https://arxiv.org/abs/2602.02579 https://arxiv.org/pdf/2602.02579 https://arxiv.org/html/2602.02579
arXiv:2602.02579v1 Announce Type: new
Abstract: The prefill stage of long-context Retrieval-Augmented Generation (RAG) is severely bottlenecked by computational overhead. To mitigate this, recent methods assemble pre-calculated KV caches of retrieved RAG documents (by a user query) and reprocess selected tokens to recover cross-attention between these pre-calculated KV caches. However, we identify a fundamental "crowding-out effect" in current token selection criteria: globally salient but user-query-irrelevant tokens saturate the limited recomputation budget, displacing the tokens truly essential for answering the user query and degrading inference accuracy.
We propose ProphetKV, a user-query-driven KV Cache reuse method for RAG scenarios. ProphetKV dynamically prioritizes tokens based on their semantic relevance to the user query and employs a dual-stage recomputation pipeline to fuse layer-wise attention metrics into a high-utility set. By ensuring the recomputation budget is dedicated to bridging the informational gap between retrieved context and the user query, ProphetKV achieves high-fidelity attention recovery with minimal overhead. Our extensive evaluation results show that ProphetKV retains 96%-101% of full-prefill accuracy with only a 20% recomputation ratio, while achieving accuracy improvements of 8.8%-24.9% on RULER and 18.6%-50.9% on LongBench over the state-of-the-art approaches (e.g., CacheBlend, EPIC, and KVShare).
toXiv_bot_toot

@alejandrobdn@social.linux.pizza
2026-02-07 23:17:24

Trend in the volume of new questions on StackOverflow https://data.stackexchange.com/stackoverflow/query/1926661

@niqdanger@social.linux.pizza
2026-04-09 21:56:56

If you didn't know about this collection of shows, check it out. Aadam Jacobs collection at the Internet Archive. Seeing he was Chicago based I took the chance to search and sure enough, he has a few Troubled Hubble shows. Amazing. https://archive.org/details/@aadam_jac

@grahamperrin@bsd.cafe
2026-02-14 21:19:12

tuning(7) begins:
"The swap partition should typically be approximately 2x the size of main memory for systems with less than 4GB of RAM, or approximately equal to the size of main memory if you have more. "
I can't believe that 64 GB swap should be a norm for a system with 64 GB RAM.
<

@khalidabuhakmeh@mastodon.social
2026-02-26 16:17:20

Reading an article about how to optimize EF Queries. Honestly, step #0 should be to actually measure your query performance so that you have a baseline.
I've made the sin of just applying techniques without first measuring, and you can just end up making things worse, like waaaaaaaaay worse.
Folks, seriously, add some telemetry as the first step, then tackle each query one at a time.

@philip@mastodon.mallegolhansen.com
2026-02-03 02:43:58

Tonight I made a daiquiri.
And then I realized, the way it’s spelled looks a lot like a portmanteau of daj (Polish: Give) and query.
So maybe every time someone gives you a SQL query, that’s actually a daiquiri.

@kidehen@mastodon.social
2026-03-27 20:02:45

Live SPARQL Query Page Links:
[1] https://tinyurl.com/Query-Definition
[2] https://tinyurl.com/Query-Solution-Page

@roland@devdilettante.com
2026-03-09 15:37:49

anyquery works :-) on any CSV file on the internet, nice complement to datasette me thinks :-) e.g. to query thunderbird desktop february 2026 questions:
`anyquery> SELECT * FROM read_csv('https://raw.githubuse…

@michabbb@social.vivaldi.net
2026-02-28 14:06:45

🔧 Cost-based query optimizer with full EXPLAIN / EXPLAIN ANALYZE support and table statistics via ANALYZE
📦 100 built-in functions across string, math, date/time, JSON and aggregate categories – batteries fully included
🛠️ Simple integration via Cargo with a single dependency: stoolap = "0.1" – plus a CLI tool for REPL or direct query execution

@Techmeme@techhub.social
2026-03-31 16:21:11

Sources: Apple is testing letting Siri process multiple requests in a single query in iOS 27, and explored a Grammarly-like keyboard that expands autocorrect (Mark Gurman/Bloomberg)
https://www.bloomberg.com/news/articles/202…

@datascience@genomic.social
2026-03-08 11:00:01

Do you have a long running calculation freezing up your shiny app? {callr} or {crew} might help: https://discindo.org/post/asynchronous-execution-in-shiny/

Asynchronous background execution in Shiny using callr | Discindo
When designing Shiny applications we commonly associate asynchronous execution with multiple concurrent running sessions of an application. In such cases, when one user has requested a longer computation or a database query, the other users have to wait for this task to finish before they can see their plots and tables.

@simon_brooke@mastodon.scot
2026-02-03 14:26:47

In #Clojure, if you query a set for a member, and that member is present, that member is returned:
user=> (#{:a :b :c} :a)
:a
The traditional #Lisp function ASSOC has the signature
(ASSOC store key) => value
where store is assumed to be a list of (key . value) dotted pai…

@shanmukhateja@social.linux.pizza
2026-03-10 15:53:50

Hey @… can I request attention towards:
https://bugs.kde.org/show_bug.cgi?id=515271
I like

@gadgetboy@gadgetboy.social
2026-04-08 10:33:19

I found a solid iOS client for using my LM Studio-hosted models.
The Web Agent is interesting - launches Google in an in-app browser, parses the SERP, and delivers the results of your query back in the chat window - all while you watch what it's doing.
Qwen 3.5 35b performs well with these tasks, even if it's a little slow for interactive tasks on my hardware.
Find the app here:

Pie Studio App - App Store
Download Pie Studio by Jan Stellmann on the App Store. See screenshots, ratings and reviews, user tips, and more apps like Pie Studio.

@stefan@gardenstate.social
2026-04-03 20:56:27

Anyone know a reason to not find accounts that use a signup IP that is a known Tor exit node as a signal of being a bot?
I'm looking at IPs by doing: Query: <reversed-ip>.dnsel.torproject.org − resolves to 127.0.0.2 if it's a Tor exit.
#mastoadmin

@daniel@social.telemetrydeck.com
2026-02-02 19:20:45

We asked 6 Druid Historical Calculation servers if they could calculate 5 years of data at once for a very complicated multi-stage query. Their answer will surprise you!

The servers in question being absolutely fully CPU constrained

@awinkler@openbiblio.social
2026-02-06 09:43:30

die @… lässt sich aus aus OpenRefine heraus ansteuern. Bsp.: Ich habe eine GND-ID einer Person und will wissen, ob die DDB zu der Person Material hat: Edit column > Add column by fetching URLs > dann als GREL "

@toxi@mastodon.thi.ng
2026-02-12 17:13:46

#ReleaseThursday 🎉 Just pushed a new version of the https://thi.ng/column-store database and query engine which adds support for new column types (fixed-size n-dimensional int/uint/float vectors) and RLE (run-…

In-memory column store database with indexing (WIP)

@nobodyinperson@fosstodon.org
2026-03-06 09:34:47

TIL that "compositing" has slowed down my #xfce since forever. I don't see any change after disabling it, just that everything is a lot faster 😅
xfconf-query -c xfwm4 -p /general/use_compositing -s false

@aral@mastodon.ar.al
2026-03-24 15:34:29

Wait, the for-profit not-for-profit privacy champion funded by half-a-billion dollars a year from Google is doing this? Do you think their then-head of public policy was actually telling the truth when she told me “we’re just another Silicon Valley tech company?” No, couldn’t possibly be. I’m sure there are some benefits of the doubt we could still dust off and send their way.
#Mozilla

@javi

Firefox updated their Terms of Use? Let's see!

As you type a search query within Firefox, Firefox offers search suggestions to provide you with faster and more direct access to what you’re looking for. Some of the search suggestions come from your search provider (“Search Suggestions”). Others come …

@almad@fosstodon.org
2026-03-09 18:16:52

Wait Sama started talking Trump now, or was it always the case?
> Water is totally fake. It used to be true, we used to do evaporative cooling in data centres, but now that we don’t do that, you see these things on the internet where, don’t use ChatGPT, it’s 17 gallons of water for each query or whatever, this is completely untrue. Totally insane. No connection to reality.

How much water do the data centres use? It’s a secret
The AI companies insist: we barely use water, hardly a drop! But we won’t tell you how much water we use. And we’ll take you to court to stop you from finding out. Google wants to build a new data …

@me@mastodon.peterjanes.ca
2026-03-08 04:59:14

Semi-regular query to see if anyone else remembers CBC Vancouver's "The Dog and Trombone", written by Jurgen Gothe and Bill Phillips; and, more to the point, if they've got recordings of it, especially episode 5 featuring Hap Hafner's cousin Hugo?
#CBCRadio #JurgenGothe

@grahamperrin@bsd.cafe
2026-02-14 09:35:49

<https://github.com/freebsd/freebsd-src/blob/main/CONTRIBUTING.md#style> mentions the one-sentence-per-line rule for manual pages, however:
a) there's no such rule in mdoc(7) <

@ruario@vivaldi.net
2026-01-30 09:49:50

@… I also think that as a stop gap it would help if both proects offered some official API to query what the latest version is and perhaps offered RSS/ATOM feeds of updates so that knowledgeable users could subscribe to that and be notified right away, direct from the source.

@socallinuxexpo@social.linux.pizza
2026-02-28 18:40:01

Elizabeth Christensen, Devrim Gunduz, Ryan Booz will speak on 'Postgres Query Tuning - Hour 6 of Postgres Training Day' as part of our PostgreSQL@SCaLE track at SCaLE 23x. Full details: https://www.socallinuxexpo.org/scale/23x

SCALE 23x | SCALE
The Southern California Linux Expo (SCALE) is North America’s largest community-run open source conference.

@thomasfuchs@hachyderm.io
2026-01-23 15:17:59

RE: https://hachyderm.io/@thomasfuchs/115945071431971557
Also, yes you can connect to WiFi in Mac OS 9.2 with AirPort in a 27 year old iBook from 1999 and yes, iTunes can still query Gracenote for CD track titles.

@fanf@mendeddrum.org
2026-02-09 12:42:04

from my link log —
AEQuery: Apple Events command line query tool without AppleScript.
https://markalldritt.com/?p=1368
saved 2026-02-08 https://dotat.a…

AEQuery
AEQuery I’ve released a new command-line tool called AEQuery. It queries scriptable macOS applications using XPath-like expressions, translating them directly into Apple Events. The short version: …

@frankel@mastodon.top
2026-03-27 17:04:58

#cq: #StackOverflow for #Agents
https://

cq: Stack Overflow for Agents
cq explores a Stack Overflow for agents, a shared commons where agents can query past learnings, contribute new knowledge, and avoid repeating the same mistakes in isolation.

@gray17@mastodon.social
2026-02-28 20:25:08

today's rabbithole: scroll shadows with just css.
- background-attachment - well-established hack, works ok, but tricky to do in some layouts (eg sticky footer)
- container query scroll-state stuck - chrome only for now and kinda buggy
- scroll animation - works ok in chrome and safari. not yet firefox without flag. awkward if you want a smooth fade at a particular closeness.
leaning toward scroll animation, because it feels like the most natural way to express it

@jkmartindale@mastodon.social
2026-02-05 03:56:00

ah yes, Japan comes from Japan

Google Direct Answer resulting from the query "japan etymology". It shows an arrow between "Japan" and "japan", with the latter labeled "late 17th century". Underneath the graph: "late 17th century: from Japan."

@Mediagazer@mstdn.social
2026-01-26 14:20:40

Source: OpenAI targets ~$60 per 1,000 views for ChatGPT ads, on par with live NFL broadcasts and above Meta's sub-$20 CPM, while offering little conversion data (The Information)
https://www.theinformation.com/articles/openai-seeks-premium-prices-early-…

OpenAI Seeks Premium Prices in Early Ads Push
In its initial rollout of ads, OpenAI is charging prices that rival those for coveted video programs like the NFL, and well above what rivals such as Meta Platforms’ social media apps charge. But unlike Meta or Google, OpenAI won’t be providing detailed information about the query responses ...

@arXiv_csDS_bot@mastoxiv.page
2026-02-04 07:39:24

ZOR filters: fast and smaller than fuse filters
Antoine Limasset
https://arxiv.org/abs/2602.03525 https://arxiv.org/pdf/2602.03525 https://arxiv.org/html/2602.03525
arXiv:2602.03525v1 Announce Type: new
Abstract: Probabilistic membership filters support fast approximate membership queries with a controlled false-positive probability $\varepsilon$ and are widely used across storage, analytics, networking, and bioinformatics \cite{chang2008bigtable,dayan2018optimalbloom,broder2004network,harris2020improved,marchet2023scalable,chikhi2025logan,hernandez2025reindeer2}. In the static setting, state-of-the-art designs such as XOR and fuse filters achieve low overhead and very fast queries, but their peeling-based construction succeeds only with high probability, which complicates deterministic builds \cite{graf2020xor,graf2022binary,ulrich2023taxor}.
We introduce \emph{ZOR filters}, a deterministic continuation of XOR/fuse filters that guarantees construction termination while preserving the same XOR-based query mechanism. ZOR replaces restart-on-failure with deterministic peeling that abandons a small fraction of keys, and restores false-positive-only semantics by storing the remainder in a compact auxiliary structure. In our experiments, the abandoned fraction drops below $1\%$ for moderate arity (e.g., $N\ge 5$), so the auxiliary handles a negligible fraction of keys. As a result, ZOR filters can achieve overhead within $1\%$ of the information-theoretic lower bound $\log_2(1/\varepsilon)$ while retaining fuse-like query performance; the additional cost is concentrated on negative queries due to the auxiliary check. Our current prototype builds several-fold slower than highly optimized fuse builders because it maintains explicit incidence information during deterministic peeling; closing this optimisation gap is an engineering target.
toXiv_bot_toot

@NFL@darktundra.xyz
2026-03-30 01:39:38

Eagles GM offers same reply to every Brown query https://www.espn.com/nfl/story/_/id/48343831/eagles-gm-howie-roseman-fields-questions-aj-brown-reiterates-aj-brown-member-eagles

Eagles GM Howie Roseman: 'A.J. Brown is a member of the Eagles' - ESPN
Eagles general manager Howie Roseman had a stock answer for every A.J. Brown question he fielded from the local media at the league meetings Sunday, opting for a neutral response amid the ongoing trade speculation.

@paulwermer@sfba.social
2026-03-27 15:25:58

Any thoughts on search options?
The ad driven postings taking me to on-line but not local stores , treating every search as if I'm looking for something to buy, the return of multiple related items when I'm searching for a specific item (down to part number in the query) are making search far less useful than the printed options of my youth.

@PaulWermer@sfba.social
2026-03-27 15:25:58

@arXiv_csLG_bot@mastoxiv.page
2026-02-25 10:45:01

Statistical Query Lower Bounds for Smoothed Agnostic Learning
Ilias Diakonikolas, Daniel M. Kane
https://arxiv.org/abs/2602.21191 https://arxiv.org/pdf/2602.21191 https://arxiv.org/html/2602.21191
arXiv:2602.21191v1 Announce Type: new
Abstract: We study the complexity of smoothed agnostic learning, recently introduced by~\cite{CKKMS24}, in which the learner competes with the best classifier in a target class under slight Gaussian perturbations of the inputs. Specifically, we focus on the prototypical task of agnostically learning halfspaces under subgaussian distributions in the smoothed model. The best known upper bound for this problem relies on $L_1$-polynomial regression and has complexity $d^{\tilde{O}(1/\sigma^2) \log(1/\epsilon)}$, where $\sigma$ is the smoothing parameter and $\epsilon$ is the excess error. Our main result is a Statistical Query (SQ) lower bound providing formal evidence that this upper bound is close to best possible. In more detail, we show that (even for Gaussian marginals) any SQ algorithm for smoothed agnostic learning of halfspaces requires complexity $d^{\Omega(1/\sigma^{2} \log(1/\epsilon))}$. This is the first non-trivial lower bound on the complexity of this task and nearly matches the known upper bound. Roughly speaking, we show that applying $L_1$-polynomial regression to a smoothed version of the function is essentially best possible. Our techniques involve finding a moment-matching hard distribution by way of linear programming duality. This dual program corresponds exactly to finding a low-degree approximating polynomial to the smoothed version of the target function (which turns out to be the same condition required for the $L_1$-polynomial regression to work). Our explicit SQ lower bound then comes from proving lower bounds on this approximation degree for the class of halfspaces.
toXiv_bot_toot

@michabbb@social.vivaldi.net
2026-02-08 15:43:24

🔧 Configurable thresholds for scan warnings, query cost limits & small table optimization
🧩 Custom analyser support via QueryAnalyser interface for PostgreSQL or other databases
🎯 Works with #PHPUnit setUp() & #Pest beforeEach() including "paranoid mode"
📦 compos…

@Marwe@troet.cafe
2026-03-09 01:04:56

@fanf@mendeddrum.org
2026-03-23 18:42:04

from my link log —
PostgreSQL query cancellation / Ctrl-C in psql is insecure.
https://neon.com/blog/ctrl-c-in-psql-gives-me-the-heebie-jeebies
saved 2026-03-23

Ctrl-C in psql gives me the heebie-jeebies - Neon
There are a few different reasons to hit the brakes on a Postgres query. Maybe it’s taking too long to finish. Maybe you realised you forgot to create an index that will make it orders of magnitude quicker. Maybe there’s some reason the results are no longer needed. Or maybe you, or your LLM buddy, […]

@Techmeme@techhub.social
2026-01-26 14:30:49

@seav@en.osm.town
2026-02-28 13:50:26

Is it just me or is the #Wikidata Query Service quite flaky as of late? When using the public API, I sporadically get HTTP 504 (upstream timeout) errors.

@arXiv_csDB_bot@mastoxiv.page
2026-02-26 09:36:00

Quantum Computing for Query Containment of Conjunctive Queries
Luisa Gerlach, Tobias K\"oppl, Ren\`e Zander, Nicole Schweikardt, Stefanie Scherzinger
https://arxiv.org/abs/2602.21803

@datascience@genomic.social
2026-02-27 11:00:01

Polars is a lightning fast DataFrame library/in-memory query engine with parallel execution and cache efficiency. And now you can use is with the tidyverse syntax: #rstats

More Efficient Tidyverse Code, Using Polars in the Background
Polars is a cross-language tool for manipulating very large data. However, one drawback is that the R implementation has a syntax that will look odd to many R users who are not used to Python syntax. The objective of tidypolars is to improve the ease-of-use of Polars in R by providing tidyverse syntax to polars.

@michabbb@social.vivaldi.net
2026-04-11 09:03:57

⚙️ Define fields once in a Repository — #LaravelRestify auto-generates:
✅ Paginated REST endpoints with filtering & sorting
✅ MCP tool definitions with input schemas
✅ #LaravelSanctum auth protecting both
interfaces equally
🔍 Powerful query capabili…

@toxi@mastodon.thi.ng
2026-03-01 16:59:41

#ReleaseSunday 🎉 Quite a few https://thi.ng/column-store updates over the past month, including further performance optimizations, more tests and documentation updates...
Just also added a small section an…

Screenshot excerpt from the package readme, incl. a diagram illustrating query behavior. Direct link to this section: https://github.com/thi-ng/umbrella/blob/develop/packages/column-store/README.md#optimized-row-iteration

@grahamperrin@bsd.cafe
2026-03-07 02:52:15

@… I guess, you mean font size in the virtual terminal (vt) when you're not using MATE.
screen.font
– in vt(4) examples and in loader.conf(5).
<http…

@arXiv_csDS_bot@mastoxiv.page
2026-02-10 10:09:16

Prune, Don't Rebuild: Efficiently Tuning $\alpha$-Reachable Graphs for Nearest Neighbor Search
Tian Zhang, Ashwin Padaki, Jiaming Liang, Zack Ives, Erik Waingarten
https://arxiv.org/abs/2602.08097 https://arxiv.org/pdf/2602.08097 https://arxiv.org/html/2602.08097
arXiv:2602.08097v1 Announce Type: new
Abstract: Vector similarity search is an essential primitive in modern AI and ML applications. Most vector databases adopt graph-based approximate nearest neighbor (ANN) search algorithms, such as DiskANN (Subramanya et al., 2019), which have demonstrated state-of-the-art empirical performance. DiskANN's graph construction is governed by a reachability parameter $\alpha$, which gives a trade-off between construction time, query time, and accuracy. However, adaptively tuning this trade-off typically requires rebuilding the index for different $\alpha$ values, which is prohibitive at scale. In this work, we propose RP-Tuning, an efficient post-hoc routine, based on DiskANN's pruning step, to adjust the $\alpha$ parameter without reconstructing the full index. Within the $\alpha$-reachability framework of prior theoretical works (Indyk and Xu, 2023; Gollapudi et al., 2025), we prove that pruning an initially $\alpha$-reachable graph with RP-Tuning preserves worst-case reachability guarantees in general metrics and improved guarantees in Euclidean metrics. Empirically, we show that RP-Tuning accelerates DiskANN tuning on four public datasets by up to $43\times$ with negligible overhead.
toXiv_bot_toot

@michabbb@social.vivaldi.net
2026-02-08 15:43:24

📊 EXPLAIN-based index analysis detects full table scans, missing indexes, filesort & temporary tables on #MySQL, #MariaDB & #SQLite
🔄 Duplicate query detection finds identical queries…

@fanf@mendeddrum.org
2026-03-27 18:42:03

from my link log —
jsongrep is faster than {jq, jmespath, jsonpath-rust, jql}
https://micahkepe.com/blog/jsongrep/
saved 2026-03-27 https://dotat.a…

jsongrep is faster than {jq, jmespath, jsonpath-rust, jql}
An introduction to the jsongrep tool, a technical explanation of its DFA-based search engine, and performance results against popular JSON query tools.

@arXiv_csDS_bot@mastoxiv.page
2026-02-10 11:10:06

Welfarist Formulations for Diverse Similarity Search
Siddharth Barman, Nirjhar Das, Shivam Gupta, Kirankumar Shiragur
https://arxiv.org/abs/2602.08742 https://arxiv.org/pdf/2602.08742 https://arxiv.org/html/2602.08742
arXiv:2602.08742v1 Announce Type: new
Abstract: Nearest Neighbor Search (NNS) is a fundamental problem in data structures with wide-ranging applications, such as web search, recommendation systems, and, more recently, retrieval-augmented generations (RAG). In such recent applications, in addition to the relevance (similarity) of the returned neighbors, diversity among the neighbors is a central requirement. In this paper, we develop principled welfare-based formulations in NNS for realizing diversity across attributes. Our formulations are based on welfare functions -- from mathematical economics -- that satisfy central diversity (fairness) and relevance (economic efficiency) axioms. With a particular focus on Nash social welfare, we note that our welfare-based formulations provide objective functions that adaptively balance relevance and diversity in a query-dependent manner. Notably, such a balance was not present in the prior constraint-based approach, which forced a fixed level of diversity and optimized for relevance. In addition, our formulation provides a parametric way to control the trade-off between relevance and diversity, providing practitioners with flexibility to tailor search results to task-specific requirements. We develop efficient nearest neighbor algorithms with provable guarantees for the welfare-based objectives. Notably, our algorithm can be applied on top of any standard ANN method (i.e., use standard ANN method as a subroutine) to efficiently find neighbors that approximately maximize our welfare-based objectives. Experimental results demonstrate that our approach is practical and substantially improves diversity while maintaining high relevance of the retrieved neighbors.
toXiv_bot_toot

@kidehen@mastodon.social
2026-03-27 20:00:34

This feature provides syntax-level compatibility with the SERVICE wikibase:label extension, enabling queries written for the Wikidata Query Service to run more seamlessly on Virtuoso.
In practical terms, developers can reuse existing SPARQL queries that rely on SERVICE wikibase:label—without needing to rewrite label-handling logic—while benefiting from Virtuoso’s performance, flexibility, and deployment options.

@fanf@mendeddrum.org
2026-03-24 21:42:03

from my link log —
When upserts don't update but still write: debugging PostgreSQL WAL activity.
https://www.datadoghq.com/blog/engineering/debugging-postgres-performance/
saved 2026-03-24

When upserts don't update but still write: Debugging Postgres performance at scale | Datadog
When a high-volume upsert doubled disk writes, Datadog engineers traced the issue to Postgres WAL behavior and rewrote the query to eliminate hidden costs.

@tomkalei@machteburch.social
2026-01-28 13:19:34

OK, but the results are still just paper links, not what the LLM thinks about them. So the only thing that changed is that it is much much slower now?
Maybe the LLM just formulates a query to the old scholar? Too many layers of "oh how can we build an LLM into this"...
OK, nobody in #math uses Google Scholar like this anyway. We only use it look at publication records of people and do reverse citation search (which papers cite this one I have).

@michabbb@social.vivaldi.net
2026-01-24 00:28:07

🎯 Zero accuracy loss - preserves what matters: errors, anomalies, high-scoring items & query-relevant content using BM25/embedding similarity
✅ Full provider support: #OpenAI, #Anthropic, #Google

@arXiv_csDS_bot@mastoxiv.page
2026-02-04 01:36:45

Replaced article(s) found for cs.DS. https://arxiv.org/list/cs.DS/new
[1/1]:
- Optimal Hardness of Online Algorithms for Large Independent Sets
David Gamarnik, Eren C. K{\i}z{\i}lda\u{g}, Lutz Warnke
https://arxiv.org/abs/2504.11450 https://mastoxiv.page/@arXiv_csDS_bot/114346418465357434
- An Approximation Algorithm for Monotone Submodular Cost Allocation
Ryuhei Mizutani
https://arxiv.org/abs/2511.00470 https://mastoxiv.page/@arXiv_csDS_bot/115490466535056736
- Expected Cost of Greedy Online Facility Assignment on Regular Polygons (v3)
Md. Rawha Siddiqi Riad, Md. Tanzeem Rahat, Md. Manzurul Hasan
https://arxiv.org/abs/2512.00506 https://mastoxiv.page/@arXiv_csDS_bot/115648910775471187
- Nested and outlier embeddings into trees
Shuchi Chawla, Kristin Sheridan
https://arxiv.org/abs/2601.15470 https://mastoxiv.page/@arXiv_csDS_bot/115943420904659985
- Bankrupting DoS Attackers
Trisha Chakraborty, Abir Islam, Valerie King, Daniel Rayborn, Jared Saia, Maxwell Young
https://arxiv.org/abs/2205.08287
- An Algorithm for Fast and Correct Computation of Reeb Spaces for PL Bivariate Fields
Amit Chattopadhyay, Yashwanth Ramamurthi, Osamu Saeki
https://arxiv.org/abs/2403.06564 https://mastoxiv.page/@arXiv_csCG_bot/112081476174323525
- On Densest $k$-Subgraph Mining and Diagonal Loading: Optimization Landscape and Finite-Step Exact...
Qiheng Lu, Nicholas D. Sidiropoulos, Aritra Konar
https://arxiv.org/abs/2410.07388 https://mastoxiv.page/@arXiv_csSI_bot/113287589348257824
- A New Quantum Linear System Algorithm Beyond the Condition Number and Its Application to Solving ...
Jianqiang Li
https://arxiv.org/abs/2510.05588 https://mastoxiv.page/@arXiv_quantph_bot/115337999786748703
- On Purely Private Covariance Estimation
Tommaso d'Orsi, Gleb Novikov
https://arxiv.org/abs/2510.26717 https://mastoxiv.page/@arXiv_csLG_bot/115468358153466988
- The Query Complexity of Local Search in Rounds on General Graphs
Simina Br\^anzei, Ioannis Panageas, Dimitris Paparas
https://arxiv.org/abs/2601.13266 https://mastoxiv.page/@arXiv_csCC_bot/115932039505257286
toXiv_bot_toot

Tootfinder

Opt-in global Mastodon full text search. Join the index!