Tootfinder

No exact results. Similar results found.

@fanf@mendeddrum.org
2026-06-28 08:42:04

from my link log —
What data access pattern is as slow as possible?
https://blog.weineng.me/posts/slowest_add/
saved 2026-06-27 https://

Data Access Patterns That Makes Your CPU Really Angry
Given an array of data, what is the slowest way to sum up the integers? Is it adding the numbers from left to right, adding them randomly, or doing something else? In this post, we are going to build a data access pattern from the ground up that sums numbers as slowly as possible by exploiting memory pitfalls. uint32_t* data = ...; // sequential data[0] + data[1] + data[2] + ... // random data[67] + data[69420] + data[42] + ... // the slowest data[A] + data[B] + data[C] + ... Spoiler: You can d…

@frankel@mastodon.top
2026-06-14 17:06:55

'Never use double for money' is dogma, not engineering.
Guest author Stefano Fago breaks down when double, BigDecimal, or fixed-point is the right call, and the production traps that quietly undo each one.
https://blog.frankel.ch/bigdecimal-vs-double/

double, BigDecimal, or Fixed-Point?
There is an evergreen debate in the Java world: should you always use BigDecimal for money? The short answer is no. The real answer is: it depends on your computational context: the precision you need, the rounding rules you must follow, and the performance budget you have. The problem is that this conversation is often driven by dogma rather than engineering.

@arXiv_csPF_bot@mastoxiv.page
2026-06-11 07:42:10

The Brain That Goes Quiet: Serving a Large Model's Knowledge at 131 Tokens per Second on an 8 GB Laptop by Removing the Large Model from the Runtime Path
Myeong Jun Jo
https://arxiv.org/abs/2606.12154 https://arxiv.org/pdf/2606.12154 https://arxiv.org/html/2606.12154
arXiv:2606.12154v1 Announce Type: new
Abstract: In earlier work I showed that a 35B-class Mixture-of-Experts model can be loaded and executed on a consumer laptop with 8 GB of GPU memory. That result solved a placement problem and immediately exposed a different one: even correctly placed, the large model needed roughly four seconds to answer, because it was still being invoked at every query. This paper documents what happened when I stopped invoking it. During an offline phase, the large model reads source documents and writes verified answer entries into a structured knowledge store; at runtime, only a lightweight router, a deterministic renderer, and a 1B-class model are active. On the same 8 GB laptop, end-to-end response time fell from approximately 4,465 ms to 518 ms, effective end-to-end throughput rose from 15.7 to 131 tokens per second, and the small model's streaming decode rate held at 226-237 tokens per second with a time-to-first-token of 29-62 ms. The bottleneck is structural: three different large models (Qwen, Gemma, and GLM class) all showed the same multi-second runtime cost, and all three produced usable knowledge stores offline. On a 563-entry store built from seventeen real documents, keyword routing collapsed to 1.5% top-1 accuracy while BM25-based routing reached 92.8% (99.4% top-3), and a confidence gate raised effective top-1 to 98.0% by escalating 12.3% of queries. Exact-match fidelity of the small model ranged from 9/9 to 0/9 across envelope formats carrying identical content. A 16-case verification gate blocked all ten corrupted entries while admitting all six supported ones.
toXiv_bot_toot

@arXiv_csGR_bot@mastoxiv.page
2026-07-21 07:40:37

Packet-Loss Robust 3D Gaussian Compression via Atomic Packaging and GNN-based Error Concealment
Yuxuan Tao, Xuerui Ma, Hao Zhang, Chunhua Peng
https://arxiv.org/abs/2607.17916 https://arxiv.org/pdf/2607.17916 https://arxiv.org/html/2607.17916
arXiv:2607.17916v1 Announce Type: new
Abstract: 3D Gaussian Splatting (3DGS) and recent compression schemes such as HAC enable high-fidelity real-time neural rendering, but their bitstreams are fragile under packet loss during network streaming. Existing compression methods often separate correlated anchor attributes into independent streams, so losing one packet can create attribute-inconsistent broken anchors and severe rendering artifacts. We propose a packet-loss robust 3DGS transmission and error concealment framework. On the encoder side, anchor-level atomic packaging jointly encapsulates all attributes of each anchor, converting corrupted-attribute failures into clean missing-anchor erasures. Stratified random grouping further disperses packet losses across the spatial domain to avoid large contiguous voids. On the decoder side, we formulate recovery as prior-aware attribute inpainting. A Context-Aware Residual Interpolation (CARI) branch uses hash-grid prior predictions and neighboring residuals to build a robust baseline, while a lightweight two-layer graph neural network with cross-attention over hash-grid priors refines high-frequency attribute residuals. Attribute-wise confidence control falls back to interpolation when learned predictions are unreliable. Experiments under 20 percent random packet loss on BungeeNeRF, Mip-NeRF 360, and Tanks and Temples show that the proposed method substantially improves over no-concealment transmission and limits average PSNR degradation to about 3 dB relative to the lossless HAC reference.
toXiv_bot_toot

@mgorny@social.treehouse.systems
2026-05-24 05:14:07

Okay, so apparently there's been some "scuffle" between a cyclist and an old lady. The police's looking for the cyclist now, and shared a camera footage looking for help in finding them. Except that the footage is such a low resolution it's practically useless.
So helpful people from the internets used "#AI" to enhance it. So now we're looking at an angry mob looking for a person whose face was generated by an #LLM. Or well, multiple independently generated different faces apparently, but would that stop a mob from lynching a random person?
This fucking crap needs to be outlawed immediately. And whoever's selling it should end up behind bars.
#NoAI #NoLLM

@pre@boing.world
2026-05-08 15:30:32

In summary then, it is indeed quite like being at school. Half hour lessons on things that probably won't ever actually be useful to know in your particular job of varying levels of interest. Mostly pretty low interest honestly. Bumping into colleagues between lessons.
Learned the names of a couple of tools I might try. One google search would have gotten me those but I guess it's a question of thinking to look for them.
If you can judge the mood of an industry from a random selection of talks from a single conference then the industry is very optimistic that they can make AI write a lot of software.
It seems to think this is likely to mean fewer programmers rather than there being more software meaning more workers.
It wasn't as AI heavy as I thought when I first glanced at the program. Managed to mostly be not-ai I think.
Nobody talking about the ethical implications or suggesting joining a union and only one talk about the environment issue at all, it not really noting how much power the industry is about to take.
Liked having a few meals in amserdam with colleagues I never usually see (mostly remote workers, including me). The boss is pretty good at picking people really.
Get a day or so of holiday now too.
#devWorld

Tootfinder

Opt-in global Mastodon full text search. Join the index!