Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@aredridel@kolektiva.social
2026-04-14 14:22:42

So to follow up on this, I've caught it in action. Models, when quantized a bit, just do a bit more poorly with short contexts. Even going from f32 (as trained) to bf16 (as usually run) to q8 tends to do okay for "normal" context windows. And q4 you start feeling like "this model is a little stupid and gets stuck sometimes” (it is! It's just that it's still mostly careening about in the space of "plausible" most of the time. Not good guesswork, but still in the zone). With long contexts, the probability of parameters collapsing to zero are higher, so the more context the more likelihood you are to see brokenness.
And then at Q2 (2 bits per parameter) or Q1, the model falls apart completely. Parameters collapse to zero easily. You start seeing "all work and no play makes jack a dull boy” sorts of behavior, with intense and unscrutinized repetition, followed by a hard stop when it just stops working.
And quantization is a parameter that a model vendor can turn relatively easily. (they have to regenerate the model from the base with more quantization, but it's a data transformation on the order of running a terabyte through a straightforward and fast process, not like training).
If you have 1000 customers and enough equipment to handle the requests of 700, going from bf16 to q8 is a no-brainer. Suddenly you can handle the load and have a little spare capacity. They get worse results, probably pay the same per token (or they're on a subscription that hides the cost anyway so you are even freer to make trade-offs. There's a reason that subscription products are kinda poorly described.)
It's also possible for them to vary this across a day: use models during quieter periods? Maybe you get an instance running a bf16 quantization. If you use it during a high use period? You get a Q4 model.
Or intelligent routing is possible. No idea if anyone is doing this, but if they monitor what you send a bit, and you generally shoot for an expensive model for simple requests? They could totally substitute a highly quantized version of the model to answer the question.
There are •so many tricks• that can be pulled here. Some of them very reasonable to make, some of them treading into outright misleading or fraudulent, and it's weirdly hard to draw the line between them.

@Mediagazer@mstdn.social
2026-02-13 16:51:01

The Richmond Free Press, a 34-year-old Black-owned weekly, shuts down due to falling ad revenue as the Black press suffers from the dispersion of its readership (Scott Nover/Washington Post)

Opponents of VA Redistricting Amendment Send Out Mailer/Text Falsely
-- and Offensively!
Claiming That Fighting Trump’s Assault on Our Democracy Is
“Just Like Jim Crow”
Note that many of the most prominent supporters of this amendment
- Barack Obama, Louise Lucas, Don Scott, etc.
- are African American

@toxi@mastodon.thi.ng
2026-02-10 11:09:15

That time when Johnny Klimek (composer for Cloud Atlas and many others of Tom Tykwer films) and Dr. Motte (founder of the Berlin Loveparade) got together in 1996 to create their one-off project Holy Language...
Fireplace (original version, 9 minutes)
youtube.com/watch?v=DgL3DNn7OGE

Abstract cover art for the West Sound Circle CD showing a grid layout with the title a 2x2 grid of concentric circles (some with spikes) and a sidebar with logos and other abstract designs
@Techmeme@techhub.social
2026-04-06 22:25:40

Filing: Broadcom agrees to produce future versions of Google's TPUs and expands its Anthropic deal to give the startup access to ~3.5 GW of computing capacity (Jordan Novet/CNBC)
cnbc.com/2026/04/06/broadcom-a

@radioeinsmusicbot@mastodonapp.uk
2026-03-06 14:58:59

🇺🇦 Auf radioeins läuft...
Superspace:
🎵 Superspace Feeling (House Version)
#NowPlaying #Superspace
superspacemusic.bandcamp.com/t
open.spotify.com/track/4gVlkx1

@BBC6MusicBot@mastodonapp.uk
2026-04-13 09:57:06

🇺🇦 #NowPlaying on #BBC6Music's #LaurenLaverne
Al Green:
🎵 L-O-V-E
#AlGreen
a-cee.bandcamp.com/track/al-gr

@radioeinsmusicbot@mastodonapp.uk
2026-03-11 20:45:41

🇺🇦 Auf radioeins läuft...
Presley, Elvis:
🎵 You've Lost That Loving Feeling (EPiC Version)
#NowPlaying #Presley #Elvis
open.spotify.com/track/5zaT6hi

No official version of the proposal has been made available, but a summary released by Iran's Supreme National Security Council includes demands for the following:
The Strait of Hormuz to be reopened "under the co-ordination of the armed forces of Iran".

The war against "all components" of Iran's so-called Axis of Resistance to end.

US forces to withdraw from "all bases and points of deployment within the region".

@BBC6MusicBot@mastodonapp.uk
2026-04-13 21:34:28

🇺🇦 #NowPlaying on #BBC6Music's #RileyAndCoe
Al Green:
🎵 L-O-V-E
#AlGreen
a-cee.bandcamp.com/track/al-gr