Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@aredridel@kolektiva.social
2026-04-14 14:22:42

So to follow up on this, I've caught it in action. Models, when quantized a bit, just do a bit more poorly with short contexts. Even going from f32 (as trained) to bf16 (as usually run) to q8 tends to do okay for "normal" context windows. And q4 you start feeling like "this model is a little stupid and gets stuck sometimes” (it is! It's just that it's still mostly careening about in the space of "plausible" most of the time. Not good guesswork, but still in the zone). With long contexts, the probability of parameters collapsing to zero are higher, so the more context the more likelihood you are to see brokenness.
And then at Q2 (2 bits per parameter) or Q1, the model falls apart completely. Parameters collapse to zero easily. You start seeing "all work and no play makes jack a dull boy” sorts of behavior, with intense and unscrutinized repetition, followed by a hard stop when it just stops working.
And quantization is a parameter that a model vendor can turn relatively easily. (they have to regenerate the model from the base with more quantization, but it's a data transformation on the order of running a terabyte through a straightforward and fast process, not like training).
If you have 1000 customers and enough equipment to handle the requests of 700, going from bf16 to q8 is a no-brainer. Suddenly you can handle the load and have a little spare capacity. They get worse results, probably pay the same per token (or they're on a subscription that hides the cost anyway so you are even freer to make trade-offs. There's a reason that subscription products are kinda poorly described.)
It's also possible for them to vary this across a day: use models during quieter periods? Maybe you get an instance running a bf16 quantization. If you use it during a high use period? You get a Q4 model.
Or intelligent routing is possible. No idea if anyone is doing this, but if they monitor what you send a bit, and you generally shoot for an expensive model for simple requests? They could totally substitute a highly quantized version of the model to answer the question.
There are •so many tricks• that can be pulled here. Some of them very reasonable to make, some of them treading into outright misleading or fraudulent, and it's weirdly hard to draw the line between them.

@scott@carfree.city
2026-01-16 01:21:28

Tributes to the ficus marked for felling at Steiner and Waller 💚

Thick tree trunk with an orange emergency tree removal notice from the city, and a red notice handwritten, “Thank you dear trees for your selfless shade and quiet intimacy.” A pink flower sticks out from a heart-shaped hole in the paper.
Row of tall ficus trees with thick green canopy above the sidewalk.
“All trees go to heaven.” Pink paper on trunk under tree removal notice
Handwritten love notes hung on twine tied loosely around tree trunk. “I will miss the way your branches welcome me home on bike rides, lining the streets”
@simon_brooke@mastodon.scot
2026-03-14 08:21:26

And once again with #AltText4You.
Why is that one simple link so hard to click?

An image of a BlueSky post by Robert Reich @rbreich.bsky.social reading:

"Search-and-rescue crews were “flying blind" trying to rescue survivors from deadly tornadoes that hit the Midwest last week. 

Why? Because Kristi Noem hadn't approved FEMA's $200,000 contract with a tornado-tracking tool. 

But she had time to spend $220 million on anti-immigrant ads. Priorities."
@sauer_lauwarm@mastodon.social
2026-03-14 15:35:17

lets you train hard without feeling chaotic

@Techmeme@techhub.social
2026-04-13 20:20:45

Filing: Anthropic hired Ballard Partners, a lobbying firm with strong ties to Trump administration, days after DOD designated the company a supply chain risk (Bloomberg)
bloomberg.com/news/articles/20

Just as they’re gearing up for planting season, U.S. farmers already stretched by high input costs and low commodity prices are watching the price of fertilizer go up.
The recent conflict in Iran, the following closure of the Strait of Hormuz, and the resulting impacts to global markets are hitting farmers particularly hard right now.
The wholesale price of urea, the nitrogen input the U.S. imports the most of, had a high-low spread of $460–480 per short ton the week of Feb. 27, j…

@bourgwick@heads.social
2026-04-11 00:34:32

just heard the #baseball #radio broadcasters announce the astronauts' splashdown & feeling quite wholesome.

@haayman@todon.nl
2026-03-07 17:48:07

Die moet flink hard gereden hebben
oost.nl/nieuws/3626299/112-nie

Nieuwskop: Oudere vrouw aangereden door fietser in Deventer.
Begeleidende foto: auto die volledig in puin ligt. 

Waarschijnlijk hoort de foto bij een ander artikel op deze pagina

https://www.oost.nl/nieuws/3626299/112-nieuws-oudere-vrouw-aangereden-door-fietser-in-deventer

United Farm Workers co-founder Dolores Huerta went public with her own account of being raped by Cesar Chavez, following a NYT investigation.
Huerta said she kept the secret because, “I believed that exposing the truth would hurt the farmworker movement I have spent my entire life fighting for.”