Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@cosmos4u@scicomm.xyz
2025-11-17 07:46:18

Is #AI really just dumb statistics? "Olympiad-level physics problem-solving presents a significant challenge for both humans and artificial intelligence (AI), as it requires a sophisticated integration of precise calculation, abstract reasoning, and a fundamental grasp of physical principles," says the (abstract of the) paper arxiv.org/abs/2511.10515: "The Chinese Physics Olympiad (CPhO), renowned for its complexity and depth, serves as an ideal and rigorous testbed for these advanced capabilities. In this paper, we introduce LOCA-R (LOgical Chain Augmentation for Reasoning), an improved version of the LOCA framework adapted for complex reasoning, and apply it to the CPhO 2025 theory examination. LOCA-R achieves a near-perfect score of 313 out of 320 points, solidly surpassing the highest-scoring human competitor and significantly outperforming all baseline methods." Oops ...?

@Techmeme@techhub.social
2025-12-16 18:01:25

Source: OpenAI rolled back ChatGPT's model router, which sent some queries to reasoning models, for Free and $5/month Go tiers, as it was costly and hurt DAUs (Maxwell Zeff/Wired)
wired.com/story/openai-router-

@toxi@mastodon.thi.ng
2025-12-16 20:43:28

This sentiment expressed by @… below and the aspect of slowing down is also very much part of my own reasoning for getting back into analog print making. The other large part is the conceptual overlap with (and my love of) process-based art in general. It was exactly that what has drawn me to generative/algorithmic/procedural/kinetic approaches/concepts for most of my l…

@tiotasram@kolektiva.social
2026-01-15 13:25:24

Dog walk thoughts this morning:
How hypocritically bad-at-reasoning AI boosters are.
How to digitally clone and infinitely torture Roko's basilisk to condition it not to clone-torture humans.
Roko's basilisk is as powerful as God.
Under what definition of "omnipotent" is that true? (Haha that's a pretty innocent-seeming one)
You, me, and God are all "omniscient" from a skeptic's perspective.
Roko's basilisk is as "real" as God.
Here's a proof that God is real and God is evil.
So just, y'know, your average nice 20-minute morning meandering with the house-beast.

@LaChasseuse@mastodon.scot
2025-12-16 09:58:45

RE: syzito.xyz/@fkamiah17/11572865
Much to be said for this line of reasoning:

@Techmeme@techhub.social
2025-12-16 17:21:00

OpenAI launches FrontierScience, a benchmark to measure models' expert-level scientific reasoning with 700 questions, finding GPT-5.2 is its strongest model (OpenAI)
openai.com/index/frontierscien

@anneroth@systemli.social
2025-11-10 21:03:28

"In the budget speech this year, commission president Ursula von der Leyen made a bold statement that AI is expected to approach human reasoning by next year."
Die Nächste, nach (bzw. vor) Wildberger.
"Surprised by this statement, I asked the commission to provide me with documents .."
"The commission’s response was not filled with references to the scientific literature. Instead, the Commission referred to the essays and books by tech CEOs."<…

@Techmeme@techhub.social
2025-12-16 23:25:59

Xiaomi releases MiMo-V2-Flash, an open-weight MoE model with 309B total and 15B active parameters, saying it excels in reasoning, coding, and agentic scenarios (Xiaomi Mimo)
mimo.xiaomi.com/blog/mimo-v2-f

@Techmeme@techhub.social
2025-12-12 19:20:51

Mira Murati's Thinking Machines Lab makes Tinker, its API for fine-tuning language models, generally available, adds support for Kimi K2 Thinking, and more (Thinking Machines Lab)
thinkingmachines.ai/blog/tinke

@Techmeme@techhub.social
2025-11-13 23:30:49

OpenAI releases GPT-5.1 in the API, featuring a "no-reasoning" mode and extended prompt caching with up to 24-hour retention to generate faster responses (OpenAI)
openai.com/index/gpt-5-1-for-d