Tootfinder

Opt-in global Mastodon full text search.

@Techmeme@techhub.social
2025-12-06 22:41:04

A profile of Byron Cook, a VP at Amazon who is leading the company's effort to reduce AI hallucinations with a feature called Automated Reasoning Checks (John Pavlus/Fast Company)
fastcompany.com/91446331/amazo

@Techmeme@techhub.social
2025-12-08 09:30:46

Google says Gemini 3 Pro sets new vision AI benchmark records, including in complex visual reasoning, beating Claude Opus 4.5 and GPT-5.1 in some categories (Rohan Doshi/The Keyword)
blog.google/technology/develop

@seeingwithsound@mas.to
2026-02-06 07:28:33

The impact of mental images on reasoning: a study on #aphantasia sciencedirect.com/science/arti mental imagery

@Techmeme@techhub.social
2026-01-08 20:45:46

OpenAI is rolling out a HIPAA-compliant version of ChatGPT for clinicians to assist with medical reasoning and administrative tasks, at Cedars-Sinai and others (Shirin Ghaffary/Bloomberg)
bloomberg.com/news/newsletters

@mia@hcommons.social
2025-12-05 16:06:49

I loved @jasonclark.bsky.social's #FF2025 'problem statement':
'We were given access to a weirdly confident intern that can’t explain how they got their answers.
It is a seamless interface that breaks search interaction retrieval patterns.'

@servelan@newsie.social
2026-01-05 18:28:10

'The problem here is not that Bongino is engaging in punditry. When properly done, pundits make arguments based on facts and reasoning. Bongino, by his own account, was doing something else entirely: He was telling his audience that the bombing was an inside job was a fact, when it was not only not true but also not based on any real circumstantial evidence.'
'Astonishing': Analyst stunned by Dan Bongino's jaw-dropping admission to Fox News viewers - Raw Story
rawstory.com/dan-bongino-26743

@arXiv_csGT_bot@mastoxiv.page
2025-12-08 08:18:30

Robust forecast aggregation via additional queries
Rafael Frongillo, Mary Monroe, Eric Neyman, Bo Waggoner
arxiv.org/abs/2512.05271 arxiv.org/pdf/2512.05271 arxiv.org/html/2512.05271
arXiv:2512.05271v1 Announce Type: new
Abstract: We study the problem of robust forecast aggregation: combining expert forecasts with provable accuracy guarantees compared to the best possible aggregation of the underlying information. Prior work shows strong impossibility results, e.g. that even under natural assumptions, no aggregation of the experts' individual forecasts can outperform simply following a random expert (Neyman and Roughgarden, 2022).
In this paper, we introduce a more general framework that allows the principal to elicit richer information from experts through structured queries. Our framework ensures that experts will truthfully report their underlying beliefs, and also enables us to define notions of complexity over the difficulty of asking these queries. Under a general model of independent but overlapping expert signals, we show that optimal aggregation is achievable in the worst case with each complexity measure bounded above by the number of agents $n$. We further establish tight tradeoffs between accuracy and query complexity: aggregation error decreases linearly with the number of queries, and vanishes when the "order of reasoning" and number of agents relevant to a query is $\omega(\sqrt{n})$. These results demonstrate that modest extensions to the space of expert queries dramatically strengthen the power of robust forecast aggregation. We therefore expect that our new query framework will open up a fruitful line of research in this area.

@Techmeme@techhub.social
2026-01-06 00:45:34

Nvidia announces the Alpamayo family of AI models, tools, and datasets for AVs, and details a collaboration with Mercedes-Benz on its first full-stack AV effort (Larry Dignan/Constellation Research)
constellationr.com/blog-news/i

@Techmeme@techhub.social
2025-12-06 02:20:57

An analysis of 100T tokens from the past year shows reasoning models now represent over half of all usage, open-weight model use has grown steadily, and more (OpenRouter)
openrouter.ai/state-of-ai

@Techmeme@techhub.social
2025-12-02 05:50:39

Nvidia announces Alpamayo-R1, an AI model for autonomous driving research, and calls it the "first industry-scale open reasoning vision language action model" (Rebecca Szkutak/TechCrunch)
techcrunch.com/2025/12/01/nvid