Tootfinder

No exact results. Similar results found.

@Techmeme@techhub.social
2025-12-06 22:41:04

A profile of Byron Cook, a VP at Amazon who is leading the company's effort to reduce AI hallucinations with a feature called Automated Reasoning Checks (John Pavlus/Fast Company)
https://www.fastcompany.com/91446331/amazo

Amazon takes on AI's biggest nightmare: Hallucinations
Byron Cook, a distinguished scientist at Amazon, is is helping the company use an obscure type of AI to minimize AI's worst side effect.

@Techmeme@techhub.social
2025-12-08 09:30:46

Google says Gemini 3 Pro sets new vision AI benchmark records, including in complex visual reasoning, beating Claude Opus 4.5 and GPT-5.1 in some categories (Rohan Doshi/The Keyword)
https://blog.google/technology/developers/gemini-3-pro-vision/

Gemini 3 Pro: the frontier of vision AI
Build with Gemini 3 Pro, the best model in the world for multimodal capabilities.

@seeingwithsound@mas.to
2026-02-06 07:28:33

The impact of mental images on reasoning: a study on #aphantasia https://www.sciencedirect.com/science/article/pii/S0028393226000229 mental imagery

@Techmeme@techhub.social
2026-01-08 20:45:46

OpenAI is rolling out a HIPAA-compliant version of ChatGPT for clinicians to assist with medical reasoning and administrative tasks, at Cedars-Sinai and others (Shirin Ghaffary/Bloomberg)
https://www.bloomberg.com/news/newsletters

@mia@hcommons.social
2025-12-05 16:06:49

I loved @jasonclark.bsky.social 's #FF2025 'problem statement':
'We were given access to a weirdly confident intern that can’t explain how they got their answers.
It is a seamless interface that breaks search interaction retrieval patterns.'

{ "jason": "clark" } (@jasonclark.bsky.social)
Slides and code from my #ff2025 talk, "Reasoning with Small Language Models (SLM) for Trustworthy Generative AI (GenAI)." Slides: https://docs.google.com/presentation/d/1NHCuyuPd9KK3QUo8izeta2CMIk-rmcy1/edit?usp=drive_link&ouid=103883366387427181221&rtpof=true&sd=true Code: https://github.com/jasonclark/agentic-search-react

@servelan@newsie.social
2026-01-05 18:28:10

'The problem here is not that Bongino is engaging in punditry. When properly done, pundits make arguments based on facts and reasoning. Bongino, by his own account, was doing something else entirely: He was telling his audience that the bombing was an inside job was a fact, when it was not only not true but also not based on any real circumstantial evidence.'
'Astonishing': Analyst stunned by Dan Bongino's jaw-dropping admission to Fox News viewers - Raw Story
https://www.rawstory.com/dan-bongino-2674380874/

@arXiv_csGT_bot@mastoxiv.page
2025-12-08 08:18:30

Robust forecast aggregation via additional queries
Rafael Frongillo, Mary Monroe, Eric Neyman, Bo Waggoner
https://arxiv.org/abs/2512.05271 https://arxiv.org/pdf/2512.05271 https://arxiv.org/html/2512.05271
arXiv:2512.05271v1 Announce Type: new
Abstract: We study the problem of robust forecast aggregation: combining expert forecasts with provable accuracy guarantees compared to the best possible aggregation of the underlying information. Prior work shows strong impossibility results, e.g. that even under natural assumptions, no aggregation of the experts' individual forecasts can outperform simply following a random expert (Neyman and Roughgarden, 2022).
In this paper, we introduce a more general framework that allows the principal to elicit richer information from experts through structured queries. Our framework ensures that experts will truthfully report their underlying beliefs, and also enables us to define notions of complexity over the difficulty of asking these queries. Under a general model of independent but overlapping expert signals, we show that optimal aggregation is achievable in the worst case with each complexity measure bounded above by the number of agents $n$. We further establish tight tradeoffs between accuracy and query complexity: aggregation error decreases linearly with the number of queries, and vanishes when the "order of reasoning" and number of agents relevant to a query is $\omega(\sqrt{n})$. These results demonstrate that modest extensions to the space of expert queries dramatically strengthen the power of robust forecast aggregation. We therefore expect that our new query framework will open up a fruitful line of research in this area.
toXiv_bot_toot

@Techmeme@techhub.social
2026-01-06 00:45:34

Nvidia announces the Alpamayo family of AI models, tools, and datasets for AVs, and details a collaboration with Mercedes-Benz on its first full-stack AV effort (Larry Dignan/Constellation Research)
https://www.constellationr.com/blog-news/i

A look at how Mercedes Benz, Nvidia collaborated on autonomous vehicles
Nvidia outlined its Alpamayo open AI models and datasets to bring reasoning to autonomous vehicles. At CES 2025, Nvidia CEO Jensen Huang said Mercedes Benz with Alpamayo will hit the road in the first quarter. Huang said Alpamayo and its collaboration with Mercedes Benz is its first full-stack effort for autonomous vehicles (AVs). The approach with Alpamayo revolves around reasoning-based vision language action (VLA) models that bring human thinking to autonomous vehicle designs. "Our vision is…

@Techmeme@techhub.social
2025-12-06 02:20:57

An analysis of 100T tokens from the past year shows reasoning models now represent over half of all usage, open-weight model use has grown steadily, and more (OpenRouter)
https://openrouter.ai/state-of-ai

State of AI | OpenRouter
An empirical study analyzing over 100 trillion tokens of real-world LLM interactions across tasks, geographies, and time.

@Techmeme@techhub.social
2025-12-02 05:50:39

Nvidia announces Alpamayo-R1, an AI model for autonomous driving research, and calls it the "first industry-scale open reasoning vision language action model" (Rebecca Szkutak/TechCrunch)
https://techcrunch.com/2025/12/01/nvid

Nvidia announces new open AI models and tools for autonomous driving research | TechCrunch
Nvidia continues its push into physical AI with the release of a new reasoning world model and other tools for physical AI.

Tootfinder

Opt-in global Mastodon full text search. Join the index!