Tootfinder

Opt-in global Mastodon full text search. Join the index!

@Techmeme@techhub.social
2026-02-05 18:07:52

OpenAI launches GPT-5.3-Codex, which it says runs 25% faster, enabling longer-running tasks, and "is our first model that was instrumental in creating itself" (David Gewirtz/ZDNET)
zdnet.com/article/openai-gpt-5

@Mediagazer@mstdn.social
2025-12-04 19:30:52

Business Insider launches a monthlong pilot AI program to publish quick news stories, edited by BI editors, created using a GPT trained on its archives (Jamie Heller/Business Insider)
businessinsider.com/ai-pilot

@Techmeme@techhub.social
2026-02-05 18:12:25

OpenAI says GPT-5.3-Codex goes beyond an agent that can code "to an agent that can do nearly anything developers and professionals can do on a computer" (OpenAI)
openai.com/index/introducing-g

@Kingu@sakurajima.moe
2026-02-03 18:47:45

Emacs was already able to slopify since the beginning of time
And is way better at it than co-pilot and all those GPT things
just do M-x psychoanalyze-pinhead

@heiseonline@social.heise.de
2025-12-12 04:09:00

GPT-5.2: New AI model from OpenAI aims to better support office work
Just one month after GPT-5.1, a new AI model arrives from the ChatGPT developers. GPT-5.2 is said to produce better spreadsheets, presentations, and code.

@Techmeme@techhub.social
2025-11-05 12:40:50

Netherlands-based Nebius unveils Token Factory, a platform to let companies use open source AI models like GPT-oss, in a bid to compete with AWS and Azure (Dina Bass/Bloomberg)
bloomberg.com/news/articles/20

@arXiv_qbioNC_bot@mastoxiv.page
2025-12-05 08:31:11

Human-Centred Evaluation of Text-to-Image Generation Models for Self-expression of Mental Distress: A Dataset Based on GPT-4o
Sui He, Shenbin Qian
arxiv.org/abs/2512.04087

@Techmeme@techhub.social
2025-12-05 02:05:54

Physicist Steve Hsu says he has published a peer-reviewed theoretical physics paper whose main idea came from GPT-5 (Steve Hsu/@hsu_steve)
x.com/hsu_steve/status/1996034

@erikdelareguera@mastodon.nu
2025-11-21 09:43:44

Macron: "Who do people vote for if they ask Chat GPT?"
At the same time, the EU is on its way to pausing parts of its AI legislation, following pressure from the US.
dn.se/varlden/macron-vem-rosta

@usul@piaille.fr
2026-01-01 07:18:28

Current status: comparing GPT/Gemini at creating a Mandelbrot set in both Rust and assembly.
Gemini is a clear winner.

@heiseonline@social.heise.de
2025-11-13 10:43:00

GPT-5.1 launches: "smarter and more entertaining"
OpenAI is making GPT-5.1 available for ChatGPT. OpenAI does not yet know how to deal with people who have AI relationships.

@Techmeme@techhub.social
2026-01-30 00:20:50

OpenAI plans to retire several models from ChatGPT on February 13, including GPT‑4o, GPT‑4.1, and o4-mini, saying only 0.1% of users still choose GPT-4o (Ashley Capoot/CNBC)
cnbc.com/2026/01/29/openai-wil

@ErikJonker@mastodon.social
2025-11-19 07:28:07

"To be very clear, Gemini 3 isn’t perfect, and it still needs a manager who can guide and check it. But it suggests that “human in the loop” is evolving from “human who fixes AI mistakes” to “human who directs AI work.” And that may be the biggest change since the release of ChatGPT."

@offenenetze@chaos.social
2025-11-11 09:35:52

Defeat for Chat-GPT before the Munich Regional Court (LG München)
sueddeutsche.de/wirtschaft/mue

@rigo@mamot.fr
2025-11-17 08:31:56

Great show/video from TV Monaco on generative AI with the experts from INRIA and the university of Sophia Antipolis. To be consumed without moderation!
videos.tvmonaco.com/content/ia

@digitalnaiv@mastodon.social
2025-12-31 10:19:00

Marcus Schwarze presents the top artificial intelligence tools of 2026 in the #FAZ and praises the image generation features of Gemini and ChatGPT.
My experience: if both image AIs fail at the German umlaut in "Ausgewählt", you can't speak of really good tools. Sorry. That is simply not intelligent at all.

@kubikpixel@chaos.social
2026-01-16 14:55:34

"Artificial intelligence: GPT-4o makes disturbing statements after code training:
When LLMs are trained on vulnerabilities, they suddenly show misbehavior in completely different areas. Researchers warn of risks."
In my opinion this is anything but surprising; how do you see it? I even think that a great deal more error-prone code is being produced because of this.
🤖

@tante@tldr.nettime.org
2026-01-28 10:00:05

"What Lin and Cursor achieved was to show that an AI agent can generate millions of lines of code that’s lifted from other projects, and that don’t compile, let alone work."
(Original title: Cursor lies about vibe-coding a web browser with AI)
pivot-to-ai.c…

@heiseonline@social.heise.de
2025-12-17 15:37:01

ChatGPT Images: OpenAI introduces new image generation model GPT-Image-1.5
OpenAI has introduced the new image generation model GPT-Image-1.5. It is said to work faster and more precisely and is probably the answer to Google's Nano Banana.

@paulbusch@mstdn.ca
2025-12-04 12:41:13

Good Morning #Canada
Most of us - who are not CEOs, or CFOs, or CTOs, or C-somethings - know that #AI makes the user experience shittier and we'd prefer it wasn't installed on our phone, tablets, applications or toasters. But when it gets installed on children's toys, that becomes a whole new level of evil perpetrated on us by the tech bros. Numerous media outlets reported in the past month about the dangers of letting your kids access AI via a cute and cuddly toy, most focused on a teddy bear that used Chat GPT to explain fetishes and role playing to unsuspecting children. I don't think Santa's Elves were involved in the quality control process. Stick with something low tech, like Play-Doh or LEGO.
#CanadaIsAwesome #BeSafeOutThere
cbc.ca/radio/thecurrent/ai-toy

@dawid@social.craftknight.com
2026-01-30 16:18:38

@… 90% of the doctors I've had over my lifetime could be replaced with GPT Pro. Several times I would have died because of incompetence, cost-cutting, and treatment carried out with complete disregard for the art of medicine.

On the other hand, my wife got a diet from a Doctor of Dietetics that has so many errors that either someone got their diploma out of a bag of chips, or they really can't…

@christydena@zirk.us
2026-01-28 04:13:44

Disappointed to see that the author of "Literature for a Changing Planet" is creating lots of GPT bots to talk with old "masters" etc. Not even for activism. Sigh.

This is the cover of the book. It has the title with an abstract representation of a factory.
This is a screenshot of some of his AI, which includes Darwin, Socrates, Nietzsche, Wittgenstein, Cavendish and Heidegger.

@offenenetze@chaos.social
2025-11-11 15:38:50

Ruling in Munich:
ChatGPT may not use song lyrics without a license
zdfheute.de/wirtschaft/unterne

@mela@zusammenkunft.net
2025-12-23 16:52:59

Personally, I still find all the models far too chummy, and I can't stand this fake enthusiasm/agreement any more than I can in interpersonal contact.
forbes.com/sites/richardnieva/

OpenAI wanted GPT-5 to be less warm and agreeable than its predecessor. Some people with conditions such as autism struggled with the change, showing the tricky balance AI companies must strike when releasing new models.

@heiseonline@social.heise.de
2025-11-14 14:04:00

AI Update: GPT-5.1, making machines human, defined AI, Anthropic investment
The "KI-Update" delivers a summary of the most important AI developments every weekday.

@Techmeme@techhub.social
2025-12-02 11:25:57

Study: using the SCONE-bench benchmark of 405 smart contracts, Claude Opus 4.5, Sonnet 4.5, and GPT-5 found and developed exploits collectively worth $4.6M (Anthropic)
red.anthropic.com/2025/smart-c

@rperezrosario@mastodon.social
2025-12-23 02:05:10

Happy belated #WinterSolstice to us in the Northern Hemisphere, and also happy belated #SummerSolstice to those of you in the Southern Hemisphere!

A black and white line art drawing wishing readers a happy winter/summer solstice. The image was designed and executed by GPT-4o.

@Techmeme@techhub.social
2025-11-20 18:50:51

OpenAI says GPT-5 has demonstrated the ability to accelerate scientific research workflows but can't run projects or solve scientific problems autonomously (Radhika Rajkumar/ZDNET)
zdnet.com/article/gpt-5-is-spe

A damning new study could put AI companies on the defensive.
In it, Stanford and Yale researchers found compelling evidence that AI models are actually copying all that data,
not “learning” from it.
Specifically, four prominent LLMs
— OpenAI’s GPT-4.1, Google’s Gemini 2.5 Pro, xAI’s Grok 3, and Anthropic’s Claude 3.7 Sonnet
— happily reproduced lengthy excerpts from popular
— and protected
— works, with a stunning degree of accuracy.
They fou…

@Techmeme@techhub.social
2026-02-02 18:08:02

OpenAI launches a Codex app for macOS, designed to serve as a command center for managing AI agents, and says Codex usage has nearly doubled since mid-December (David Gewirtz/ZDNET)
zdnet.com/article/openai-codex

@digitalnaiv@mastodon.social
2026-01-27 15:00:49

That AI models can hold their own against the "average person" on creativity tests is no longer sci-fi but a research finding: GPT-4 and co. scored above the participants on average. But the truly original minds remain human, emphasizes Manon Bischoff (Spektrum) #KI #Kreativität

@servelan@newsie.social
2025-12-18 07:05:42

Is ChatGPT Anti-Abortion?
jessica.substack.com/p/is-chat

@freeminded@tooting.ch
2026-02-04 07:46:51

#liip is launching an #AI bot for the #Steuererklärung (tax return). I'm curious to see how it goes. Specialized #GPT models are, in my view, the sensible future of AI applications.
liip.ch/en/blog/liipgpt-helps-
@…

@blakes7bot@mas.torpidity.net
2026-01-14 10:14:57

Series C, Episode 13 - Terminal
SERVALAN: I'm going to the flight deck. Get rid of him.
VILA: Eh?
blake.torpidity.net/m/313/418 B7B2

GPT 4.1 Nano describes the image as: "This image appears to be a scene from a science fiction TV show or movie, set in a futuristic or space-themed environment with a distinctive geometric background. The woman on the left, dressed in a black, edgy, and asymmetric outfit, is extending her arm towards the man on the right, who is wearing a beige or light-colored uniform with a high collar and long sleeves. The scene's mood suggests a moment of confrontation or significant dialogue. In the backgr…

@Techmeme@techhub.social
2025-12-12 07:01:18

GPT-5.2 models match GPT-5 and 5.1 with a 400K context window and 128K max output tokens, but have a newer knowledge cutoff of Aug. 31, 2025 vs. Sept. 30, 2024 (Simon Willison/Simon Willison's Newsletter)
simonw.substack.com/p/gpt-52-a

@tinoeberl@mastodon.online
2026-01-08 06:07:02

#KünstlicheIntelligenz (artificial intelligence) can effectively debunk #Verschwörungstheorien (conspiracy theories). Through targeted argumentation, belief in such theories among participants fell by 20%. The chats also had a lasting effect over the following months. The results show…

@Techmeme@techhub.social
2025-12-11 18:18:02

OpenAI says GPT-5.2 Thinking hallucinates less than GPT-5.1 and has improved reliability for agentic AI needs; pre-release testers include Notion, Box, Shopify (Hayden Field/The Verge)
theverge.com/ai-artificial-int

@Techmeme@techhub.social
2025-11-18 17:26:22

Gemini 3 demonstrates strong planning, coding, and judgment skills, and shows how AI models moved past hallucinations to subtle, and often human-like, errors (Ethan Mollick/One Useful Thing)
oneusefulthing.org/p/three-yea

@heiseonline@social.heise.de
2025-12-12 05:18:00

Friday: Criticism of eID card over money laundering, new OpenAI model as office assistant
eID card too easy to obtain by fraud; GPT-5.2 for professional users; Disney vs. Google AI over copyright; criticism of EU over VMware; robot movements explained

@Techmeme@techhub.social
2025-11-30 06:40:47

Alibaba Technical Report: Qwen3-VL beats GPT-5 and Gemini 2.5 Pro on visual tasks and has 100% accuracy on "needle-in-a-haystack" tests for 30-minute videos (Jonathan Kemper/The Decoder)
the-decoder.com/qwen3-vl-can-s

@K_luep@mastodon.social
2025-11-09 20:09:40

Can Chat GPT do ultrasound images? #Tatort

@ErikJonker@mastodon.social
2025-11-08 15:35:17

Look at the capabilities versus costs of Kimi K2 and GPT-5. Kimi K2 is 3 times as cheap with similar performance.
#AI

Intelligence of various AI models compared
Cost of various AI models compared

@Mediagazer@mstdn.social
2026-01-10 07:26:09

Researchers say GPT 4.1, Claude 3.7 Sonnet, Gemini 2.5 Pro, and Grok 3 can reproduce long excerpts from books they were trained on when strategically prompted (Alex Reisner/The Atlantic)
theatlantic.com/technology/202

@arXiv_csGR_bot@mastoxiv.page
2026-01-21 08:02:08

Proc3D: Procedural 3D Generation and Parametric Editing of 3D Shapes with Large Language Models
Fadlullah Raji, Stefano Petrangeli, Matheus Gadelha, Yu Shen, Uttaran Bhattacharya, Gang Wu
arxiv.org/abs/2601.12234 arxiv.org/pdf/2601.12234 arxiv.org/html/2601.12234
arXiv:2601.12234v1 Announce Type: new
Abstract: Generating 3D models has traditionally been a complex task requiring specialized expertise. While recent advances in generative AI have sought to automate this process, existing methods produce non-editable representation, such as meshes or point clouds, limiting their adaptability for iterative design. In this paper, we introduce Proc3D, a system designed to generate editable 3D models while enabling real-time modifications. At its core, Proc3D introduces procedural compact graph (PCG), a graph representation of 3D models, that encodes the algorithmic rules and structures necessary for generating the model. This representation exposes key parameters, allowing intuitive manual adjustments via sliders and checkboxes, as well as real-time, automated modifications through natural language prompts using Large Language Models (LLMs). We demonstrate Proc3D's capabilities using two generative approaches: GPT-4o with in-context learning (ICL) and a fine-tuned LLAMA-3 model. Experimental results show that Proc3D outperforms existing methods in editing efficiency, achieving more than 400x speedup over conventional approaches that require full regeneration for each modification. Additionally, Proc3D improves ULIP scores by 28%, a metric that evaluates the alignment between generated 3D models and text prompts. By enabling text-aligned 3D model generation along with precise, real-time parametric edits, Proc3D facilitates highly accurate text-based image editing applications.
toXiv_bot_toot

@Techmeme@techhub.social
2025-12-11 18:06:51

OpenAI launches GPT-5.2, its "best model yet," in Instant, Thinking, and Pro variants, with significant improvements in writing, coding, and reasoning (Maxwell Zeff/Wired)
wired.com/story/openai-gpt-lau

@ErikJonker@mastodon.social
2025-11-08 14:10:48

With models like Kimi K2 freely available, doesn't the OpenAI business case for GPT-5 become extremely bad? 🤔
#AI #KimiK2 #chatgpt

@Techmeme@techhub.social
2025-12-16 18:01:25

Source: OpenAI rolled back ChatGPT's model router, which sent some queries to reasoning models, for Free and $5/month Go tiers, as it was costly and hurt DAUs (Maxwell Zeff/Wired)
wired.com/story/openai-router-

@Techmeme@techhub.social
2026-01-15 09:40:42

Scientists say that AI has become a powerful and rapidly improving research tool, and that whether it is generating ideas on its own is, for now, a moot point (Cade Metz/New York Times)
nytimes.com/2026/01/14/technol

@Techmeme@techhub.social
2025-11-13 23:30:49

OpenAI releases GPT-5.1 in the API, featuring a "no-reasoning" mode and extended prompt caching with up to 24-hour retention to generate faster responses (OpenAI)
openai.com/index/gpt-5-1-for-d

@ErikJonker@mastodon.social
2025-11-08 15:07:21

Kimi K2 is another DeepSeek moment, it seems, only not everybody is noticing it yet. It will be interesting to see what the stock market will do on Monday.
#AI #KimiK2

Someone tested Kimi K2 on unpublished material and it performed as well as GPT-5 and Gemini 2.5

@Techmeme@techhub.social
2025-11-19 19:32:12

OpenAI unveils GPT-5.1-Codex-Max, saying it is "significantly better" at "long-horizon reasoning" and is the first model it has trained for Windows environments (David Gewirtz/ZDNET)
zdnet.com/article/op…

@Techmeme@techhub.social
2025-12-11 19:16:04

[Thread] GPT-5.2 is now available in the API, priced at $1.75/1M input and $14/1M output tokens; GPT-5.2 Pro is priced at $21/1M input and $168/1M output tokens (@openaidevs)
x.com/openaidevs/status/199918

@Techmeme@techhub.social
2026-01-27 18:31:03

OpenAI for Science launches Prism, a free LaTeX-based text editor that embeds GPT-5.2 to assist in scientific paper drafting and citation management (Will Douglas Heaven/MIT Technology Review)
technologyreview.com/2026/01/2

@mariyadelano@hachyderm.io
2025-11-13 22:00:11

Curious that whenever someone shows me “the cool #AI flow” they built that’s supposed to be impressive, the conversation goes the same way:
Stage 1: “But you don’t understand. You don’t like AI because you haven’t used it right. Let me show you how much you can do with it.”
Stage 2: “Here are the steps in the flow and the instructions I feed to this agent / custom GPT / Claude project. I tell it to do X, reference document Y, and aim for Z.”
Stage 3: “Now, let me show you the results it gives.”
*Writes task, presses to run the prompt.*
Stage 4: “Umm sorry it’s taking a while. It’s fast but not instant. And by the way, the prompt isn’t perfect, you can definitely make it better. I just threw this together real quick the other day. It makes some mistakes, but it’s really good.”
Stage 5: “Uuuuuuh actually don’t look at the output.” *scrolls or stops screen share or pulls device away.*
“You know it’s already doing so well, if I do more prompt engineering it will get really good but I need to give it better instructions. And it ran just fine last night, I don’t know what’s up with it. And this is a cheap model, if we use another model it will be better.”
Stage 6: “You know, you really shouldn’t judge this so much. The technology will improve, it will get there sooner than you know and then you’ll regret not trying it sooner.”
So curious that this keeps happening 🤷‍♀️
#LLMs #work #tech #AIBubble

@ErikJonker@mastodon.social
2026-01-07 16:22:40

How China dominates the world of open AI models, a nice overview. Without you knowing it, many companies' applications are backed by a Chinese open model, whether or not improved, fine-tuned, and the like. interconnects.ai/p/8-plots-tha

@Techmeme@techhub.social
2026-01-25 01:50:58

Tests show GPT-5.2 on ChatGPT citing Grokipedia as a source on a wide range of queries, including on Iranian conglomerates and Holocaust deniers (Aisha Down/The Guardian)
theguardian.com/technology/202

@Techmeme@techhub.social
2026-01-26 17:50:42

Qwen releases Qwen3-Max-Thinking, its flagship reasoning model that it says demonstrates performance comparable to models such as GPT-5.2 Thinking and Opus 4.5 (Qwen)
qwen.ai/blog?id=qwen3-max-thin

@Techmeme@techhub.social
2025-11-24 18:15:53

OpenAI unveils a free shopping research feature in ChatGPT that delivers a personalized buyer's guide, powered by a custom version of GPT-5 mini (Sabrina Ortiz/ZDNET)
zdnet.com/article/chatgpts-new

@Techmeme@techhub.social
2025-11-24 20:45:47

Anthropic prices Claude Opus 4.5 at $5/1M input and $25/1M output tokens, much cheaper than Opus 4.1 at $15/$75 but still pricier than GPT-5.1 and Gemini 3 Pro (Simon Willison/Simon Willison's Weblog)
simonwillison.net/2025/Nov/24/

@Techmeme@techhub.social
2026-01-26 12:26:43

Interviews with 100 therapists and psychiatrists on clients' AI chatbot usage show, while there are some upsides, conversations also deepened negative feelings (New York Times)
nytimes.com/2026/01/26/us/chat

@Techmeme@techhub.social
2025-11-13 20:41:04

Baidu unveils Ernie 5.0, an AI model to process and generate text, images, audio, and video, claiming it beats GPT-5-High and Gemini 2.5 Pro on some benchmarks (Carl Franzen/VentureBeat)
venturebeat.com/ai/baidu-unvei

@Techmeme@techhub.social
2025-11-24 18:35:42

Microsoft unveils Fara-7B, its first agentic SLM designed for computer use, available as an experimental release on Hugging Face and Microsoft Foundry (Ben Dickson/VentureBeat)
venturebeat.com/ai/microsofts-

@Techmeme@techhub.social
2025-12-22 19:20:43

OpenAI rolls out Your Year with ChatGPT, a Spotify Wrapped-like feature, to Free, Plus, and Pro users in the US, the UK, Canada, Australia, and New Zealand (Sarah Perez/TechCrunch)
techcrunch.com/2025/12/22/chat

@Techmeme@techhub.social
2025-11-19 15:56:00

Gemini 3 hands-on: a fundamental improvement on daily use, extremely fast, Antigravity IDE is a powerful launch product, and its personality is terse and direct (matt shumer)
shumer.dev/gemini3review

@Techmeme@techhub.social
2025-11-18 20:55:53

Gemini 3 Pro is priced at $2-$4 per 1M input tokens and $12-$18 per 1M output tokens, cheaper than Claude Sonnet 4.5 but more expensive than GPT-5.1 (Simon Willison/Simon Willison's Weblog)
simonwillison.net/2025/Nov/18/

@Techmeme@techhub.social
2025-11-20 19:20:49

OpenAI expands group chats in ChatGPT globally to all logged-in users on Free, Go, Plus, and Pro plans, after piloting the feature in select regions (Aisha Malik/TechCrunch)
techcrunch.com/2025/11/20/chat

@Techmeme@techhub.social
2025-11-17 09:45:44

Chinese toymaker FoloToy suspends sales of its GPT-4o-powered teddy bear, after researchers found the toy gave kids harmful responses, including sexual content (Brandon Vigliarolo/The Register)
theregister.com/2025/11/13/ai_

@Techmeme@techhub.social
2025-12-18 18:57:30

OpenAI releases GPT‑5.2-Codex, with improvements on long-horizon work through context compaction, stronger performance on large code changes, and more (OpenAI)
openai.com/index/introducing-g

@Techmeme@techhub.social
2025-12-17 16:15:44

Google makes Gemini 3 Flash the default model in Gemini app and Search's AI mode; it scored 33.7% without tool use on Humanity's Last Exam vs. GPT-5.2's 34.5% (Ivan Mehta/TechCrunch)
techcrunch.com/2025/12/17/goog

@Techmeme@techhub.social
2025-12-16 17:21:00

OpenAI launches FrontierScience, a benchmark to measure models' expert-level scientific reasoning with 700 questions, finding GPT-5.2 is its strongest model (OpenAI)
openai.com/index/frontierscien

@Techmeme@techhub.social
2025-11-13 20:35:45

Anthropic open sources a method to score AI model political evenhandedness; Gemini 2.5 Pro got 97%, Grok 4 96%, Claude Opus 4.1 95%, GPT-5 89%, and Llama 4 66% (Ina Fried/Axios)
axios.com/2025/11/13/anthropic

@Techmeme@techhub.social
2026-01-10 07:25:54

Researchers say GPT 4.1, Claude 3.7 Sonnet, Gemini 2.5 Pro, and Grok 3 can reproduce long excerpts from books they were trained on when strategically prompted (Alex Reisner/The Atlantic)

@Techmeme@techhub.social
2025-12-12 17:06:30

Companies are updating insider trading policies to cover prediction markets; Kalshi and others are pushing for federal oversight, including of insider trading (Rocket Drew/The Information)
theinformation.com/articles/po

@Techmeme@techhub.social
2025-12-10 16:54:06

Sources: Meta's new AI model, codenamed Avocado, may launch in spring 2026 as a "closed" model, and was trained using Google's Gemma, OpenAI's gpt-oss, and Qwen (Bloomberg)
bloomberg.com/news/articles/20

@Techmeme@techhub.social
2025-11-09 09:01:30

The Alpha Arena experiment gave six frontier models $10K each to trade crypto derivatives over two weeks: losses ranged from Qwen3 Max's $652 to GPT-5's $5,679 (Sebastian Pellejero/Reuters)
reuters.com/commentary/breakin

@Techmeme@techhub.social
2025-12-11 18:45:58

OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost (OpenAI)
openai.com/index/introducing-g

@Techmeme@techhub.social
2025-11-07 02:50:49

Chinese startup Moonshot releases Kimi K2 Thinking, an open-source model it claims beats GPT-5 in agentic capabilities; source: the model cost $4.6M to train (Evelyn Cheng/CNBC)
cnbc.com/2025/11/06/alibaba-ba

@Techmeme@techhub.social
2025-12-08 09:30:46

Google says Gemini 3 Pro sets new vision AI benchmark records, including in complex visual reasoning, beating Claude Opus 4.5 and GPT-5.1 in some categories (Rohan Doshi/The Keyword)
blog.google/technology/develop

@Techmeme@techhub.social
2025-12-10 21:56:20

ChatGPT was 2025's most downloaded free app in the US iOS App Store, up from No. 4 in 2024, followed by Threads, Google, TikTok, WhatsApp, and Instagram (Sarah Perez/TechCrunch)
techcrunch.com/2025/12/10/chat

@Techmeme@techhub.social
2025-12-07 16:05:39

Essential AI, whose CEO co-wrote Google's Attention Is All You Need paper, unveils Rnj-1, an 8B-parameter open model with SWE-bench performance close to GPT-4o (Ashish Vaswani/Essential AI)
essential.ai/research/rnj-1