OpenAI launches GPT-5.3-Codex, which it says runs 25% faster, enabling longer-running tasks, and "is our first model that was instrumental in creating itself" (David Gewirtz/ZDNET)
https://www.zdnet.com/article/openai-gpt-5-3-codex-faster-goes-beyond-c…
OpenAI says GPT-5.3-Codex goes beyond an agent that can code "to an agent that can do nearly anything developers and professionals can do on a computer" (OpenAI)
https://openai.com/index/introducing-gpt-5-3-codex/
Emacs was already able to slopify since the beginning of time
And is way bette at it than co-pilot and all those GPT things
just do M-X psychoanalyze-pinhead
GPT-5.2: Neues KI-Modell von OpenAI soll Büroarbeiten besser unterstützen
Nur einen Monat nach GPT-5.1 kommt ein neues KI-Modell der ChatGPT-Entwickler. GPT-5.2 soll bessere Tabellen, Präsentationen und Code produzieren können.
Netherlands-based Nebius unveils Token Factory, a platform to let companies use open source AI models like GPT-oss, in a bid to compete with AWS and Azure (Dina Bass/Bloomberg)
https://www.bloomberg.com/news/articles/2025-11-0…
Human-Centred Evaluation of Text-to-Image Generation Models for Self-expression of Mental Distress: A Dataset Based on GPT-4o
Sui He, Shenbin Qian
https://arxiv.org/abs/2512.04087
Current sttaus comparing GPT/Gemini to create mandelbrot set using both rust and assembly.
Gemini is a clear winner.
GPT-5.1 am Start: "intelligenter und unterhaltsamer"
OpenAI macht GPT-5.1 für ChatGPT verfügbar. Wie man mit Menschen umgeht, die KI-Beziehungen führen, weiß OpenAI bisher nicht.
https://www.
OpenAI plans to retire several models from ChatGPT on February 13, including GPT‑4o, GPT‑4.1, and o4-mini, saying only 0.1% of users still choose GPT-4o (Ashley Capoot/CNBC)
https://www.cnbc.com/2026/01/29/openai-will-retire-gpt-4o-from-chatgpt-next-mon…
"To be very clear, Gemini 3 isn’t perfect, and it still needs a manager who can guide and check it. But it suggests that “human in the loop” is evolving from “human who fixes AI mistakes” to “human who directs AI work.” And that may be the biggest change since the release of ChatGPT."
Super émission/video de TV Monaco sur l'IA générative avec les pros de l'INRIA et de l'université Sophia Antipolis. A consommer sans modération!
https://videos.tvmonaco.com/content/ia-chat-gpt-et-les-ia-generatives
Marcus Schwarze stellt in der #FAZ die Toptools der Künstlichen Intelligenz 2026 vor und lobt die Bild geneirerungsfunlktionen von Gemini und ChatGPT.
Meine Erfahrung: Wenn beide Bild-KIs am deutschen Umlaut "Ausgewählt" scheitern kann man nicht von wirklich guten Tools sprechen. Sorry. Das ist einfach gar nicht intelligent.
»Künstliche Intelligenz — GPT-4o macht nach Code-Training verstörende Aussagen:
Werden LLMs auf Schwachstellen trainiert, zeigen sie plötzlich Fehlverhalten in völlig anderen Bereichen. Forscher warnen vor Risiken.«
Meiner Meinung nach kommt dies alles andere als überraschend, wie seht ihr es? Ich bin sogar der Meinung, dass sehr viel mehr Fehler anfälliger Code deswegen erstellt wird.
🤖
"What Lin and Cursor achieved was to show that an AI agent can generate millions of lines of code that’s lifted from other projects, and that don’t compile, let alone work."
(Original title: Cursor lies about vibe-coding a web browser with AI)
https://pivot-to-ai.c…
ChatGPT Images: OpenAI stellt neues Bildgenerierungsmodell GPT-Image-1.5 vor
OpenAI hat das neue Bildgenerierungsmodell GPT-Image-1.5 vorgestellt. Es soll schneller und präziser arbeiten und ist wohl die Antwort auf Googles Nano Banana.
Good Morning #Canada
Most of us - who are not CEOs, or CFOs, or CTOs, or C-somethings - know that #AI makes the user experience shittier and we'd prefer it wasn't installed on our phone, tablets, applications or toasters. But when it gets installed on children's toys, that becomes a whole new level of evil perpetrated on us by the tech bros. Numerous media outlets reported in the past month about the dangers of letting your kids access AI via a cute and cuddly toy, most focused on a teddy bear that used Chat GPT to explain fetishes and role playing to unsuspecting children. I don't think Santa's Elves were involved in the quality control process. Stick with something low tech, like Play-Doh or LEGO.
#CanadaIsAwesome ##BeSafeOutThere
https://www.cbc.ca/radio/thecurrent/ai-toys-for-kids-safety-9.7001764
@… 90% lekarzy jakich miałem w ciągu życia można by zamienić na gpt pro. Kilka razy bym umarł przez niekompetencje i oszczędzanie i prowadzenie leczenia kompletnie olewając sztukę lekarską.
Z drugiej strony żona dostała dietę od Doktor Dietetyki, która ma tyle błędów, że albo ktoś dostał dyplom w chipsach, albo naprawdę nie potrafi w…
Disappointed to see that the author of "Literature for a Changing Planet" is creating lots of GPT bots to talk with old "masters" etc. Not even for activism. Sigh.
Mir persönlich sind alle Modelle immer noch viel zu chummy und ich kann diese Fake-Begeisterung/-Zustimmung genausowenig ab, wie bei zwischenmenschlichem Kontakt.
https://www.forbes.com/sites/richardnieva/2025/12/19/openai-chatgpt-4o/
KI-Update: GPT-5.1, Maschinen menschlich machen, Definierte KI, Anthropic-Invest
Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.
https://www.…
Study: using the SCONE-bench benchmark of 405 smart contracts, Claude Opus 4.5, Sonnet 4.5, and GPT-5 found and developed exploits collectively worth $4.6M (Anthropic)
https://red.anthropic.com/2025/smart-contracts/
Happy belated #WinterSolstice to us on the Northern Hemisphere, and also happy belated #SummerSolstice to those of you on the Southern Hemisphere!
OpenAI says GPT-5 has demonstrated the ability to accelerate scientific research workflows but can't run projects or solve scientific problems autonomously (Radhika Rajkumar/ZDNET)
https://www.zdnet.com/article/gpt-5-is-spe
A damning new study could put AI companies on the defensive.
In it, Stanford and Yale researchers found compelling evidence that AI models are actually copying all that data,
not “learning” from it.
Specifically, four prominent LLMs
— OpenAI’s GPT-4.1, Google’s Gemini 2.5 Pro, xAI’s Grok 3, and Anthropic’s Claude 3.7 Sonnet
— happily reproduced lengthy excerpts from popular
— and protected
— works, with a stunning degree of accuracy.
They fou…
OpenAI launches a Codex app for macOS, designed to serve as a command center for managing AI agents, and says Codex usage has nearly doubled since mid-December (David Gewirtz/ZDNET)
https://www.zdnet.com/article/openai-codex-mac-app-free-trial/
Dass KI-Modelle bei Kreativitätstests dem „Durchschnittsmenschen“ Paroli bieten, ist kein Sci-Fi mehr, sondern Forschungsergebnis – GPT-4 & Co. lagen im Mittel über den Teilnehmenden. Doch die wirklich originellen Köpfe bleiben menschlich, betont Manon Bischoff (Spektrum) #KI #Kreativität
#liip bringt einen #AI bot für die #Steuererklärung. Bin dann mal gespannt. Spezialisierte #GPT Modelle sind aus meiner Sicht die sinnvolle Zukunft von AI Anwendungen.
https://www.liip.ch/en/blog/liipgpt-helps-you-complete-your-tax-return
@…
Series C, Episode 13 - Terminal
SERVALAN: I'm going to the flight deck. Get rid of him.
VILA: Eh?
https://blake.torpidity.net/m/313/418 B7B2
GPT-5.2 models match GPT-5 and 5.1 with a 400K context window and 128K max output tokens, but have a newer knowledge cutoff of Aug. 31, 2025 vs. Sept. 30, 2024 (Simon Willison/Simon Willison's Newsletter)
https://simonw.substack.com/p/gpt-52-and-useful-patterns-for-…
#KünstlicheIntelligenz kann effektiv #Verschwörungstheorien widerlegen. Durch gezielte Argumentation sank der Glaube an solche Theorien bei den Teilnehmenden um 20%. Die Chats hatten auch eine nachhaltige Wirkung auf die nächsten Monate. Die Ergebnisse zeigen, d…
OpenAI says GPT-5.2 Thinking hallucinates less than GPT-5.1 and has improved reliability for agentic AI needs; pre-release testers include Notion, Box, Shopify (Hayden Field/The Verge)
https://www.theverge.com/ai-artificial-intelligence/842529/open…
Gemini 3 demonstrates strong planning, coding, and judgment skills, and shows how AI models moved past hallucinations to subtle, and often human-like, errors (Ethan Mollick/One Useful Thing)
https://www.oneusefulthing.org/p/three-years-from-gpt-3-to-gemini
<…
Freitag: Kritik an eID-Karte wegen Geldwäsche, neues OpenAI-Modell als Bürohilfe
eID-Karte zu einfach zu ergaunern GPT-5.2 für Profi-Nutzer Disney gegen Google-KI wegen Copyright Kritik an EU wegen VMware Roboter-Bewegungen erklärt
Kann Chat GPT Ultraschallbilder? #Tatort
Look at the capabilities versus costs of Kimi K2 and GPT-5. Kimi K2 is 3 times as cheap with similar performance.
#AI
Proc3D: Procedural 3D Generation and Parametric Editing of 3D Shapes with Large Language Models
Fadlullah Raji, Stefano Petrangeli, Matheus Gadelha, Yu Shen, Uttaran Bhattacharya, Gang Wu
https://arxiv.org/abs/2601.12234 https://arxiv.org/pdf/2601.12234 https://arxiv.org/html/2601.12234
arXiv:2601.12234v1 Announce Type: new
Abstract: Generating 3D models has traditionally been a complex task requiring specialized expertise. While recent advances in generative AI have sought to automate this process, existing methods produce non-editable representation, such as meshes or point clouds, limiting their adaptability for iterative design. In this paper, we introduce Proc3D, a system designed to generate editable 3D models while enabling real-time modifications. At its core, Proc3D introduces procedural compact graph (PCG), a graph representation of 3D models, that encodes the algorithmic rules and structures necessary for generating the model. This representation exposes key parameters, allowing intuitive manual adjustments via sliders and checkboxes, as well as real-time, automated modifications through natural language prompts using Large Language Models (LLMs). We demonstrate Proc3D's capabilities using two generative approaches: GPT-4o with in-context learning (ICL) and a fine-tuned LLAMA-3 model. Experimental results show that Proc3D outperforms existing methods in editing efficiency, achieving more than 400x speedup over conventional approaches that require full regeneration for each modification. Additionally, Proc3D improves ULIP scores by 28%, a metric that evaluates the alignment between generated 3D models and text prompts. By enabling text-aligned 3D model generation along with precise, real-time parametric edits, Proc3D facilitates highly accurate text-based image editing applications.
toXiv_bot_toot
OpenAI launches GPT-5.2, its "best model yet," in Instant, Thinking, and Pro variants, with significant improvements in writing, coding, and reasoning (Maxwell Zeff/Wired)
https://www.wired.com/story/openai-gpt-launch-gemini-code-red/
With models like Kimi K2 freely available doesn't the OpenAI businesscase with GPT-5 becomes extremely bad? 🤔
#AI #KimiK2 #chatgpt
Source: OpenAI rolled back ChatGPT's model router, which sent some queries to reasoning models, for Free and $5/month Go tiers, as it was costly and hurt DAUs (Maxwell Zeff/Wired)
https://www.wired.com/story/openai-router-relaunch-gpt-5-sam-altman/
Scientists say that AI has become a powerful and rapidly improving research tool, and that whether it is generating ideas on its own is, for now, a moot point (Cade Metz/New York Times)
https://www.nytimes.com/2026/01/14/technol
OpenAI releases GPT-5.1 in the API, featuring a "no-reasoning" mode and extended prompt caching with up to 24-hour retention to generate faster responses (OpenAI)
https://openai.com/index/gpt-5-1-for-developers
Kann Chat GPT Ultraschallbilder? #Tatort
Kimi K2 is another Deepseek moment it seems, only not everybody is noticing it yet. It will be interesting to see what the stock market will do on monday.
#AI #KimiK2
OpenAI unveils GPT-5.1-Codex-Max, saying it is "significantly better" at "long-horizon reasoning" and is the first model it has trained for Windows environments (David Gewirtz/ZDNET)
https://www.zdnet.com/article/op…
[Thread] GPT-5.2 is now available in the API, priced at $1.75/1M input and $14/1M output tokens; GPT-5.2 Pro is priced at $21/1M input and $168/1M output tokens (@openaidevs)
https://x.com/openaidevs/status/1999184802755354954
OpenAI for Science launches Prism, a free LaTeX-based text editor that embeds GPT-5.2 to assist in scientific paper drafting and citation management (Will Douglas Heaven/MIT Technology Review)
https://www.technologyreview.com/2026/01/27/…
Curious that whenever someone shows me “the cool #AI flow” they built that’s supposed to be impressive, the conversation goes the same way:
Stage 1: “But you don’t understand. You don’t like AI because you haven’t used it right. Let me show you how much you can do it with.”
Stage 2: “Here are the steps in the flow and the instructions I feed to this agent / custom GPT / Claude project. I tell it to do X, reference document Y, and aim for Z.”
Stage 3: “Now, let me show you the results it gives.”
*Writes task, presses to run the prompt.*
Stage 4: “Umm sorry it’s taking a while. It’s fast but not instant. And by the way, the prompt isn’t perfect, you can definitely make it better. I just threw this together real quick the other day. It makes some mistakes, but it’s really good.”
Stage 5: “Uuuuuuh actually don’t look at the output.” *scrolls or stops screen share or pulls device away.*
“You know it’s already doing so well, if I do more prompt engineering it will get really good but I need to give it better instructions. And it ran just fine last night, I don’t know what’s up with it. And this is a cheap model, if we use another model it will be better.”
Stage 6: “You know, you really shouldn’t judge this so much. The technology will improve, it will get there sooner than you know and then you’ll regret not trying it sooner.”
So curious that this keeps happening 🤷♀️
#LLMs #work #tech #AIBubble
Hoe China de wereld van open AI modellen domineert, mooi overzicht. Zonder dat je het weet zit achter de toepassingen van veel bedrijven een chinees open model, al dan niet verbeterd/gefinetuned en dergelijke. https://www.interconnects.ai/p/8-plots-that-explain-the-state-of…
Tests show GPT-5.2 on ChatGPT citing Grokipedia as a source on a wide range of queries, including on Iranian conglomerates and Holocaust deniers (Aisha Down/The Guardian)
https://www.theguardian.com/technology/2026/jan…
Qwen releases Qwen3-Max-Thinking, its flagship reasoning model that it says demonstrates performance comparable to models such as GPT-5.2 Thinking and Opus 4.5 (Qwen)
https://qwen.ai/blog?id=qwen3-max-thinking
Anthropic prices Claude Opus 4.5 at $5/1M input and $25/1M output tokens, much cheaper than Opus 4.1 at $15/$75 but still pricier than GPT-5.1 and Gemini 3 Pro (Simon Willison/Simon Willison's Weblog)
https://simonwillison.net/2025/Nov/24/claude-opus/
Interviews with 100 therapists and psychiatrists on clients' AI chatbot usage show, while there are some upsides, conversations also deepened negative feelings (New York Times)
https://www.nytimes.com/2026/01/26/us/chatgpt-delusions-psychosis.html
…
Baidu unveils Ernie 5.0, an AI model to process and generate text, images, audio, and video, claiming it beats GPT-5-High and Gemini 2.5 Pro on some benchmarks (Carl Franzen/VentureBeat)
https://venturebeat.com/ai/baidu-unveils-proprietary-ern…
Microsoft unveils Fara-7B, its first agentic SLM designed for computer use, available as an experimental release on Hugging Face and Microsoft Foundry (Ben Dickson/VentureBeat)
https://venturebeat.com/ai/microsofts-fara-7b-is-a-computer-u…
OpenAI rolls out Your Year with ChatGPT, a Spotify Wrapped-like feature, to Free, Plus, and Pro users in the US, the UK, Canada, Australia, and New Zealand (Sarah Perez/TechCrunch)
https://techcrunch.com/2025/12/22/chatgpt-launches-a-year-end-review…
Gemini 3 hands-on: a fundamental improvement on daily use, extremely fast, Antigravity IDE is a powerful launch product, and its personality is terse and direct (matt shumer)
https://shumer.dev/gemini3review
Chinese toymaker FoloToy suspends sales of its GPT-4o-powered teddy bear, after researchers found the toy gave kids harmful responses, including sexual content (Brandon Vigliarolo/The Register)
https://www.theregister.com/2025/11/13/ai_toys_fmatches_knives_kink/
OpenAI releases GPT‑5.2-Codex, with improvements on long-horizon work through context compaction, stronger performance on large code changes, and more (OpenAI)
https://openai.com/index/introducing-gpt-5-2-codex/
Google makes Gemini 3 Flash the default model in Gemini app and Search's AI mode; it scored 33.7% without tool use on Humanity's Last Exam vs. GPT-5.2's 34.5% (Ivan Mehta/TechCrunch)
https://techcrunch.com/2025/12/17/goog
OpenAI launches FrontierScience, a benchmark to measure models' expert-level scientific reasoning with 700 questions, finding GPT-5.2 is its strongest model (OpenAI)
https://openai.com/index/frontierscience/
Anthropic open sources a method to score AI model political evenhandedness; Gemini 2.5 Pro got 97%, Grok 4 96%, Claude Opus 4.1 95%, GPT-5 89%, and Llama 4 66% (Ina Fried/Axios)
https://www.axios.com/2025/11/13/anthropic-bot-bias-data
Researchers say GPT 4.1, Claude 3.7 Sonnet, Gemini 2.5 Pro, and Grok 3 can reproduce long excerpts from books they were trained on when strategically prompted (Alex Reisner/The Atlantic)
https://www.
Sources: Meta's new AI model, codenamed Avocado, may launch in spring 2026 as a "closed" model, and was trained using Google's Gemma, OpenAI's gpt-oss, and Qwen (Bloomberg)
https://www.bloomberg.com/news/articles/20
The Alpha Arena experiment gave six frontier models $10K each to trade crypto derivatives over two weeks: losses ranged from Qwen3 Max's $652 to GPT-5's $5,679 (Sebastian Pellejero/Reuters)
https://www.reuters.com/commentary/breakin
OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost (OpenAI)
https://openai.com/index/introducing-gpt-5-2
Chinese startup Moonshot releases Kimi K2 Thinking, an open-source model it claims beats GPT-5 in agentic capabilities; source: the model cost $4.6M to train (Evelyn Cheng/CNBC)
https://www.cnbc.com/2025/11/06/alibaba-backed-moonshot-releas…
Google says Gemini 3 Pro sets new vision AI benchmark records, including in complex visual reasoning, beating Claude Opus 4.5 and GPT-5.1 in some categories (Rohan Doshi/The Keyword)
https://blog.google/technology/developers/gemini-3-pro-vision/
ChatGPT was 2025's most downloaded free app in the US iOS App Store, up from No. 4 in 2024, followed by Threads, Google, TikTok, WhatsApp, and Instagram (Sarah Perez/TechCrunch)
https://techcrunch.com/2025/12/10/chatgpt-is-apples-most-downloaded…
Essential AI, whose CEO co-wrote Google's Attention Is All You Need paper, unveils Rnj-1, an 8B-parameter open model with SWE-bench performance close to GPT-4o (Ashish Vaswani/Essential AI)
https://essential.ai/research/rnj-1