2026-03-03 18:41:03
OpenAI says GPT-5.3 Instant's tone should feel less "cringe" than GPT-5.2 Instant's, with a smoother, more to-the-point conversational style (Marcus Schuler/Implicator.ai)
https://www.implicator.ai/openai-ships-gpt-5-…
OpenAI releases GPT-5.3 Instant, which it says delivers more accurate answers and better-contextualized results when searching the web, for all ChatGPT users (OpenAI)
https://openai.com/index/gpt-5-3-instant
GPT-5.3-Codex: OpenAI introduces new coding model
With GPT-5.3-Codex, OpenAI has released a new coding model that, according to its development team, played a substantial part in its own development.
https://www.
The GPT-NL dataset is described on Huggingface, https://huggingface.co/datasets/GPT-NL/GPT-NL_Public_Corpus
With the stated amount of data, the model can only be in the GPT-3 class in terms of parameters, I think? Or does it work differently?
OpenAI launches a research preview of GPT-5.3-Codex-Spark, a smaller version of GPT-5.3-Codex that it claims generates code 15 times faster, for Pro users (David Gewirtz/ZDNET)
https://www.zdnet.com/article/openais-gpt-5-3-codex-spark-15x-faster/
Advanced AI models appear willing to deploy nuclear weapons without the same reservations humans have when put into simulated geopolitical crises.
Kenneth Payne at King’s College London set three leading large language models – GPT-5.2, Claude Sonnet 4 and Gemini 3 Flash – against each other in simulated war games. The scenarios involved intense international standoffs, including border disputes, competition for scarce resources and existential threats to regime survival.
https://www.newscientist.com/article/2516885-ais-cant-stop-recommending-nuclear-strikes-in-war-game-simulations/
Large Language Models (LLMs) are poised to disrupt knowledge work, with the emergence of delegated work as a new interaction paradigm (e.g., vibe coding). Delegation requires trust: the expectation that the LLM will faithfully execute the task without introducing errors into documents. Our large-scale experiment with 19 LLMs reveals that current models degrade documents during delegation: even frontier models (Gemini 3.1 Pro, Claude 4.6 Opus, GPT 5.4) c…
Codex-Spark: Fast coding model from OpenAI
With GPT-5.3-Codex-Spark, OpenAI is releasing a fast but inaccurate coding model. It runs on its own Cerebras chip.
https://www.heis…
What happens when AI models such as GPT-5.2, Claude Sonnet 4 or Gemini 3 Flash are used as crisis advisers? Researchers at King's College London tested exactly that in conflict simulations, with alarming results. 😰
Read the article: https://heis…
OpenAI launches GPT-5.3-Codex, which it says runs 25% faster, enabling longer-running tasks, and "is our first model that was instrumental in creating itself" (David Gewirtz/ZDNET)
https://www.zdnet.com/article/openai-gpt-5-3-codex-faster-goes-beyond-c…
"Co-authored-by: Cursor (gpt-5.3-codex-xhigh)"
Please fuck off immediately.
A study finds GPT-5.2, Claude Sonnet 4, and Gemini 3 Flash deployed tactical nuclear weapons in 95% of 21 simulated war game scenarios, and never surrendered (Chris Stokel-Walker/New Scientist)
https://www.newscientist.com/article/25168
OpenAI says GPT-5.3-Codex goes beyond an agent that can code "to an agent that can do nearly anything developers and professionals can do on a computer" (OpenAI)
https://openai.com/index/introducing-gpt-5-3-codex/
Internal doc: the State Department moved its internal chatbot from Claude Sonnet 4.5 to GPT-4.1, following Trump's directive to cancel Anthropic contracts (Nextgov/FCW)
https://www.nextgov.com/acquisition/2026/03/state-offlo…
GPT-5.3-Codex-Spark is OpenAI's first AI model to run on chips from Nvidia rival Cerebras; OpenAI says Codex has more than 1M weekly active users (Rachel Metz/Bloomberg)
https://www.bloomberg.com/news/articles/2026-02-12…
GPT-5.3-Codex and Claude Opus 4.6 can handle the full app development lifecycle on their own, a sign of what's coming for most knowledge work within five years (Matt Shumer)
https://shumer.dev/something-big-is-happening