Apple's Xcode 26 beta 7 adds support for GPT-5 and Claude Sonnet 4, which developers can use by signing into their paid Claude account (Chance Miller/9to5Mac)
https://9to5mac.com/2025/08/28/new-xcode-beta-now-available-with-gpt-5-and-claude-supp…
KI-Update: GPT-5, Google-Shopping, Bias in der KI, Unitree R1
Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.
https://www.heise.de…
GPT-IMAGE-EDIT-1.5M: A Million-Scale, GPT-Generated Image Dataset
Yuhan Wang, Siwei Yang, Bingchen Zhao, Letian Zhang, Qing Liu, Yuyin Zhou, Cihang Xie
https://arxiv.org/abs/2507.21033
GPT-FT: An Efficient Automated Feature Transformation Using GPT for Sequence Reconstruction and Performance Enhancement
Yang Gao, Dongjie Wang, Scott Piersall, Ye Zhang, Liqiang Wang
https://arxiv.org/abs/2508.20824
AI-generated stories favour stability over change: homogeneity and cultural stereotyping in narratives generated by gpt-4o-mini
Jill Walker Rettberg, Hermann Wigers
https://arxiv.org/abs/2507.22445
The GPT-4o Shock Emotional Attachment to AI Models and Its Impact on Regulatory Acceptance: A Cross-Cultural Analysis of the Immediate Transition from GPT-4o to GPT-5
Hiroki Naito
https://arxiv.org/abs/2508.16624
A study focused on OpenAI's GPT-4o mini found that LLMs can be persuaded to comply with objectionable requests using the same tactics that persuade humans (Dina Bass/Bloomberg)
Echt goed nieuws, iedereen die de beschikking heeft over relevante Nederlandse content zou dat beschikbaar moeten stellen aan GPT-NL !
Doe mee met GPT-NL! - https://gpt-nl.nl/samenwerken/doe-mee/
ChatGPT 5 power consumption could be as much as eight times higher than GPT 4
— Research institute estimates medium-sized GPT-5 response can consume up to 40 watt-hours of electricity
From Articles to Code: On-Demand Generation of Core Algorithms from Scientific Publications
Cameron S. Movassaghi, Amanda Momenzadeh, Jesse G. Meyer
https://arxiv.org/abs/2507.22324
Sonnet 4 is infinitely better than any of the GPT 4/o4 models for Typst, in my subjective opinion and recent experience. I don't know if it is trained on more recent Typst docs, or if it is just better at getting the logic of previously unseen code, but it solved my problem on second attempt, gpt whatever version did not with several (more than 10...) attempts.
I can't imagine OpenAI is going to Trumpify GPT. I'm pretty sure they've spent more on instruction tuning data than they have on model training.
Even if they wanted to throw all that out and start again, you'd need to build a coherent image of what the bot should do in any given situation. The MAGA worldview simply doesn't have the required coherence.
Sources: GPT-5 shows improved performance in coding, particularly in practical software engineering tasks, outperforming prior OpenAI models and Claude Sonnet 4 (Stephanie Palazzolo/The Information)
https://www.theinformation.com/articles/openais-gpt-5-shines-coding…
Wenn die Realität keinen Widerstand mehr leistet...
Wenn man immer recht hat, egal, was man behauptet...
Wenn Mitgefühl nur errechnet, nicht empfunden und geschenkt wird...
(noch?) ohne Bezahlschranke:
https://www.tagesanzeiger.ch/chatgpt-staer
GPT-OSS-20B: A Comprehensive Deployment-Centric Analysis of OpenAI's Open-Weight Mixture of Experts Model
Deepak Kumar, Divakar Yadav, Yash Patel
https://arxiv.org/abs/2508.16700
KI-Update kompakt: Gemini Live, AI Mode, GPT-5-Test, KI-Psychosen
Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.
https://www.…
Benchmarking GPT-5 for Zero-Shot Multimodal Medical Reasoning in Radiology and Radiation Oncology
Mingzhe Hu, Zach Eidex, Shansong Wang, Mojtaba Safari, Qiang Li, Xiaofeng Yang
https://arxiv.org/abs/2508.13192
#AI #programming assist has been helpful for me. But I'm not losing my job anytime soon. Here's a simple example of why.
I have a script with
`cmd1`
I prompt GPT-4.1 to "now invoke cmd2 and cmd3 at the end". Good:
`cmd1`
`cmd2`
`cmd3`
"Add a 15 second pau…
Source: OpenAI is planning to launch GPT-5 in early August, complete with mini and nano versions that will also be available through its API (Tom Warren/The Verge)
https://www.theverge.com/notepad-microsoft-newsletter/712950/openai-gpt-5-m…
Learning Primitive Embodied World Models: Towards Scalable Robotic Learning
Qiao Sun, Liujia Yang, Wei Tang, Wei Huang, Kaixin Xu, Yongchao Chen, Mingyu Liu, Jiange Yang, Haoyi Zhu, Yating Wang, Tong He, Yilun Chen, Xili Dai, Nanyang Ye, Qinying Gu
https://arxiv.org/abs/2508.20840
The Carbon Cost of Conversation, Sustainability in the Age of Language Models
Sayed Mahbub Hasan Amiri, Prasun Goswami, Md. Mainul Islam, Mohammad Shakhawat Hossen, Sayed Majhab Hasan Amiri, Naznin Akter
https://arxiv.org/abs/2507.20018
Empowering Educators in the Age of AI: An Empirical Study on Creating custom GPTs in Qualitative Research Method education
Qian Huang, Thijs Willems
https://arxiv.org/abs/2507.21074
This essay (ht @… ) offers a lot to chew on: some gems, some flubs, some quibblable provocations, some big insights. This sentence in particular stood out to me (context for it in the screenshot):
“Whether we’re reading or conversing, we want something to be meant, not just said.”
https://slate.com/life/2025/06/ai-chatgpt-generator-grok-gemini-writing.html
GPT-5 zu unfreundlich: OpenAI setzt wieder auf 4o als Standardmodell
Nach einer Woche GPT-5 reagiert OpenAI auf Kritik: Zahlende Nutzer erhalten 4o als Standard zurück. Das Routing von GPT-5 geht jetzt auch im Handbetrieb.
Open Weight language models are released by OpenAI.
Interesting what the experiences will be on local configurations , 16GB (V)RAM is a lot but attainable for a lot of people.
#openai
Been using this for a while & it's excellent on providing accurate, thorough, fast but SAFE output... without needing use hardcore reasoning of o3-mini.
✅ Available today: GPT-5 in Microsoft 365 Copilot | Microsoft 365 Blog
https://www.microsoft.co…
"Aussi peu glorieuse qu'elle soit, la réduction des coûts est actuellement logique du point de vue d'OpenAI. L'entreprise est plus que jamais confrontée Š la concurrence et subit une pression croissante pour trouver un moyen de rentabiliser son modèle d'entreprise. Son évaluation anticipée de quelque 500 milliards de dollars s'accompagne de l'attente implicite qu'elle trouvera bientôt un moyen de gagner de l'argent."
#ChatGPT5, it appears, is full of shit.
"#OpenAI’s products are no longer primarily aimed at consumers but at investors. As long as you avoid a full-scale user revolt (which GPT-5 actually did incur…), you can continue to assuage or even attract more backers on your path of relentless e…
Sense of Self and Time in Borderline Personality. A Comparative Robustness Study with Generative AI
Marcin Moskalewicz, Anna Sterna, Marek Pokropski, Paula Flores
https://arxiv.org/abs/2508.19008
Leveraging Multi-Source Textural UGC for Neighbourhood Housing Quality Assessment: A GPT-Enhanced Framework
Qiyuan Hong, Huimin Zhao, Ying Long
https://arxiv.org/abs/2508.16657 …
That's the output of #GPT5 *HIGH* 😞
I get it now, if many people complain about GPT-5 🙄
and yes: i have never seen anything similar from sonnet.....
#ai #coding
#heiseshow: GPT-5, ICE L, Solar-Förderung
In der #heiseshow: OpenAI veröffentlicht GPT-5, die Bahn jubelt über die ICE L-Zulassung und es gibt Aufregung um die neue Solarförderung.
Performance of GPT-5 in Brain Tumor MRI Reasoning
Mojtaba Safari, Shansong Wang, Mingzhe Hu, Zach Eidex, Qiang Li, Xiaofeng Yang
https://arxiv.org/abs/2508.10865 https://…
Just in time for the version that seems to be struggling the msot...
"Apple Intelligence’s ChatGPT integration will use GPT-5 starting with iOS 26"
https://www.theverge.com/news/756799/apple-intelligence-openai-chatgpt-gpt-5-ios…
A now-deleted GitHub blog post reveals GPT-5, available as gpt-5, gpt-5-mini, gpt-5-nano, and gpt-5-chat with "major improvements" in reasoning, code, and more (Tom Warren/The Verge)
https://www.theverge.com/news/752091/openai-gpt-5-model-announceme…
KI-Update kompakt: Stromnetze, o3 vs. GPT-5, Claude, KI-Buzzwords, FrOSCon
Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.
https://www.
GPT-NL, een mooi initiatief dat voortgang maakt, let ook op het doel "Tot slot is het goed om in het achterhoofd te houden dat GPT-NL wordt ontwikkeld voor specifieke taken: samenvatten, versimpelen, en het extraheren van informatie. Het doel van GPT-NL is niet om een generiek kennismodel te ontwikkelen."
Lees deze blog:
https://…
GPT-5 review: GPT-5-Thinking is a substantial upgrade over o3, Auto is only useful for free tier users, picking the right model still matters, and more (Zvi Mowshowitz/Don't Worry About the Vase)
https://thezvi.substack.com/p/gpt-5s-are-alive-synthesis
Can GPT-4o Evaluate Usability Like Human Experts? A Comparative Study on Issue Identification in Heuristic Evaluation
Guilherme Guerino, Luiz Rodrigues, Bruna Capeleti, Rafael Ferreira Mello, Andr\'e Freire, Luciana Zaina
https://arxiv.org/abs/2506.16345
LLM vs. SAST: A Technical Analysis on Detecting Coding Bugs of GPT4-Advanced Data Analysis
Madjid G. Tehrani, Eldar Sultanow, William J. Buchanan, Mahkame Houmani, Christel H. Djaha Fodja
https://arxiv.org/abs/2506.15212
Some developers say GPT-5 excels at technical reasoning and planning coding tasks and is cost-effective, but Claude Opus and Sonnet still produce better code (Lauren Goode/Wired)
https://www.wired.com/story/gpt-5-coding-review-software-engineering/
KI-Update kompakt: unfreundliches GPT-5, Meta, KI-Mutterinstinkte, Krebsvorsorge
Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.
https://www.…
Echo-4o: Harnessing the Power of GPT-4o Synthetic Images for Improved Image Generation
Junyan Ye, Dongzhi Jiang, Zihao Wang, Leqi Zhu, Zhenghao Hu, Zilong Huang, Jun He, Zhiyuan Yan, Jinghua Yu, Hongsheng Li, Conghui He, Weijia Li
https://arxiv.org/abs/2508.09987
Ethan Mollick about GPT-5,
#AI #GPT5
OpenAI makes Realtime API generally available with new features, including MCP support, and launches gpt-realtime, its most advanced speech-to-speech model (Sabrina Ortiz/ZDNET)
https://www.zdnet.com/article/openai-gives-its-voice…
Caregiver-in-the-Loop AI: A Simulation-Based Feasibility Study for Dementia Task Verification
Joy Lai, David Black, Kelly Beaton, Bing Ye, Alex Mihailidis
https://arxiv.org/abs/2508.18267
Internal OpenAI code suggests a tiered GPT-5 rollout: free users get basic GPT-5, Plus users get advanced reasoning, and Pro gets research-level performance (Alexey Shabanov/TestingCatalog)
https://www.testingcatalog.com/leaked-details-revea…
Has GPT-5 Achieved Spatial Intelligence? An Empirical Study
Zhongang Cai, Yubo Wang, Qingping Sun, Ruisi Wang, Chenyang Gu, Wanqi Yin, Zhiqian Lin, Zhitao Yang, Chen Wei, Xuanke Shi, Kewang Deng, Xiaoyang Han, Zukai Chen, Jiaqi Li, Xiangyu Fan, Hanming Deng, Lewei Lu, Bo Li, Ziwei Liu, Quan Wang, Dahua Lin, Lei Yang
https://arxiv.org/abs/2…
KI-Update: Chat GPT-5, KI-Übersetzer, KI und Unis, KI-Schuld, Nvidia und China
Das "KI-Update" liefert werktäglich eine Zusammenfassung der wichtigsten KI-Entwicklungen.
https://www.
Crosslisted article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/1]:
- Capabilities of GPT-5 across critical domains: Is it the next breakthrough?
Georgios P. Georgiou
OpenAI restores GPT-4o as default for all paid ChatGPT users, vows "plenty of notice" if 4o is deprecated, raises GPT-5 Thinking rate limits to 3K messages/week (Carl Franzen/VentureBeat)
https://venturebeat.com/a…
GPT-5's system card says gpt-5-thinking has a hallucination rate of 4.5% with browsing enabled, compared to gpt-5-main's 9.6%, GPT-4o's 12.9%, and o3's 12.7% (Cecily Mauran/Mashable)
https://mashable.com/article/openai-gpt-5-hallucinates-less-syst…
a good blog but the most relevant line is "GPT-5 may be a moderate quantitative improvement (and it may be cheaper) but it still fails in all the same qualitative ways as its predecessors" , very true but indeed now using it a few days and i notice those moderate improvements.. And i was already aware of all it's failings.
For me in day-to-day use it is better and that is wat counts for me at least. Oh and always (yes always) check outcomes before you use them.
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[1/3]:
- Comparison of pipeline, sequence-to-sequence, and GPT models for end-to-end relation extraction: ...
Shashank Gupta, Xuguang Ai, Ramakanth Kavuluru
OpenAI releases gpt-oss-120b and gpt-oss-20b, its first open-weight models since GPT-2; the smaller model can run locally on a consumer device with 16GB of RAM (Reece Rogers/Wired)
https://www.wired.com/story/openai-just-released-its-first-open-wei…
Is ChatGPT-5 Ready for Mammogram VQA?
Qiang Li, Shansong Wang, Mingzhe Hu, Mojtaba Safari, Zachary Eidex, Xiaofeng Yang
https://arxiv.org/abs/2508.11628 https://
OpenAI says GPT-5 is its first "unified" AI model and combines the reasoning abilities of its o-series of models with the fast responses of its GPT series (Maxwell Zeff/TechCrunch)
https://techcrunch.com/2025/08/07/openais-gpt-5-is-here/
Performance of GPT-5 Frontier Models in Ophthalmology Question Answering
Fares Antaki, David Mikhail, Daniel Milad, Danny A Mammo, Sumit Sharma, Sunil K Srivastava, Bing Yu Chen, Samir Touma, Mertcan Sevgi, Jonathan El-Khoury, Pearse A Keane, Qingyu Chen, Yih Chung Tham, Renaud Duval
https://arxiv.org/abs/2508.09956
TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability
Mohammad Aflah Khan, Ameya Godbole, Johnny Tian-Zheng Wei, Ryan Wang, James Flemings, Krishna Gummadi, Willie Neiswanger, Robin Jia
https://arxiv.org/abs/2507.19419

TokenSmith: Streamlining Data Editing, Search, and Inspection for Large-Scale Language Model Training and Interpretability
Understanding the relationship between training data and model behavior during pretraining is crucial, but existing workflows make this process cumbersome, fragmented, and often inaccessible to researchers. We present TokenSmith, an open-source library for interactive editing, inspection, and analysis of datasets used in Megatron-style pretraining frameworks such as GPT-NeoX, Megatron, and NVIDIA NeMo. TokenSmith supports a wide range of operations including searching, viewing, ingesting, expor…
GPT-5 hands-on: it exudes competence but doesn't feel like a dramatic leap ahead of other LLMs, and the pricing is aggressively competitive with other providers (Simon Willison/Simon Willison's Weblog)
https://simonwillison.net/2025/Aug/7/gpt-5/
AI geletterdheid betekent ook, snappen dat als GPT-5 het aantal B's in het woord "Blueberry" niet correct kan tellen, dat niet betekent dat het model waardeloos/onbruikbaar is... Modellen als GPT-5 zijn goed in bepaalde dingen en slecht in andere. We moeten leren hoe we ze waar toepassen en waar niet. Daarbij ook de kosten afwegen tegen de baten, is zoiets als ChatGPT wel nodig, afwegingen maken mbt bias, ethiek etc.
("GPT-5 Thinking" doet het overigens wel corr…
Apple says Apple Intelligence will use OpenAI's GPT-5 on iOS 26, iPadOS 26, and macOS Tahoe 26, with the system updates expected to arrive in September (Zac Hall/9to5Mac)
https://9to5mac.com/2025/08/07/apple-intelligence-gpt-5-chatgpt-integration/
OpenAI highlights GPT-5 scores on math, coding, and health benchmarks: 94.6% on AIME 2025 without tools, 74.9% on SWE-bench Verified, 46.2% on HealthBench Hard (Carl Franzen/VentureBeat)
https://venturebeat.com/ai/openai-launches-gpt-5-n…
GPT-5's release was underwhelming, offering incremental improvements and failing to meet expectations, showing that pure scaling simply isn't the path to AGI (Gary Marcus/Marcus on AI)
https://garymarcus.substack.com/p/gpt-5-overdue-overhyped-and-underwhel…
ThinkDial: An Open Recipe for Controlling Reasoning Effort in Large Language Models
Qianyu He, Siyu Yuan, Xuefeng Li, Mingxuan Wang, Jiangjie Chen
https://arxiv.org/abs/2508.18773
GPT-5's router directs queries based on complexity and intent, helping OpenAI allocate compute for low-value informational and high-value commercial requests (SemiAnalysis)
https://semianalysis.com/2025/08/13/gpt-5-ad-monetization-and-the-superapp/
OpenAI says ChatGPT Pro users can select old models for now but plans to deprecate them in 60 days; Sam Altman says Plus users will be able to keep using GPT-4o (Joanna Stern/Joanna Stern's Newsletter)
https://joannastern.beehiiv.com/p/gpt-5-and-wh…
OpenAI releases GPT-5 pro, a version with extended reasoning exclusive to ChatGPT Pro subscribers, saying it scored 88.4% without tools on the GPQA benchmark (Maximilian Schreiner/The Decoder)
https://the-decoder.com/openai-claims-
Moonshot's Kimi K2 uses a 1T-parameter MoE architecture with 32B active parameters and outperforms models like GPT-4.1 and DeepSeek-V3 on key benchmarks (Michael Nuñez/VentureBeat)
https://venturebeat.com/ai/moonshot-ais-kimi-k2-outperfor…
During the GPT-5 livestream, OpenAI showed two charts whose scales were all over the place, with Sam Altman later calling one "a mega chart screwup from us" (Jay Peters/The Verge)
https://www.theverge.com/news/756444/openai-gpt-5-vibe-graphing-chart-crim…
xAI makes Grok 4 free for all users worldwide after making Grok Imagine free for all US users; Grok 4 Heavy remains exclusive to SuperGrok Heavy subscribers (Omair Pall/Mashable India)
https://in.mashable.com/tech/98367/elon…
Sam Altman says OpenAI "totally screwed up some things" on the GPT-5 rollout, confirms plans to fund a brain-computer interface startup to rival Neuralink (Alex Heath/The Verge)
https://www.theverge.com/command-line-news
Sam Altman says OpenAI should prioritize growth and its investments in training and compute "for a long time", even if it delays its path to profitability (Ashley Capoot/CNBC)
https://www.cnbc.com/2025/08/08/chatgpt-gpt-5-openai-altman-loss.html
Q&A with OpenAI VP and Head of ChatGPT Nick Turley on ChatGPT's future, showing ads in chatbots, hallucinations, GPT-5 blowback, 4o, subscriptions, and more (Alex Heath/The Verge)
https://www.theverge.com/decoder-podcast-w
OpenAI introduces "Auto", "Fast", and "Thinking" settings for GPT-5 in ChatGPT's model picker, with "Auto" similar to the GPT-5 model router announced earlier (Maxwell Zeff/TechCrunch)
https://techcrunch.com/2025/08/12/chat
With GPT-5's launch, OpenAI has removed its older models like GPT-4o and o3 from the ChatGPT model selector, sparking a backlash from some users (Michael Kan/PCMag)
https://www.pcmag.com/news/openai-faces-backlash-for-retiring-older-models-wit…
A new Artificial Analysis benchmark, focusing on OpenAI's gpt-oss-120b, shows how open-weight LLMs exhibit inconsistent performance across hosting providers (Simon Willison/Simon Willison's Weblog)
https://simonwillison.net/2025/Aug/15/inconsistent-performance/…
GPT-5 will use "safe completions", a training approach to maximize model helpfulness within safety constraints and an improvement over refusal-based training (OpenAI)
https://openai.com/index/gpt-5-safe-completions
Q&A with David Luan, head of Amazon's AGI research lab, on leaving Adept in a reverse acquihire deal, why he believes progress on AI models has slowed, and more (Alex Heath/The Verge)
https://www.theverge.com/decoder-podcast-w