Tootfinder

Opt-in global Mastodon full text search. Join the index!

@heiseonline@social.heise.de
2025-12-12 04:09:00

GPT-5.2: Neues KI-Modell von OpenAI soll Büroarbeiten besser unterstützen
Nur einen Monat nach GPT-5.1 kommt ein neues KI-Modell der ChatGPT-Entwickler. GPT-5.2 soll bessere Tabellen, Präsentationen und Code produzieren können.

@Techmeme@techhub.social
2025-12-11 18:18:02

OpenAI says GPT-5.2 Thinking hallucinates less than GPT-5.1 and has improved reliability for agentic AI needs; pre-release testers include Notion, Box, Shopify (Hayden Field/The Verge)
theverge.com/ai-artificial-int

@Techmeme@techhub.social
2025-12-12 07:01:18

GPT-5.2 models match GPT-5 and 5.1 with a 400K context window and 128K max output tokens, but have a newer knowledge cutoff of Aug. 31, 2025 vs. Sept. 30, 2024 (Simon Willison/Simon Willison's Newsletter)
simonw.substack.com/p/gpt-52-a

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 11:05:49

Single layer tiny Co$^4$ outpaces GPT-2 and GPT-BERT
Noor Ul Zain, Mohsin Raza, Ahsan Adeel
arxiv.org/abs/2510.08404 arxiv.org/pdf/2510.084…

@peterhoneyman@a2mi.social
2025-10-14 10:04:31

i got up at 6 a.m. to wait in line

EADME someday
WX
WeatherSpark G 2-step
Coordinated Calen...
UM-GPT
Internet Speed Tes...
OPÉRA
NATIONAL
DE PARIS
Welcome to Opéra national de Paris
Sales for the operas Satyagraha and Rusalka, the Empreintes ballet programme, the ballets Romeo
and Juliet and La Dame aux camélias, and the Ballet School Demonstrations and the concert Hector
Berlioz open today.
The website will be available in...
03:02
min.
sec.
Your waiting time is updated periodically. Once elapsed, you will be able to enter the…
@Techmeme@techhub.social
2025-12-11 18:06:51

OpenAI launches GPT-5.2, its "best model yet," in Instant, Thinking, and Pro variants, with significant improvements in writing, coding, and reasoning (Maxwell Zeff/Wired)
wired.com/story/openai-gpt-lau

@heiseonline@social.heise.de
2025-12-12 05:18:00

Freitag: Kritik an eID-Karte wegen Geldwäsche, neues OpenAI-Modell als Bürohilfe
eID-Karte zu einfach zu ergaunern GPT-5.2 für Profi-Nutzer Disney gegen Google-KI wegen Copyright Kritik an EU wegen VMware Roboter-Bewegungen erklärt

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:44:10

Weight Initialization and Variance Dynamics in Deep Neural Networks and Large Language Models
Yankun Han
arxiv.org/abs/2510.09423 arxiv.org…

@arXiv_csCL_bot@mastoxiv.page
2025-09-16 12:25:57

XplaiNLP at CheckThat! 2025: Multilingual Subjectivity Detection with Finetuned Transformers and Prompt-Based Inference with Large Language Models
Ariana Sahitaj, Jiaao Li, Pia Wenzel Neves, Fedor Splitt, Premtim Sahitaj, Charlott Jakob, Veronika Solopova, Vera Schmitt
arxiv.org/abs/2509.12130

@Techmeme@techhub.social
2025-12-11 19:16:04

[Thread] GPT-5.2 is now available in the API, priced at $1.75/1M input and $14/1M output tokens; GPT-5.2 Pro is priced at $21/1M input and $168/1M output tokens (@openaidevs)
x.com/openaidevs/status/199918

@arXiv_csCV_bot@mastoxiv.page
2025-10-13 10:27:00

CapGeo: A Caption-Assisted Approach to Geometric Reasoning
Yuying Li, Siyi Qian, Hao Liang, Leqi Zheng, Ruichuan An, Yongzhen Guo, Wentao Zhang
arxiv.org/abs/2510.09302

@arXiv_csCL_bot@mastoxiv.page
2025-09-16 12:17:37

Growing Perspectives: Modelling Embodied Perspective Taking and Inner Narrative Development Using Large Language Models
Sabrina Patania, Luca Annese, Anna Lambiase, Anita Pellegrini, Tom Foulsham, Azzurra Ruggeri, Silvia Rossi, Silvia Serino, Dimitri Ognibene
arxiv.org/abs/2509.11868

@Techmeme@techhub.social
2025-11-13 20:41:04

Baidu unveils Ernie 5.0, an AI model to process and generate text, images, audio, and video, claiming it beats GPT-5-High and Gemini 2.5 Pro on some benchmarks (Carl Franzen/VentureBeat)
venturebeat.com/ai/baidu-unvei

@ErikJonker@mastodon.social
2025-11-08 15:07:21

Kimi K2 is another Deepseek moment it seems, only not everybody is noticing it yet. It will be interesting to see what the stock market will do on monday.
#AI #KimiK2

Someone tested Kimi K2 on unpublished material and it performed as good as GPT-5 and Gemini 2.5
@Techmeme@techhub.social
2025-11-13 20:35:45

Anthropic open sources a method to score AI model political evenhandedness; Gemini 2.5 Pro got 97%, Grok 4 96%, Claude Opus 4.1 95%, GPT-5 89%, and Llama 4 66% (Ina Fried/Axios)
axios.com/2025/11/13/anthropic

@Techmeme@techhub.social
2025-12-11 18:45:58

OpenAI says GPT‑5.2 Thinking beats or ties industry professionals on 70.9% of GDPval knowledge work tasks, delivering outputs at >11x the speed and <1% the cost (OpenAI)
openai.com/index/introducing-g

@mariyadelano@hachyderm.io
2025-11-13 22:00:11

Curious that whenever someone shows me “the cool #AI flow” they built that’s supposed to be impressive, the conversation goes the same way:
Stage 1: “But you don’t understand. You don’t like AI because you haven’t used it right. Let me show you how much you can do it with.”
Stage 2: “Here are the steps in the flow and the instructions I feed to this agent / custom GPT / Claude project. I tell it to do X, reference document Y, and aim for Z.”
Stage 3: “Now, let me show you the results it gives.”
*Writes task, presses to run the prompt.*
Stage 4: “Umm sorry it’s taking a while. It’s fast but not instant. And by the way, the prompt isn’t perfect, you can definitely make it better. I just threw this together real quick the other day. It makes some mistakes, but it’s really good.”
Stage 5: “Uuuuuuh actually don’t look at the output.” *scrolls or stops screen share or pulls device away.*
“You know it’s already doing so well, if I do more prompt engineering it will get really good but I need to give it better instructions. And it ran just fine last night, I don’t know what’s up with it. And this is a cheap model, if we use another model it will be better.”
Stage 6: “You know, you really shouldn’t judge this so much. The technology will improve, it will get there sooner than you know and then you’ll regret not trying it sooner.”
So curious that this keeps happening 🤷‍♀️
#LLMs #work #tech #AIBubble

@Mediagazer@mstdn.social
2025-09-30 17:10:48

OpenAI releases an invitation-only Sora app on iOS, powered by Sora 2, to let people create and share AI-generated videos of themselves and their friends (Ina Fried/Axios)
axios.com/2025/09/30/openai-so

@wrog@mastodon.murkworks.net
2025-11-04 23:42:58

MERCER ISLAND SCHOOL BOARD
Wow, you really do have to watch the downballot races. Mercer Island School Board has two (2) candidates (O'Callahan is and Gaspar) that are *both* software CTOs touting their "AI" credentials. Gaspar explicitly wants "free AI classes".
Here's a hint: The only "AI classes" that kids need are ones that teach them how to TURN ALL OF THAT SHIT OFF, and learn to think and write in their own words, not Chat-GPT'…

@UP8@mastodon.social
2025-09-29 15:25:58

🧾 Multi-Modal Vision vs. Text-Based Parsing: Benchmarking LLM Strategies for Invoice Processing
#software

@Techmeme@techhub.social
2025-12-12 17:06:30

Companies are updating insider trading policies to cover prediction markets; Kalshi and others are pushing for federal oversight, including of insider trading (Rocket Drew/The Information)
theinformation.com/articles/po

@arXiv_csCL_bot@mastoxiv.page
2025-10-07 12:18:02

Resource-Efficient Fine-Tuning of LLaMA-3.2-3B for Medical Chain-of-Thought Reasoning
Imran Mansha
arxiv.org/abs/2510.05003 arxiv.org/pdf/2…

@arXiv_qbioGN_bot@mastoxiv.page
2025-10-02 07:59:40

A Deep Learning Pipeline for Epilepsy Genomic Analysis Using GPT-2 XL and NVIDIA H100
Muhammad Omer Latif, Hayat Ullah, Muhammad Ali Shafique, Zhihua Dong
arxiv.org/abs/2510.00392

@Techmeme@techhub.social
2025-10-06 19:40:53

OpenAI announces API updates, including GPT-5 Pro, Sora 2 in preview, and gpt-realtime-mini, a voice model that is 70% cheaper than gpt-realtime (Rebecca Bellan/TechCrunch)
techcrunch.com/2025/10/06/open

@arXiv_csAI_bot@mastoxiv.page
2025-09-22 14:05:19

Replaced article(s) found for cs.AI. arxiv.org/list/cs.AI/new
[2/6]:
- Understanding AI Evaluation Patterns: How Different GPT Models Assess Vision-Language Descriptions
Sajjad Abdoli, Rudi Cilibrasi, Rima Al-Shikh

@arXiv_csCL_bot@mastoxiv.page
2025-10-10 10:51:19

Evaluating LLM-Generated Legal Explanations for Regulatory Compliance in Social Media Influencer Marketing
Haoyang Gui, Thales Bertaglia, Taylor Annabell, Catalina Goanta, Tjomme Dooper, Gerasimos Spanakis
arxiv.org/abs/2510.08111

@Techmeme@techhub.social
2025-09-30 17:05:55

OpenAI releases an invitation-only Sora app on iOS, powered by Sora 2, to let people create and share AI-generated videos of themselves and their friends (Ina Fried/Axios)
axios.com/2025/09/30/openai-so

@Techmeme@techhub.social
2025-10-29 12:21:02

OpenAI releases gpt-oss-safeguard, its open-weight reasoning models for safety classification tasks, available in 120B and 20B parameters, under Apache 2.0 (OpenAI)
openai.com/index/introducing-g

@Techmeme@techhub.social
2025-11-30 06:40:47

Alibaba Technical Report: Qwen3-VL beats GPT-5 and Gemini 2.5 Pro on visual tasks and has 100% accuracy on "needle-in-a-haystack" tests for 30-minute videos (Jonathan Kemper/The Decoder)
the-decoder.com/qwen3-vl-can-s

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:10:25

Ultra-Fast Language Generation via Discrete Diffusion Divergence Instruct
Haoyang Zheng, Xinyang Liu, Cindy Xiangrui Kong, Nan Jiang, Zheyuan Hu, Weijian Luo, Wei Deng, Guang Lin
arxiv.org/abs/2509.25035

@Techmeme@techhub.social
2025-11-18 20:55:53

Gemini 3 Pro is priced at $2-$4 per 1M input tokens and $12-$18 per 1M output tokens, cheaper than Claude Sonnet 4.5 but more expensive than GPT-5.1 (Simon Willison/Simon Willison's Weblog)
simonwillison.net/2025/Nov/18/

@arXiv_csCL_bot@mastoxiv.page
2025-09-23 12:43:10

Investigating Bias: A Multilingual Pipeline for Generating, Solving, and Evaluating Math Problems with LLMs
Mariam Mahran, Katharina Simbeck
arxiv.org/abs/2509.17701

@arXiv_csCL_bot@mastoxiv.page
2025-09-19 10:33:11

A Comparative Evaluation of Large Language Models for Persian Sentiment Analysis and Emotion Detection in Social Media Texts
Kian Tohidi, Kia Dashtipour, Simone Rebora, Sevda Pourfaramarz
arxiv.org/abs/2509.14922