Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@unchartedworlds@scicomm.xyz
2025-09-14 09:09:54
Content warning: LLM training frameworks, interesting

Interesting explanation of LLM training frameworks and the incentives for confident guessing.
"The authors examined ten major AI benchmarks, including those used by Google, OpenAI and also the top leaderboards that rank AI models. This revealed that nine benchmarks use binary grading systems that award zero points for AIs expressing uncertainty.
" ... When an AI system says “I don’t know”, it receives the same score as giving completely wrong information. The optimal strategy under such evaluation becomes clear: always guess. ...
"More sophisticated approaches like active learning, where AI systems ask clarifying questions to reduce uncertainty, can improve accuracy but further multiply computational requirements. ...
"Users want systems that provide confident answers to any question. Evaluation benchmarks reward systems that guess rather than express uncertainty. Computational costs favour fast, overconfident responses over slow, uncertain ones."
=
My comment: "Fast, overconfident responses" sounds a bit similar to "bullshit", does it not?
#ChatGPT #LLMs #SoCalledAI

@arXiv_csCV_bot@mastoxiv.page
2025-09-15 10:03:01

Towards Understanding Visual Grounding in Visual Language Models
Georgios Pantazopoulos, Eda B. \"Ozyi\u{g}it
arxiv.org/abs/2509.10345

@arXiv_mathCT_bot@mastoxiv.page
2025-08-14 08:17:22

Toposes with enough points as categories of \'etale spaces
Sam van Gool, J\'er\'emie Marqu\`es, Umberto Tarantino
arxiv.org/abs/2508.09604

@mlawton@mstdn.social
2025-06-15 12:04:49

Complications from surgery landed my dad back in the hospital late Wednesday. Successful exploratory surgery found and hopefully stopped the internal bleeding.
If a few key numbers remain stabile, he might get released today. But in the interim, it’ll have to be a Father’s Day video call to his hospital room.
There are myriad ways in which to view this, but I’m choosing gratefulness. #HappyFathersDay

My dad, an older man with glasses and a mustache, sits comfortably in a patterned armchair, wearing a burgundy polo shirt and blue pants. Sunlight filters through window blinds behind him, highlighting the cozy indoor setting.
@emd@cosocial.ca
2025-07-15 16:53:13

When I die, I wanna come back as a panda: explore.org/livecams/panda-bea

@arXiv_csIR_bot@mastoxiv.page
2025-08-14 07:52:12

TFRank: Think-Free Reasoning Enables Practical Pointwise LLM Ranking
Yongqi Fan, Xiaoyang Chen, Dezhi Ye, Jie Liu, Haijin Liang, Jin Ma, Ben He, Yingfei Sun, Tong Ruan
arxiv.org/abs/2508.09539

@arXiv_hepth_bot@mastoxiv.page
2025-07-14 09:03:12

The thermal backreaction of a scalar field in de Sitter spacetime
Nikos Irges, Antonis Kalogirou, Fotis Koutroulis
arxiv.org/abs/2507.08774

@arXiv_mathFA_bot@mastoxiv.page
2025-08-15 09:13:02

Sampling theorems for inverse problems on Riemannian manifolds
Giovanni S. Alberti, Ernesto De Vito, Bianca Gariboldi, Giacomo Gigante
arxiv.org/abs/2508.10810

@arXiv_mathPR_bot@mastoxiv.page
2025-07-14 09:02:32

Pointwise explicit estimates for derivatives of solutions to linear parabolic PDEs with Neumann boundary conditions
C Ciccarella
arxiv.org/abs/2507.08622

@arXiv_csCV_bot@mastoxiv.page
2025-08-15 10:22:52

Axis-level Symmetry Detection with Group-Equivariant Representation
Wongyun Yu, Ahyun Seo, Minsu Cho
arxiv.org/abs/2508.10740 arxiv.org/pdf…