Tootfinder

Opt-in global Mastodon full text search.

@cosmos4u@scicomm.xyz
2025-11-17 07:46:18

Is #AI really just dumb statistics? "Olympiad-level physics problem-solving presents a significant challenge for both humans and artificial intelligence (AI), as it requires a sophisticated integration of precise calculation, abstract reasoning, and a fundamental grasp of physical principles," says the (abstract of the) paper arxiv.org/abs/2511.10515: "The Chinese Physics Olympiad (CPhO), renowned for its complexity and depth, serves as an ideal and rigorous testbed for these advanced capabilities. In this paper, we introduce LOCA-R (LOgical Chain Augmentation for Reasoning), an improved version of the LOCA framework adapted for complex reasoning, and apply it to the CPhO 2025 theory examination. LOCA-R achieves a near-perfect score of 313 out of 320 points, solidly surpassing the highest-scoring human competitor and significantly outperforming all baseline methods." Oops ...?

@tante@tldr.nettime.org
2025-10-09 08:57:05

This is just such a waste of money.
ec.social-network.europa.eu/@E

@Techmeme@techhub.social
2025-12-09 00:55:45

David Sacks says AI preemption won't apply to state laws on child safety, communities won't be forced to host data centers they don't want, and more (David Sacks/@davidsacks)
x.com/davidsacks/status/199812

@arXiv_csAI_bot@mastoxiv.page
2025-10-15 10:09:31

PricingLogic: Evaluating LLMs Reasoning on Complex Tourism Pricing Tasks
Yunuo Liu, Dawei Zhu, Zena Al-Khalili, Dai Cheng, Yanjun Chen, Dietrich Klakow, Wei Zhang, Xiaoyu Shen
arxiv.org/abs/2510.12409

@davidshq@hachyderm.io
2025-09-27 15:14:32

Found this article quite interesting: #AI #medicine #health

@mxp@mastodon.acm.org
2025-11-08 22:56:01

Interesting talk by Elle O’Brien on “the future of scientific code in the era of generative AI.”
#DigitalHumanities

@arXiv_physicssocph_bot@mastoxiv.page
2025-10-06 09:02:59

Homophily-induced Emergence of Biased Structures in LLM-based Multi-Agent AI Systems
Aliakbar Mehdizadeh, Martin Hilbert
arxiv.org/abs/2510.02637

@arXiv_csLG_bot@mastoxiv.page
2025-10-02 11:08:21

Eliciting Secret Knowledge from Language Models
Bartosz Cywiński, Emil Ryd, Rowan Wang, Senthooran Rajamanoharan, Neel Nanda, Arthur Conmy, Samuel Marks
arxiv.org/abs/2510.01070

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:16:51

InfoAgent: Advancing Autonomous Information-Seeking Agents
Gongrui Zhang, Jialiang Zhu, Ruiqi Yang, Kai Qiu, Miaosen Zhang, Zhirong Wu, Qi Dai, Bei Liu, Chong Luo, Zhengyuan Yang, Linjie Li, Lijuan Wang, Weizhu Chen, Yuan Zhang, Xin Li, Zhaoyi Liu, Xin Geng, Baining Guo
arxiv.org/abs/2509.25189

@arXiv_eessIV_bot@mastoxiv.page
2025-09-30 09:36:01

A University of Texas Medical Branch Case Study on Aortic Calcification Detection
Eric Walser, Peter McCaffrey, Kal Clark, Nicholas Czarnek
arxiv.org/abs/2509.23930

@arXiv_csRO_bot@mastoxiv.page
2025-09-23 11:59:00

GPS Denied IBVS-Based Navigation and Collision Avoidance of UAV Using a Low-Cost RGB Camera
Xiaoyu Wang, Yan Rui Tan, William Leong, Sunan Huang, Rodney Teo, Cheng Xiang
arxiv.org/abs/2509.17435