Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@arXiv_csCL_bot@mastoxiv.page
2025-07-14 09:56:22

A Third Paradigm for LLM Evaluation: Dialogue Game-Based Evaluation using clembench
David Schlangen, Sherzod Hakimov, Jonathan Jordan, Philipp Sadler
arxiv.org/abs/2507.08491

@heiseonline@social.heise.de
2025-09-11 16:38:00

Forschung: Verschmelzen Menschen und KI zu einem "evolutionären Individuum"?
Mit der Entwicklung von KI-Technik könnte die Menschheit einen großen evolutionären Übergang eingeleitet haben. Das meinen zumindest zwei Evolutionsbiologen.

@Dragofix@mastodontti.fi
2025-07-12 22:27:10

Liito-orava on taigametsien tulevaisuuden avainlaji. Tuore geneettinen tutkimus paljastaa yllättäviä piirteitä liito-oravan evoluutiosta sekä vakavia huolia lajin suojelun kannalta. Kaukoidässä saattaa asustaa oma alalaji. helsinki.fi/fi/uutiset/evoluut

@arXiv_csCR_bot@mastoxiv.page
2025-08-14 08:35:52

Succinct Oblivious Tensor Evaluation and Applications: Adaptively-Secure Laconic Function Evaluation and Trapdoor Hashing for All Circuits
Damiano Abram, Giulio Malavolta, Lawrence Roy
arxiv.org/abs/2508.09673

@arXiv_csAI_bot@mastoxiv.page
2025-08-14 07:30:52

An Automated Multi-Modal Evaluation Framework for Mobile Intelligent Assistants
Meiping Wang, Jian Zhong, Rongduo Han, Liming Kang, Zhengkun Shi, Xiao Liang, Xing Lin, Nan Gao, Haining Zhang
arxiv.org/abs/2508.09507

@arXiv_csCL_bot@mastoxiv.page
2025-07-14 09:48:52

Beyond N-Grams: Rethinking Evaluation Metrics and Strategies for Multilingual Abstractive Summarization
Itai Mondshine, Tzuf Paz-Argaman, Reut Tsarfaty
arxiv.org/abs/2507.08342

@cosmos4u@scicomm.xyz
2025-09-13 00:51:12

The Secular Evolution of #PlanetaryNebula IC 418 and Its Implications for Carbon Star Formation: iopscience.iop.org/article/10. -> HKU Astrophysics Research Captures 130 Years of Evolution of a Dying Star: hku.hk/press/news_detail_28550

@arXiv_csCV_bot@mastoxiv.page
2025-08-14 10:15:42

January Food Benchmark (JFB): A Public Benchmark Dataset and Evaluation Suite for Multimodal Food Analysis
Amir Hosseinian, Ashkan Dehghani Zahedani, Umer Mansoor, Noosheen Hashemi, Mark Woodward
arxiv.org/abs/2508.09966

@arXiv_csAI_bot@mastoxiv.page
2025-08-14 07:30:32

The Othello AI Arena: Evaluating Intelligent Systems Through Limited-Time Adaptation to Unseen Boards
Sundong Kim
arxiv.org/abs/2508.09292

@arXiv_csCL_bot@mastoxiv.page
2025-08-13 10:05:22

Reveal-Bangla: A Dataset for Cross-Lingual Multi-Step Reasoning Evaluation
Khondoker Ittehadul Islam, Gabriele Sarti
arxiv.org/abs/2508.08933