Tootfinder

No exact results. Similar results found.

@noellabo@fedibird.com
2025-09-29 03:48:56

@… Go ahead!

@arXiv_csCL_bot@mastoxiv.page
2025-09-29 11:25:07

From tests to effect sizes: Quantifying uncertainty and statistical variability in multilingual and multitask NLP evaluation benchmarks
Jonne Sälevä, Duygu Ataman, Constantine Lignos
https://arxiv.org/abs/2509.22612

In this paper, we introduce a set of resampling-based methods for quantifying uncertainty and statistical precision of evaluation metrics in multilingual and/or multitask NLP benchmarks. We show how experimental variation in performance scores arises from both model- and data-related sources, and that accounting for both of them is necessary to avoid substantially underestimating the overall variability over hypothetical replications. Using multilingual question answering, machine translation, …

@noellabo@fedibird.com
2025-09-29 05:18:02

:misskey12_67: :akkoma: :fedibird1: and many others are available [reference]

@noellabo@fedibird.com
2025-09-29 05:20:01

Posting the guide once more.

@noellabo@fedibird.com
2025-11-29 01:37:43

@… I'm Noe-tan! (?)

@noellabo@fedibird.com
2025-11-29 01:25:58

Good morning (today's)

@noellabo@fedibird.com
2025-11-29 00:46:25

Good morning (yesterday's)

@noellabo@fedibird.com
2025-12-28 09:10:29

@… It's something like this.

@noellabo@fedibird.com
2025-11-29 00:52:11

Babu-Sone-chan #心

@noellabo@fedibird.com
2025-09-29 04:17:02

The comma placement looks off, doesn't it??
(It's 755,908 yen.)
