Tootfinder

Opt-in global Mastodon full text search. Join the index!

No exact results. Similar results found.
@Techmeme@techhub.social
2025-09-01 19:55:37

Tencent open sources translation models Hunyuan-MT-7B and Hunyuan-MT-Chimera-7B, which support 33 languages, claiming they beat established models in benchmarks (Jonathan Kemper/The Decoder)
the-decoder.com/tencent-open-s

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 09:45:42

Med-RewardBench: Benchmarking Reward Models and Judges for Medical Multimodal Large Language Models
Meidan Ding, Jipeng Zhang, Wenxuan Wang, Cheng-Yi Li, Wei-Chieh Fang, Hsin-Yu Wu, Haiqin Zhong, Wenting Chen, Linlin Shen
arxiv.org/abs/2508.21430

@arXiv_csLO_bot@mastoxiv.page
2025-09-30 07:57:34

The Complexity of Defining and Separating Fixpoint Formulae in Modal Logic
Jean Christoph Jung, J\k{e}drzej Ko{\l}odziejski
arxiv.org/abs/2509.24583

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:46:27

Image-Difficulty-Aware Evaluation of Super-Resolution Models
Atakan Topaloglu, Ahmet Bilican, Cansu Korkmaz, A. Murat Tekalp
arxiv.org/abs/2509.26398

@arXiv_csLG_bot@mastoxiv.page
2025-09-30 14:38:41

Double Descent as a Lens for Sample Efficiency in Autoregressive vs. Discrete Diffusion Models
Ahmad Fraij, Sam Dauncey
arxiv.org/abs/2509.24974

@Mediagazer@mstdn.social
2025-10-31 23:21:11

An analysis of AI training datasets, compiled by The Atlantic, shows AI models were trained on hundreds of thousands of YouTube videos from news publishers (Andrew Deck/Nieman Lab)
niemanlab.org/2025/10/hundred…

@arXiv_csAI_bot@mastoxiv.page
2025-09-01 08:37:52

AHELM: A Holistic Evaluation of Audio-Language Models
Tony Lee, Haoqin Tu, Chi Heem Wong, Zijun Wang, Siwei Yang, Yifan Mai, Yuyin Zhou, Cihang Xie, Percy Liang
arxiv.org/abs/2508.21376

@Techmeme@techhub.social
2025-11-01 06:35:57

Despite the hype around large AI models, many companies like Meta are using small models for routine tasks, finding them more practical and cost-effective (Christopher Mims/Wall Street Journal)
wsj.com…

@arXiv_csCL_bot@mastoxiv.page
2025-09-01 09:41:52

Challenges and Applications of Large Language Models: A Comparison of GPT and DeepSeek family of models
Shubham Sharma, Sneha Tuli, Narendra Badam
arxiv.org/abs/2508.21377

@Techmeme@techhub.social
2025-09-29 04:55:37

DeepMind: video models like Veo 3 could become general purpose foundation models for vision, like LLMs for text, using zero-shot "chain-of-frames" reasoning (Simon Willison/Simon Willison's Weblog)
simonwillison.net/2025/Sep/27/