
2025-09-16 11:19:26
How to Evaluate Medical AI
Ilia Kopanichuk, Petr Anokhin, Vladimir Shaposhnikov, Vladimir Makharev, Ekaterina Tsapieva, Iaroslav Bespalov, Dmitry V. Dylov, Ivan Oseledets
https://arxiv.org/abs/2509.11941
How to Evaluate Medical AI
Ilia Kopanichuk, Petr Anokhin, Vladimir Shaposhnikov, Vladimir Makharev, Ekaterina Tsapieva, Iaroslav Bespalov, Dmitry V. Dylov, Ivan Oseledets
https://arxiv.org/abs/2509.11941
MEGAN: Mixture of Experts for Robust Uncertainty Estimation in Endoscopy Videos
Damola Agbelese, Krishna Chaitanya, Pushpak Pati, Chaitanya Parmar, Pooya Mobadersany, Shreyas Fadnavis, Lindsey Surace, Shadi Yarandi, Louis R. Ghanem, Molly Lucas, Tommaso Mansi, Oana Gabriela Cula, Pablo F. Damasceno, Kristopher Standish
https://arxiv.org/ab…
Efficient Bayesian Inference from Noisy Pairwise Comparisons
Till Aczel, Lucas Theis, Wattenhofer Roger
https://arxiv.org/abs/2510.09333 https://arxiv.org/…
Investigation of the Inter-Rater Reliability between Large Language Models and Human Raters in Qualitative Analysis
Nikhil Sanjay Borse, Ravishankar Chatta Subramaniam, N. Sanjay Rebello
https://arxiv.org/abs/2508.14764