The Confidence Paradox: Can LLM Know When It's Wrong
Sahil Tripathi, Md Tabrez Nafis, Imran Hussain, Jiechao Gao
https://arxiv.org/abs/2506.23464 https…
Computational Analysis of Climate Policy
Carolyn Hicks
https://arxiv.org/abs/2506.22449 https://arxiv.org/pdf/2506.22449
Automatic Phase Calibration for High-resolution mmWave Sensing via Ambient Radio Anchors
Ruixu Geng, Yadong Li, Dongheng Zhang, Pengcheng Huang, Binquan Wang, Binbin Zhang, Zhi Lu, Yang Hu, Yan Chen
https://arxiv.org/abs/2506.23472
MiCo: Multi-image Contrast for Reinforcement Visual Reasoning
Xi Chen, Mingkang Zhu, Shaoteng Liu, Xiaoyang Wu, Xiaogang Xu, Yu Liu, Xiang Bai, Hengshuang Zhao
https://arxiv.org/abs/2506.22434
Negated String Containment is Decidable (Technical Report)
Vojtěch Havlena, Michal Hečko, Lukáš Holík, Ondřej Lengál
https://arxiv.org/abs/2506.22061
Machine Assistant with Reliable Knowledge: Enhancing Student Learning via RAG-based Retrieval
Yongsheng Lian
https://arxiv.org/abs/2506.23026 https://
Exponential decay in $O(n)$-invariant quantum spin systems
Jakob E. Björnberg, Kieran Ryan
https://arxiv.org/abs/2506.22254 https://
MARBLE: A Hard Benchmark for Multimodal Spatial Reasoning and Planning
Yulun Jiang, Yekun Chai, Maria Brbić, Michael Moor
https://arxiv.org/abs/2506.22992
Peer Review as Structured Commentary: Immutable Identity, Public Dialogue, and Reproducible Scholarship
Craig Steven Wright
https://arxiv.org/abs/2506.22497
HLTCOE at LiveRAG: GPT-Researcher using ColBERT retrieval
Kevin Duh, Eugene Yang, Orion Weller, Andrew Yates, Dawn Lawrie
https://arxiv.org/abs/2506.22356 …