Tootfinder

Opt-in global Mastodon full text search.

@arXiv_csCV_bot@mastoxiv.page
2025-09-25 10:16:02

ExpFace: Exponential Angular Margin Loss for Deep Face Recognition
Jinhui Zheng, Xueyuan Gong
arxiv.org/abs/2509.19753 arxiv.org/pdf/2509.1…

@arXiv_statML_bot@mastoxiv.page
2025-10-15 10:21:31

Dendrograms of Mixing Measures for Softmax-Gated Gaussian Mixture of Experts: Consistency without Model Sweeps
Do Tien Hai, Trung Nguyen Mai, TrungTin Nguyen, Nhat Ho, Binh T. Nguyen, Christopher Drovandi
arxiv.org/abs/2510.12744

@arXiv_csLG_bot@mastoxiv.page
2025-10-13 10:42:30

Task-Level Insights from Eigenvalues across Sequence Models
Rahel Rickenbach, Jelena Trisovic, Alexandre Didier, Jerome Sieber, Melanie N. Zeilinger
arxiv.org/abs/2510.09379

@arXiv_csCL_bot@mastoxiv.page
2025-10-15 10:27:41

Credal Transformer: A Principled Approach for Quantifying and Mitigating Hallucinations in Large Language Models
Shihao Ji, Zihui Song, Jiajie Huang
arxiv.org/abs/2510.12137

@arXiv_eessSY_bot@mastoxiv.page
2025-10-03 10:03:41

Computing Control Lyapunov-Barrier Functions: Softmax Relaxation and Smooth Patching with Formal Guarantees
Jun Liu, Maxwell Fitzsimmons
arxiv.org/abs/2510.02223

@arXiv_csCL_bot@mastoxiv.page
2025-10-14 13:16:18

Deconstructing Attention: Investigating Design Principles for Effective Language Modeling
Huiyin Xue, Nafise Sadat Moosavi, Nikolaos Aletras
arxiv.org/abs/2510.11602

@arXiv_quantph_bot@mastoxiv.page
2025-10-02 10:06:01

Quantum Probabilistic Label Refining: Enhancing Label Quality for Robust Image Classification
Fang Qi, Lu Peng, Zhengming Ding
arxiv.org/abs/2510.00528

@arXiv_econTH_bot@mastoxiv.page
2025-10-07 13:35:57

Crosslisted article(s) found for econ.TH. arxiv.org/list/econ.TH/new
[1/1]:
- Beyond Softmax: A New Perspective on Gradient Bandits
Emerson Melo, David Müller

@arXiv_eessAS_bot@mastoxiv.page
2025-10-08 09:42:39

TokenChain: A Discrete Speech Chain via Semantic Token Modeling
Mingxuan Wang, Satoshi Nakamura
arxiv.org/abs/2510.06201 arxiv.org/pdf/2510…

@arXiv_csCL_bot@mastoxiv.page
2025-09-30 14:12:02

An empirical study on the limitation of Transformers in program trace generation
Simeng Sun
arxiv.org/abs/2509.25073 arxiv.org/pdf/2509.250…