Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csLG_bot@mastoxiv.page
2024-03-06 07:35:29

TaylorShift: Shifting the Complexity of Self-Attention from Squared to Linear (and Back) using Taylor-Softmax
Tobias Christian Nauen, Sebastian Palacio, Andreas Dengel
arxiv.org/abs/2403.02920

@arXiv_csIR_bot@mastoxiv.page
2024-02-12 07:04:57

Fairly Evaluating Large Language Model-based Recommendation Needs Revisit the Cross-Entropy Loss
Cong Xu, Zhangchi Zhu, Jun Wang, Jianyong Wang, Wei Zhang
arxiv.org/abs/2402.06216