Distributed MIMO Measurements for Integrated Communication and Sensing in an Industrial Environment
Christian Nelson, Xuhong Li, Aleksei Fedorov, Benjamin J. B. Deutschmann, Fredrik Tufvesson
https://arxiv.org/abs/2403.02430
Multi-Objective Recommendation via Multivariate Policy Learning
Olivier Jeunen, Jatin Mandav, Ivan Potapov, Nakul Agarwal, Sourabh Vaid, Wenzhe Shi, Aleksei Ustimenko
https://arxiv.org/abs/2405.02141
Technical Report on BaumEvA Evolutionary Optimization Python-Library Testing
Vadim Tynchenko, Aleksei Kudryavtsev, Vladimir Nelyub, Aleksei Borodulin, Andrei Gantimurov
https://arxiv.org/abs/2405.00686
TartuNLP at EvaLatin 2024: Emotion Polarity Detection
Aleksei Dorkin, Kairit Sirts
https://arxiv.org/abs/2405.01159 https://arxiv.org…
Individuals Who Honored Navalny's Memory In St. Petersburg Given Summonses To Enlistment Office
https://www.rferl.org/a/st-petersburg-russia-mourners-navalny-summons/32830346.html
Snap Video: Scaled Spatiotemporal Transformers for Text-to-Video Synthesis
Willi Menapace, Aliaksandr Siarohin, Ivan Skorokhodov, Ekaterina Deyneka, Tsai-Shien Chen, Anil Kag, Yuwei Fang, Aleksei Stoliar, Elisa Ricci, Jian Ren, Sergey Tulyakov
https://arxiv.org/abs/2402.14797
S\~onajaht: Definition Embeddings and Semantic Search for Reverse Dictionary Creation
Aleksei Dorkin, Kairit Sirts
https://arxiv.org/abs/2404.19430 https://arxiv.org/pdf/2404.19430
arXiv:2404.19430v1 Announce Type: new
Abstract: We present an information retrieval based reverse dictionary system using modern pre-trained language models and approximate nearest neighbors search algorithms. The proposed approach is applied to an existing Estonian language lexicon resource, S\~onaveeb (word web), with the purpose of enhancing and enriching it by introducing cross-lingual reverse dictionary functionality powered by semantic search.
The performance of the system is evaluated using both an existing labeled English dataset of words and definitions that is extended to contain also Estonian and Russian translations, and a novel unlabeled evaluation approach that extracts the evaluation data from the lexicon resource itself using synonymy relations.
Evaluation results indicate that the information retrieval based semantic search approach without any model training is feasible, producing median rank of 1 in the monolingual setting and median rank of 2 in the cross-lingual setting using the unlabeled evaluation approach, with models trained for cross-lingual retrieval and including Estonian in their training data showing superior performance in our particular task.