
2025-07-11 09:55:51
Rethinking the Privacy of Text Embeddings: A Reproducibility Study of "Text Embeddings Reveal (Almost) As Much As Text"
Dominykas Seputis, Yongkang Li, Karsten Langerak, Serghei Mihailov
https://arxiv.org/abs/2507.07700
LGND, which uses vector embeddings to analyze geospatial data and has an enterprise app to query it, raised a $9M seed led by Javelin Venture Partners (Tim De Chant/TechCrunch)
https://techcrunch.com/2025/07/10/lgnd-wants-to-make-chatgpt-for-the-earth/
This https://arxiv.org/abs/2505.17282 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
Equivariant $K$-theory of cellular toroidal embeddings
Alexis Tchoudjem, V. Uma
https://arxiv.org/abs/2506.07867 https://arxiv.org/pd…
Revealing the Hidden Temporal Structure of HubertSoft Embeddings based on the Russian Phonetic Corpus
Anastasia Ananeva, Anton Tomilov, Marina Volkova
https://arxiv.org/abs/2507.06794
Bridging Logic and Learning: Decoding Temporal Logic Embeddings via Transformers
Sara Candussio, Gaia Saveri, Gabriele Sarti, Luca Bortolussi
https://arxiv.org/abs/2507.07808
Extracting Information About Publication Venues Using Citation-Informed Transformers
Brian D. Zimmerman, Joshua Folkins, Olga Vechtomova
https://arxiv.org/abs/2506.08199
Platform for Representation and Integration of multimodal Molecular Embeddings
Erika Yilin Zheng, Yu Yan, Baradwaj Simha Sankar, Ethan Ji, Steven Swee, Irsyad Adam, Ding Wang, Alexander Russell Pelletier, Alex Bui, Wei Wang, Peipei Ping
https://arxiv.org/abs/2507.07367
Towards an Explainable Comparison and Alignment of Feature Embeddings
Mohammad Jalali, Bahar Dibaei Nia, Farzan Farnia
https://arxiv.org/abs/2506.06231 htt…
Piecewise-linear embeddings of the space of 3D lattices into $\mathbb{R}^{13}$ for high-throughput handling of lattice parameters
Ryoko Oishi-Tomiyasu
https://arxiv.org/abs/2506.08934
Systolic inequalities on the sphere from symplectic embeddings
Brayan Ferreira
https://arxiv.org/abs/2506.07674 https://arxiv.org/pdf…
Heterogeneous Sequel-Aware Graph Neural Networks for Sequential Learning
Anushka Tiwari, Haimonti Dutta, Shahrzad Khanizadeh
https://arxiv.org/abs/2506.05625
Doodle Your Keypoints: Sketch-Based Few-Shot Keypoint Detection
Subhajit Maity, Ayan Kumar Bhunia, Subhadeep Koley, Pinaki Nath Chowdhury, Aneeshan Sain, Yi-Zhe Song
https://arxiv.org/abs/2507.07994
Embeddings of Sobolev spaces of differential forms
Vladimir Gol'dshtein, Yaroslav Kopylov, Roman Panenko
https://arxiv.org/abs/2507.05851 https://
Training-Free Query Optimization via LLM-Based Plan Similarity
Nikita Vasilenko, Alexander Demin, Vladimir Boorlakov
https://arxiv.org/abs/2506.05853 https…
Assessing the Alignment of Audio Representations with Timbre Similarity Ratings
Haokun Tian, Stefan Lattner, Charalampos Saitis
https://arxiv.org/abs/2507.07764
Efficient and Adaptive Estimation of Local Triadic Coefficients
Ilie Sarpe, Aristides Gionis
https://arxiv.org/abs/2507.07536 https://
Neighborhood Overlap-Aware High-Order Graph Neural Network for Dynamic Graph Learning
Ling Wang
https://arxiv.org/abs/2506.06728 https://
Perfect t-embeddings of doubly periodic Aztec diamonds
Tomas Berggren, Matthew Nicoletti, Marianna Russkikh
https://arxiv.org/abs/2508.04938 https://arxiv.…
FuDoBa: Fusing Document and Knowledge Graph-based Representations with Bayesian Optimisation
Boshko Koloski, Senja Pollak, Roberto Navigli, Blaž Škrlj
https://arxiv.org/abs/2507.06622
Classification of Equivariant Legendrian Embeddings of Rational Homogeneous Spaces into Nilpotent Orbits
Minseong Kwon
https://arxiv.org/abs/2507.03932 htt…
CaliciBoost: Performance-Driven Evaluation of Molecular Representations for Caco-2 Permeability Prediction
Huong Van Le, Weibin Ren, Junhong Kim, Yukyung Yun, Young Bin Park, Young Jun Kim, Bok Kyung Han, Inho Choi, Jong IL Park, Hwi-Yeol Yun, Jae-Mun Choi
https://arxiv.org/abs/2506.08059
Universal Embeddings of Tabular Data
Astrid Franz, Frederik Hoppe, Marianne Michaelis, Udo Göbel
https://arxiv.org/abs/2507.05904 https://
More Exotic $\mathbb{RP}^2$-knots and Homotopy Spheres
Judson Kuhrman
https://arxiv.org/abs/2507.03798 https://arxiv.org/pdf/2507.037…
This https://arxiv.org/abs/2409.04459 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
Dimension-Preserving Saturated Embeddings of Finite Posets into the Spectra of Noetherian UFDs
David Baron, S. Loepp
https://arxiv.org/abs/2507.03574 https…
A free two-generated left distributive algebra of elementary embeddings
Andrew D. Brooke-Taylor, Scott Cramer, Sheila K. Miller Edwards
https://arxiv.org/abs/2508.02244 https://…
jina-embeddings-v4: Universal Embeddings for Multimodal Multilingual Retrieval
Michael Günther, Saba Sturua, Mohammad Kalim Akram, Isabelle Mohr, Andrei Ungureanu, Sedigheh Eslami, Scott Martens, Bo Wang, Nan Wang, Han Xiao
https://arxiv.org/abs/2506.18902
This https://arxiv.org/abs/2502.19311 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLO_…
Do We Really Need Specialization? Evaluating Generalist Text Embeddings for Zero-Shot Recommendation and Search
Matteo Attimonelli, Alessandro De Bellis, Claudio Pomo, Dietmar Jannach, Eugenio Di Sciascio, Tommaso Di Noia
https://arxiv.org/abs/2507.05006
Reproducibility in embedding benchmarks is challenging, especially with embedding models that are instructional and increasingly large. At #bbuzz, Isaac Chung explained how MTEB addresses prompt variability, scaling issues and large datasets to ensure fair and consistent scores, setting a standard for benchmarking embeddings.
This https://arxiv.org/abs/2505.12156 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_mat…
This https://arxiv.org/abs/2506.04678 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
Emergent Semantics Beyond Token Embeddings: Transformer LMs with Frozen Visual Unicode Representations
A. Bochkov
https://arxiv.org/abs/2507.04886 https://…
CSI2Vec: Towards a Universal CSI Feature Representation for Positioning and Channel Charting
Victoria Palhares, Sueda Taner, Christoph Studer
https://arxiv.org/abs/2506.05237
Charge quantisation, monopoles and emergent symmetry in the Standard Model and its embeddings
Rodrigo Alonso, Despoina Dimakou, Yunji Ha, Valentin V. Khoze
https://arxiv.org/abs/2507.01777
Replaced article(s) found for math.SG. https://arxiv.org/list/math.SG/new
[1/1]:
- Rigid-flexible values for symplectic embeddings of four-dimensional ellipsoids into almost-cubes
Andrew Lee, Cory H. Colbert
Federated Learning for ICD Classification with Lightweight Models and Pretrained Embeddings
Binbin Xu, Gérard Dray
https://arxiv.org/abs/2507.03122 h…
Embeddings of Weighted Morrey Spaces
Marcus Gerhold
https://arxiv.org/abs/2508.03387 https://arxiv.org/pdf/2508.03387…
Factorizable embeddings and the period of an irreducible sofic shift
Brian Marcus, Tom Meyerovitch, Klaus Thomsen, Chengyu Wu
https://arxiv.org/abs/2508.02554 https://
Text adaptation for speaker verification with speaker-text factorized embeddings
Yexin Yang, Shuai Wang, Xun Gong, Yanmin Qian, Kai Yu
https://arxiv.org/abs/2508.04425 https://
Robust Target Speaker Diarization and Separation via Augmented Speaker Embedding Sampling
Md Asif Jalal, Luca Remaggi, Vasileios Moschopoulos, Thanasis Kotsiopoulos, Vandana Rajan, Karthikeyan Saravanan, Anastasis Drosou, Junho Heo, Hyuk Oh, Seokyeong Jeong
https://arxiv.org/abs/2508.06393
Measuring Information Richness in Product Images: Implications for Online Sales
Zhu Yuting, Cao Xinyu, Su Yuzhuo, Ma Yongbin
https://arxiv.org/abs/2508.04541 https://
Associative triple trisystems and standard embeddings
Raúl Felipe, Guillermo Vera de Salas
https://arxiv.org/abs/2506.04191 https://
Learning to cluster neuronal function
Nina S. Nellen, Polina Turishcheva, Michaela Vystrčilová, Shashwat Sridhar, Tim Gollisch, Andreas S. Tolias, Alexander S. Ecker
https://arxiv.org/abs/2506.03293
A genuine equivariant recognition principle for finite groups
Branko Juran
https://arxiv.org/abs/2508.04421 https://arxiv.org/pdf/2508.04421
Infinity Search: Approximate Vector Search with Projections on q-Metric Spaces
Antonio Pariente, Ignacio Hounie, Santiago Segarra, Alejandro Ribeiro
https://arxiv.org/abs/2506.06557
Evaluating Style-Personalized Text Generation: Challenges and Directions
Anubhav Jangra, Bahareh Sarrafzadeh, Adrian de Wynter, Silviu Cucerzan, Sujay Kumar Jauhar
https://arxiv.org/abs/2508.06374
Beyond Distance: Mobility Neural Embeddings Reveal Visible and Invisible Barriers in Urban Space
Guangyuan Weng, Minsuk Kim, Yong-Yeol Ahn, Esteban Moro
https://arxiv.org/abs/2506.24061
Conformal Rigidity and Spectral Embeddings of Graphs
João Gouveia, Stefan Steinerberger, Rekha R. Thomas
https://arxiv.org/abs/2506.20541 https://…
Uncovering smooth structures in single-cell data with PCS-guided neighbor embeddings
Rong Ma, Xi Li, Jingyuan Hu, Bin Yu
https://arxiv.org/abs/2506.22228 h…
Embedded contact homology of the unit cotangent bundle of the Klein bottle
Marcelo Miranda, Vinicius G. B. Ramos
https://arxiv.org/abs/2508.06400 https://a…
A Study of the Framework and Real-World Applications of Language Embedding for 3D Scene Understanding
Mahmoud Chick Zaouali, Todd Charter, Yehor Karpichev, Brandon Haworth, Homayoun Najjaran
https://arxiv.org/abs/2508.05064
News Sentiment Embeddings for Stock Price Forecasting
Ayaan Qayyum
https://arxiv.org/abs/2507.01970 https://arxiv.org/pdf/2507.01970
Quasi-isometric embeddings of Ramanujan complexes
Hyein Choi
https://arxiv.org/abs/2506.23585 https://arxiv.org/pdf/2506.23585…
Compressing Large Language Models with PCA Without Performance Loss
Magnus Bengtsson
https://arxiv.org/abs/2508.04307 https://arxiv.org/pdf/2508.04307
KAConvText: Novel Approach to Burmese Sentence Classification using Kolmogorov-Arnold Convolution
Ye Kyaw Thu, Thura Aung, Thazin Myint Oo, Thepchai Supnithi
https://arxiv.org/abs/2507.06753
A remark on equivariant Riemannian isometric embeddings preserving symmetries
Dmitri Burago, Hongda Qiu
https://arxiv.org/abs/2507.23164 https://arxiv.org/…
Toroidal embedding of Chevalley groups over $\mathbb{Z}$
Shang Li
https://arxiv.org/abs/2506.02638 https://arxiv.org/pdf/2506.02638…
Robust Learning on Noisy Graphs via Latent Space Constraints with External Knowledge
Chunhui Gu, Mohammad Sadegh Nasr, James P. Long, Kim-Anh Do, Ehsan Irajizad
https://arxiv.org/abs/2507.05540
Towards Storage-Efficient Visual Document Retrieval: An Empirical Study on Reducing Patch-Level Embeddings
Yubo Ma, Jinsong Li, Yuhang Zang, Xiaobao Wu, Xiaoyi Dong, Pan Zhang, Yuhang Cao, Haodong Duan, Jiaqi Wang, Yixin Cao, Aixin Sun
https://arxiv.org/abs/2506.04997
Synopsis: Secure and private trend inference from encrypted semantic embeddings
Madelyne Xiao, Palak Jain, Micha Gorelick, Sarah Scheffler
https://arxiv.org/abs/2505.23880
Advancement of Circular Economy Through Interdisciplinary Collaboration: A Bibliometric Approach
Keita Nishimoto, Koji Kimita, Shinsuke Murakami, Yin Long, Kimitaka Asatani, Ichiro Sakata
https://arxiv.org/abs/2507.04923
This https://arxiv.org/abs/2502.04049 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_ees…
This https://arxiv.org/abs/2505.14806 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_qbi…
Cropping outperforms dropout as an augmentation strategy for training self-supervised text embeddings
Rita González-Márquez, Philipp Berens, Dmitry Kobak
https://arxiv.org/abs/2508.03453
Replaced article(s) found for cs.CV. https://arxiv.org/list/cs.CV/new
[8/10]:
- Preserving Topological and Geometric Embeddings for Point Cloud Recovery
Kaiyue Zhou, Zelong Tan, Hongxiao Wang, Ya-li Li, Shengjin Wang
WAKE: Watermarking Audio with Key Enrichment
Yaoxun Xu, Jianwei Yu, Hangting Chen, Zhiyong Wu, Xixin Wu, Dong Yu, Rongzhi Gu, Yi Luo
https://arxiv.org/abs/2506.05891
Multiplier Between Generalized Toeplitz Kernels
Anjali, R. K. Srivastava
https://arxiv.org/abs/2507.03452 https://arxiv.org/pdf/2507.…
Twisted embeddings of tori have small extrinsic systole
Sahana Vasudevan
https://arxiv.org/abs/2507.23766 https://arxiv.org/pdf/2507.23766
EAGLE: Efficient Alignment of Generalized Latent Embeddings for Multimodal Survival Prediction with Interpretable Attribution Analysis
Aakash Tripathi, Asim Waqas, Matthew B. Schabath, Yasin Yilmaz, Ghulam Rasool
https://arxiv.org/abs/2506.22446
Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval
Seongwan Park, Taeklim Kim, Youngjoong Ko
https://arxiv.org/abs/2506.00041
Spectral coverings without embeddings
Eric Boulter, Steven Rayan
https://arxiv.org/abs/2507.02127 https://arxiv.org/pdf/2507.02127
Evaluating the Effectiveness of Pre-Trained Audio Embeddings for Classification of Parkinson's Disease Speech Data
Emmy Postma, Cristian Tejedor-Garcia
https://arxiv.org/abs/2506.02078
Hierarchical Interaction Summarization and Contrastive Prompting for Explainable Recommendations
Yibin Liu, Ang Li, Shijian Li
https://arxiv.org/abs/2507.06044
This https://arxiv.org/abs/2410.16428 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csSD_…
Semantic Certainty Assessment in Vector Retrieval Systems: A Novel Framework for Embedding Quality Evaluation
Y. Du
https://arxiv.org/abs/2507.05933 https:…
On The Role of Pretrained Language Models in General-Purpose Text Embeddings: A Survey
Meishan Zhang, Xin Zhang, Xinping Zhao, Shouzheng Huang, Baotian Hu, Min Zhang
https://arxiv.org/abs/2507.20783
Universal embeddings of flag manifolds and rigidity phenomena
Andrea Loi, Roberto Mossa, Fabio Zuddas
https://arxiv.org/abs/2507.23606 https://arxiv.org/pd…
Do Recommender Systems Really Leverage Multimodal Content? A Comprehensive Analysis on Multimodal Representations for Recommendation
Claudio Pomo, Matteo Attimonelli, Danilo Danese, Fedelucio Narducci, Tommaso Di Noia
https://arxiv.org/abs/2508.04571
Verified Language Processing with Hybrid Explainability: A Technical Report
Oliver Robert Fox, Giacomo Bergami, Graham Morgan
https://arxiv.org/abs/2507.05017
LAPS-Diff: A Diffusion-Based Framework for Singing Voice Synthesis With Language Aware Prosody-Style Guided Learning
Sandipan Dhar, Mayank Gupta, Preeti Rao
https://arxiv.org/abs/2507.04966
A Dynamic Framework for Semantic Grouping of Common Data Elements (CDE) Using Embeddings and Clustering
Madan Krishnamurthy, Daniel Korn, Melissa A Haendel, Christopher J Mungall, Anne E Thessen
https://arxiv.org/abs/2506.02160
Soft Injection of Task Embeddings Outperforms Prompt-Based In-Context Learning
Jungwon Park, Wonjong Rhee
https://arxiv.org/abs/2507.20906 https://arxiv.or…
This https://arxiv.org/abs/2504.06212 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csLG_…
Word stress in self-supervised speech models: A cross-linguistic comparison
Martijn Bentum, Louis ten Bosch, Tomas O. Lentz
https://arxiv.org/abs/2507.04738
Analyzing and Improving Speaker Similarity Assessment for Speech Synthesis
Marc-André Carbonneau, Benjamin van Niekerk, Hugo Seuté, Jean-Philippe Letendre, Herman Kamper, Julian Zaïdi
https://arxiv.org/abs/2507.02176
Rethinking Hybrid Retrieval: When Small Embeddings and LLM Re-ranking Beat Bigger Models
Arjun Rao, Hanieh Alipour, Nick Pendar
https://arxiv.org/abs/2506.00049
Resource-Efficient Adaptation of Large Language Models for Text Embeddings via Prompt Engineering and Contrastive Fine-tuning
Benedikt Roth, Stephan Rappensperger, Tianming Qiu, Hamza Imamović, Julian Wörmann, Hao Shen
https://arxiv.org/abs/2507.22729
Improving Audio Classification by Transitioning from Zero- to Few-Shot
James Taylor, Wolfgang Mack
https://arxiv.org/abs/2507.20036 https://arxiv.org/pdf/2…
Interact2Vec -- An efficient neural network-based model for simultaneously learning users and items embeddings in recommender systems
Pedro R. Pires, Tiago A. Almeida
https://arxiv.org/abs/2506.22648
Evolutionary Feature-wise Thresholding for Binary Representation of NLP Embeddings
Soumen Sinha, Shahryar Rahnamayan, Azam Asilian Bidgoli
https://arxiv.org/abs/2507.17025
Semantic IDs for Music Recommendation
M. Jeffrey Mei, Florian Henkel, Samuel E. Sandberg, Oliver Bembom, Andreas F. Ehmann
https://arxiv.org/abs/2507.18800 https://
Intrinsic vs. Extrinsic Evaluation of Czech Sentence Embeddings: Semantic Relevance Doesn't Help with MT Evaluation
Petra Barančíková, Ondřej Bojar
https://arxiv.org/abs/2506.20203
Context is Gold to find the Gold Passage: Evaluating and Training Contextual Document Embeddings
Max Conti, Manuel Faysse, Gautier Viaud, Antoine Bosselut, Céline Hudelot, Pierre Colombo
https://arxiv.org/abs/2505.24782
ViLLA-MMBench: A Unified Benchmark Suite for LLM-Augmented Multimodal Movie Recommendation
Fatemeh Nazary, Ali Tourani, Yashar Deldjoo, Tommaso Di Noia
https://arxiv.org/abs/2508.04206
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts
Marc Brinner, Sina Zarriess
https://arxiv.org/abs/2507.13105
Unsupervised Document and Template Clustering using Multimodal Embeddings
Phillipe R. Sampaio, Helene Maxcici
https://arxiv.org/abs/2506.12116 https://