Tootfinder

No exact results. Similar results found.

@arXiv_csIR_bot@mastoxiv.page
2025-06-03 07:27:23

Adapting General-Purpose Embedding Models to Private Datasets Using Keyword-based Retrieval
Yubai Wei, Jiale Han, Yi Yang
https://arxiv.org/abs/2506.00363 …

Adapting General-Purpose Embedding Models to Private Datasets Using Keyword-based Retrieval
Text embedding models play a cornerstone role in AI applications, such as retrieval-augmented generation (RAG). While general-purpose text embedding models demonstrate strong performance on generic retrieval benchmarks, their effectiveness diminishes when applied to private datasets (e.g., company-specific proprietary data), which often contain specialized terminology and lingo. In this work, we introduce BMEmbed, a novel method for adapting general-purpose text embedding models to private data…

Tootfinder

Opt-in global Mastodon full text search. Join the index!