
2025-06-03 07:39:40
GLoSS: Generative Language Models with Semantic Search for Sequential Recommendation
Krishna Acharya, Aleksandr V. Petrov, Juba Ziani
https://arxiv.org/abs/2506.01910
Beyond Monoliths: Expert Orchestration for More Capable, Democratic, and Safe Large Language Models
Philip Quirke, Narmeen Oozeer, Chaithanya Bandi, Amir Abdullah, Jason Hoelscher-Obermaier, Jeff M. Phillips, Joshua Greaves, Clement Neo, Fazl Barez, Shriyash Upadhyay
https://arxiv.org/abs/2506.00051
This https://arxiv.org/abs/2409.17275 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
MassTool: A Multi-Task Search-Based Tool Retrieval Framework for Large Language Models
Jianghao Lin, Xinyuan Wang, Xinyi Dai, Menghui Zhu, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang
https://arxiv.org/abs/2507.00487
Testing the spin-bath view of self-attention: A Hamiltonian analysis of GPT-2 Transformer
Satadeep Bhattacharjee, Seung-Cheol Lee
https://arxiv.org/abs/2507.00683
TailorSQL: An NL2SQL System Tailored to Your Query Workload
Kapil Vaidya, Jialin Ding, Sebastian Kosak, David Kernert, Chuan Lei, Xiao Qin, Abhinav Tripathy, Ramesh Balan, Balakrishnan Narayanaswamy, Tim Kraska
https://arxiv.org/abs/2505.23039
Small Stickers, Big Meanings: A Multilingual Sticker Semantic Understanding Dataset with a Gamified Approach
Heng Er Metilda Chee, Jiayin Wang, Zhiqiang Guo, Weizhi Ma, Min Zhang
https://arxiv.org/abs/2506.01668
Querying Attack-Fault-Defense Trees: Property Specification in Smart Grid and Aerospace Case Studies
Reza Soltani, Stefano M. Nicoletti, Milan Lopuhaä-Zwakenberg, Mariëlle Stoelinga
https://arxiv.org/abs/2506.23789
Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation
Guanting Dong, Xiaoxi Li, Yuyao Zhang, Mengjie Deng
https://arxiv.org/abs/2506.21384
Read the Docs Before Rewriting: Equip Rewriter with Domain Knowledge via Continual Pre-training
Qi Wang, Yixuan Cao, Yifan Liu, Jiangtao Zhao, Ping Luo
https://arxiv.org/abs/2507.00477
In today's ISE 2025 lecture, we will introduce SPARQL as a query language for knowledge graphs. Again, I'm trying out 'Dystopian Novels' as an example knowledge graph playground. Let's see if the students know any of them. What do you think? ;-)
#dystopia #literature
HI-SQL: Optimizing Text-to-SQL Systems through Dynamic Hint Integration
Ganesh Parab, Zishan Ahmad, Dagnachew Birru
https://arxiv.org/abs/2506.18916 https:…
KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction
Jang-Hyun Kim, Jinuk Kim, Sangwoo Kwon, Jae W. Lee, Sangdoo Yun, Hyun Oh Song
https://arxiv.org/abs/2505.23416
Déjà Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse
Jinwoo Hwang, Daeun Kim, Sangyeop Lee, Yoonsung Kim, Guseul Heo, Hojoon Kim, Yunseok Jeong, Tadiwos Meaza, Eunhyeok Park, Jeongseob Ahn, Jongse Park
https://arxiv.org/abs/2506.14107
Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval
Seongwan Park, Taeklim Kim, Youngjoong Ko
https://arxiv.org/abs/2506.00041
This https://arxiv.org/abs/2411.06426 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
Query, Don't Train: Privacy-Preserving Tabular Prediction from EHR Data via SQL Queries
Josefa Lia Stoisser, Marc Boubnovski Martell, Kaspar Märtens, Lawrence Phillips, Stephen Michael Town, Rory Donovan-Maiye, Julien Fauqueur
https://arxiv.org/abs/2505.21801
InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding
Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang
https://arxiv.org/abs/2506.15745
Lightweight Relevance Grader in RAG
Taehee Jeong
https://arxiv.org/abs/2506.14084 https://arxiv.org/pdf/2506.14084
Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach
Wenqi Guan, Yang Fang
https://arxiv.org/abs/2506.15512
This https://arxiv.org/abs/2505.21801 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
Retrieval-Confused Generation is a Good Defender for Privacy Violation Attack of Large Language Models
Wanli Peng, Xin Chen, Hang Fu, XinYu He, Xue Yiming, Juan Wen
https://arxiv.org/abs/2506.19889
This https://arxiv.org/abs/2505.14690 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csPL_…
Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation
Benjamin Elder, Anupama Murthi, Jungkoo Kang, Ankita Rajaram Naik, Kiran Kate, Kinjal Basu, Danish Contractor
https://arxiv.org/abs/2506.11266
Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Ailin Huang, Bingxin Li, Bruce Wang, Boyong Wu, Chao Yan, Chengli Feng, Heng Wang, Hongyu Zhou, Hongyuan Wang, Jingbei Li, Jianjian Sun, Joanna Wang, Mingrui Chen, Peng Liu, Ruihang Miao, Shilei Jiang, Tian Fei, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Ge, Zheng Gong, Zhewei Huang, Zixin Zhang, Bin Wang, Bo Li, Buyun Ma, Changxin Miao, Changyi Wan, Chen Xu, Dapeng Shi, Dingyuan Hu, Enle…
Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking
Wuwei Zhang, Fangcong Yin, Howard Yen, Danqi Chen, Xi Ye
https://arxiv.org/abs/2506.09944
This https://arxiv.org/abs/2502.17057 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
Harnessing the Power of Reinforcement Learning for Language-Model-Based Information Retriever via Query-Document Co-Augmentation
Jingming Liu, Yumeng Li, Wei Shi, Yao-Xiang Ding, Hui Su, Kun Zhou
https://arxiv.org/abs/2506.18670
PentaRAG: Large-Scale Intelligent Knowledge Retrieval for Enterprise LLM Applications
Abu Hanif Muhammad Syarubany, Chang Dong Yoo
https://arxiv.org/abs/2506.21593
Datrics Text2SQL: A Framework for Natural Language to SQL Query Generation
Tetiana Gladkykh, Kyrylo Kirykov
https://arxiv.org/abs/2506.12234 https://
Hardware-Centric Analysis of DeepSeek's Multi-Head Latent Attention
Robin Geens, Marian Verhelst
https://arxiv.org/abs/2506.02523 https://
SwiftSpec: Ultra-Low Latency LLM Decoding by Scaling Asynchronous Speculative Decoding
Ziyi Zhang, Ziheng Jiang, Chengquan Jiang, Menghan Yu, Size Zheng, Haibin Lin, Henry Hoffmann, Xin Liu
https://arxiv.org/abs/2506.11309
NetPress: Dynamically Generated LLM Benchmarks for Network Applications
Yajie Zhou, Jiajun Ruan, Eric S. Wang, Sadjad Fouladi, Francis Y. Yan, Kevin Hsieh, Zaoxing Liu
https://arxiv.org/abs/2506.03231
Revela: Dense Retriever Learning via Language Modeling
Fengyu Cai, Tong Chen, Xinran Zhao, Sihao Chen, Hongming Zhang, Sherry Tongshuang Wu, Iryna Gurevych, Heinz Koeppl
https://arxiv.org/abs/2506.16552
This https://arxiv.org/abs/2505.24226 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csAI_…
Training-Free Query Optimization via LLM-Based Plan Similarity
Nikita Vasilenko, Alexander Demin, Vladimir Boorlakov
https://arxiv.org/abs/2506.05853 https…
SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists
Lynn Khellaf, Ipek Baris Schlicht, Tilman Mirass, Julia Bayer, Tilman Wagner, Ruben Bouwmeester
https://arxiv.org/abs/2506.13188
TongSearch-QR: Reinforced Query Reasoning for Retrieval
Xubo Qin, Jun Bai, Jiaqi Li, Zixia Jia, Zilong Zheng
https://arxiv.org/abs/2506.11603 https://
Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion
Lingyuan Liu, Mengxiang Zhang
https://arxiv.org/abs/2506.04760
GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval
Lingyuan Liu, Mengxiang Zhang
https://arxiv.org/abs/2506.04762
This https://arxiv.org/abs/2505.19988 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
Tree-Based Text Retrieval via Hierarchical Clustering in RAG Frameworks: Application on Taiwanese Regulations
Chia-Heng Yu, Yen-Lung Tsai
https://arxiv.org/abs/2506.13607
This https://arxiv.org/abs/2503.00600 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
This https://arxiv.org/abs/2505.07155 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
This https://arxiv.org/abs/2412.00639 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…
Towards Understanding Bias in Synthetic Data for Evaluation
Hossein A. Rahmani, Varsha Ramineni, Nick Craswell, Bhaskar Mitra, Emine Yilmaz
https://arxiv.org/abs/2506.10301
This https://arxiv.org/abs/2503.18941 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csIR_…