Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csIR_bot@mastoxiv.page
2025-06-03 07:39:40

GLoSS: Generative Language Models with Semantic Search for Sequential Recommendation
Krishna Acharya, Aleksandr V. Petrov, Juba Ziani
arxiv.org/abs/2506.01910

@arXiv_csCY_bot@mastoxiv.page
2025-06-03 07:19:09

Beyond Monoliths: Expert Orchestration for More Capable, Democratic, and Safe Large Language Models
Philip Quirke, Narmeen Oozeer, Chaithanya Bandi, Amir Abdullah, Jason Hoelscher-Obermaier, Jeff M. Phillips, Joshua Greaves, Clement Neo, Fazl Barez, Shriyash Upadhyay
arxiv.org/abs/2506.00051

@arXiv_csCR_bot@mastoxiv.page
2025-06-02 09:57:08

This arxiv.org/abs/2409.17275 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csIR_bot@mastoxiv.page
2025-07-02 09:01:00

MassTool: A Multi-Task Search-Based Tool Retrieval Framework for Large Language Models
Jianghao Lin, Xinyuan Wang, Xinyi Dai, Menghui Zhu, Bo Chen, Ruiming Tang, Yong Yu, Weinan Zhang
arxiv.org/abs/2507.00487

@arXiv_condmatmtrlsci_bot@mastoxiv.page
2025-07-02 09:45:20

Testing the spin-bath view of self-attention: A Hamiltonian analysis of GPT-2 Transformer
Satadeep Bhattacharjee, Seung-Cheol Lee
arxiv.org/abs/2507.00683

@arXiv_csDB_bot@mastoxiv.page
2025-05-30 07:16:53

TailorSQL: An NL2SQL System Tailored to Your Query Workload
Kapil Vaidya, Jialin Ding, Sebastian Kosak, David Kernert, Chuan Lei, Xiao Qin, Abhinav Tripathy, Ramesh Balan, Balakrishnan Narayanaswamy, Tim Kraska
arxiv.org/abs/2505.23039

@arXiv_csMM_bot@mastoxiv.page
2025-06-03 07:22:17

Small Stickers, Big Meanings: A Multilingual Sticker Semantic Understanding Dataset with a Gamified Approach
Heng Er Metilda Chee, Jiayin Wang, Zhiqiang Guo, Weizhi Ma, Min Zhang
arxiv.org/abs/2506.01668

@arXiv_csLO_bot@mastoxiv.page
2025-07-01 09:29:03

Querying Attack-Fault-Defense Trees: Property Specification in Smart Grid and Aerospace Case Studies
Reza Soltani, Stefano M. Nicoletti, Milan Lopuha\"a-Zwakenberg, Mari\"elle Stoelinga
arxiv.org/abs/2506.23789

@arXiv_csCL_bot@mastoxiv.page
2025-06-27 09:54:19

Leveraging LLM-Assisted Query Understanding for Live Retrieval-Augmented Generation
Guanting Dong, Xiaoxi Li, Yuyao Zhang, Mengjie Deng
arxiv.org/abs/2506.21384

@arXiv_csIR_bot@mastoxiv.page
2025-07-02 08:39:29

Read the Docs Before Rewriting: Equip Rewriter with Domain Knowledge via Continual Pre-training
Qi Wang, Yixuan Cao, Yifan Liu, Jiangtao Zhao, Ping Luo
arxiv.org/abs/2507.00477

@lysander07@sigmoid.social
2025-06-25 05:00:12

In today's ISE 2025 lecture,, we will introduce SPARQL as a query language for knowledge graphs. Again, I'm trying out 'Dystopian Novels' as example knowledge graph playground. Let's see, if the students might know any of them. Wtat do you think? ;-)
#dystopia #literature

Example knowledge graph for the ISE 2025 lecture. The novels represented in this graph are:
- George Orwell: Nineteeneightyfour
- Harry Harrison: Make Room! Make Room!
- Octavia E. Butler: Parable of the Sower
@arXiv_csLG_bot@mastoxiv.page
2025-06-25 07:36:59

HI-SQL: Optimizing Text-to-SQL Systems through Dynamic Hint Integration
Ganesh Parab, Zishan Ahmad, Dagnachew Birru
arxiv.org/abs/2506.18916

@arXiv_csDB_bot@mastoxiv.page
2025-05-30 07:17:10

KVzip: Query-Agnostic KV Cache Compression with Context Reconstruction
Jang-Hyun Kim, Jinuk Kim, Sangwoo Kwon, Jae W. Lee, Sangdoo Yun, Hyun Oh Song
arxiv.org/abs/2505.23416

@arXiv_csDC_bot@mastoxiv.page
2025-06-18 08:12:32

D\'ej\`a Vu: Efficient Video-Language Query Engine with Learning-based Inter-Frame Computation Reuse
Jinwoo Hwang, Daeun Kim, Sangyeop Lee, Yoonsung Kim, Guseul Heo, Hojoon Kim, Yunseok Jeong, Tadiwos Meaza, Eunhyeok Park, Jeongseob Ahn, Jongse Park
arxiv.org/abs/2506.14107

@arXiv_csIR_bot@mastoxiv.page
2025-06-03 07:21:46

Decoding Dense Embeddings: Sparse Autoencoders for Interpreting and Discretizing Dense Retrieval
Seongwan Park, Taeklim Kim, Youngjoong Ko
arxiv.org/abs/2506.00041

@arXiv_csCR_bot@mastoxiv.page
2025-05-30 09:51:42

This arxiv.org/abs/2411.06426 has been replaced.
initial toot: mastoxiv.page/@arXiv_csCR_…

@arXiv_csDB_bot@mastoxiv.page
2025-05-29 07:17:12

Query, Don't Train: Privacy-Preserving Tabular Prediction from EHR Data via SQL Queries
Josefa Lia Stoisser, Marc Boubnovski Martell, Kaspar M\"artens, Lawrence Phillips, Stephen Michael Town, Rory Donovan-Maiye, Julien Fauqueur
arxiv.org/abs/2505.21801

@arXiv_eessIV_bot@mastoxiv.page
2025-06-23 08:43:40

InfiniPot-V: Memory-Constrained KV Cache Compression for Streaming Video Understanding
Minsoo Kim, Kyuhong Shim, Jungwook Choi, Simyung Chang
arxiv.org/abs/2506.15745

@arXiv_csAI_bot@mastoxiv.page
2025-06-18 08:04:15

Lightweight Relevance Grader in RAG
Taehee Jeong
arxiv.org/abs/2506.14084 arxiv.org/pdf/2506.14084

@arXiv_csHC_bot@mastoxiv.page
2025-06-19 08:23:34

Optimizing Web-Based AI Query Retrieval with GPT Integration in LangChain A CoT-Enhanced Prompt Engineering Approach
Wenqi Guan, Yang Fang
arxiv.org/abs/2506.15512

@arXiv_csDB_bot@mastoxiv.page
2025-05-30 09:51:22

This arxiv.org/abs/2505.21801 has been replaced.
initial toot: mastoxiv.page/@arXiv_csDB_…

@arXiv_csCR_bot@mastoxiv.page
2025-06-26 09:21:00

Retrieval-Confused Generation is a Good Defender for Privacy Violation Attack of Large Language Models
Wanli Peng, Xin Chen, Hang Fu, XinYu He, Xue Yiming, Juan Wen
arxiv.org/abs/2506.19889

@arXiv_csPL_bot@mastoxiv.page
2025-06-03 16:09:21

This arxiv.org/abs/2505.14690 has been replaced.
initial toot: mastoxiv.page/@arXiv_csPL_…

@arXiv_csSE_bot@mastoxiv.page
2025-06-16 10:15:59

Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation
Benjamin Elder, Anupama Murthi, Jungkoo Kang, Ankita Rajaram Naik, Kiran Kate, Kinjal Basu, Danish Contractor
arxiv.org/abs/2506.11266

@arXiv_csSD_bot@mastoxiv.page
2025-06-11 08:08:45

Step-Audio-AQAA: a Fully End-to-End Expressive Large Audio Language Model
Ailin Huang, Bingxin Li, Bruce Wang, Boyong Wu, Chao Yan, Chengli Feng, Heng Wang, Hongyu Zhou, Hongyuan Wang, Jingbei Li, Jianjian Sun, Joanna Wang, Mingrui Chen, Peng Liu, Ruihang Miao, Shilei Jiang, Tian Fei, Wang You, Xi Chen, Xuerui Yang, Yechang Huang, Yuxiang Zhang, Zheng Ge, Zheng Gong, Zhewei Huang, Zixin Zhang, Bin Wang, Bo Li, Buyun Ma, Changxin Miao, Changyi Wan, Chen Xu, Dapeng Shi, Dingyuan Hu, Enle…

@arXiv_csCL_bot@mastoxiv.page
2025-06-12 09:06:21

Query-Focused Retrieval Heads Improve Long-Context Reasoning and Re-ranking
Wuwei Zhang, Fangcong Yin, Howard Yen, Danqi Chen, Xi Ye
arxiv.org/abs/2506.09944

@arXiv_csIR_bot@mastoxiv.page
2025-05-30 09:53:40

This arxiv.org/abs/2502.17057 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_csIR_bot@mastoxiv.page
2025-06-24 11:43:30

Harnessing the Power of Reinforcement Learning for Language-Model-Based Information Retriever via Query-Document Co-Augmentation
Jingming Liu, Yumeng Li, Wei Shi, Yao-Xiang Ding, Hui Su, Kun Zhou
arxiv.org/abs/2506.18670

@erc_bk@fosstodon.org
2025-05-08 15:40:22

I feel like ChatGPT is taking a jab at other types of models here.

Images shows the response of ChatGPT to a query about map artifacts generated by a random forest model. It describes some of the predictions of the model as hallucinations which is a common critique of large language models.
@arXiv_csIR_bot@mastoxiv.page
2025-06-30 08:46:00

PentaRAG: Large-Scale Intelligent Knowledge Retrieval for Enterprise LLM Applications
Abu Hanif Muhammad Syarubany, Chang Dong Yoo
arxiv.org/abs/2506.21593

@arXiv_csDB_bot@mastoxiv.page
2025-06-17 09:28:39

Datrics Text2SQL: A Framework for Natural Language to SQL Query Generation
Tetiana Gladkykh, Kyrylo Kirykov
arxiv.org/abs/2506.12234

@arXiv_csAR_bot@mastoxiv.page
2025-06-04 07:17:25

Hardware-Centric Analysis of DeepSeek's Multi-Head Latent Attention
Robin Geens, Marian Verhelst
arxiv.org/abs/2506.02523

@arXiv_csDC_bot@mastoxiv.page
2025-06-16 07:26:29

SwiftSpec: Ultra-Low Latency LLM Decoding by Scaling Asynchronous Speculative Decoding
Ziyi Zhang, Ziheng Jiang, Chengquan Jiang, Menghan Yu, Size Zheng, Haibin Lin, Henry Hoffmann, Xin Liu
arxiv.org/abs/2506.11309

@arXiv_csNI_bot@mastoxiv.page
2025-06-05 07:20:19

NetPress: Dynamically Generated LLM Benchmarks for Network Applications
Yajie Zhou, Jiajun Ruan, Eric S. Wang, Sadjad Fouladi, Francis Y. Yan, Kevin Hsieh, Zaoxing Liu
arxiv.org/abs/2506.03231

@arXiv_csIR_bot@mastoxiv.page
2025-06-23 08:51:30

Revela: Dense Retriever Learning via Language Modeling
Fengyu Cai, Tong Chen, Xinran Zhao, Sihao Chen, Hongming Zhang, Sherry Tongshuang Wu, Iryna Gurevych, Heinz Koeppl
arxiv.org/abs/2506.16552

@arXiv_csAI_bot@mastoxiv.page
2025-06-03 18:16:23

This arxiv.org/abs/2505.24226 has been replaced.
initial toot: mastoxiv.page/@arXiv_csAI_…

@arXiv_csDB_bot@mastoxiv.page
2025-06-09 07:27:22

Training-Free Query Optimization via LLM-Based Plan Similarity
Nikita Vasilenko, Alexander Demin, Vladimir Boorlakov
arxiv.org/abs/2506.05853

@arXiv_csIR_bot@mastoxiv.page
2025-06-17 10:07:05

SPOT: Bridging Natural Language and Geospatial Search for Investigative Journalists
Lynn Khellaf, Ipek Baris Schlicht, Tilman Mirass, Julia Bayer, Tilman Wagner, Ruben Bouwmeester
arxiv.org/abs/2506.13188

@arXiv_csIR_bot@mastoxiv.page
2025-06-16 07:50:09

TongSearch-QR: Reinforced Query Reasoning for Retrieval
Xubo Qin, Jun Bai, Jiaqi Li, Zixia Jia, Zilong Zheng
arxiv.org/abs/2506.11603

@arXiv_csIR_bot@mastoxiv.page
2025-06-06 07:19:02

Exp4Fuse: A Rank Fusion Framework for Enhanced Sparse Retrieval using Large Language Model-based Query Expansion
Lingyuan Liu, Mengxiang Zhang
arxiv.org/abs/2506.04760

@arXiv_csIR_bot@mastoxiv.page
2025-06-06 07:19:23

GOLFer: Smaller LM-Generated Documents Hallucination Filter & Combiner for Query Expansion in Information Retrieval
Lingyuan Liu, Mengxiang Zhang
arxiv.org/abs/2506.04762

@arXiv_csDB_bot@mastoxiv.page
2025-06-04 13:32:54

This arxiv.org/abs/2505.19988 has been replaced.
initial toot: mastoxiv.page/@arXiv_csDB_…

@arXiv_csIR_bot@mastoxiv.page
2025-06-17 10:14:17

Tree-Based Text Retrieval via Hierarchical Clustering in RAGFrameworks: Application on Taiwanese Regulations
Chia-Heng Yu, Yen-Lung Tsai
arxiv.org/abs/2506.13607

@arXiv_csDB_bot@mastoxiv.page
2025-06-03 16:03:48

This arxiv.org/abs/2503.00600 has been replaced.
initial toot: mastoxiv.page/@arXiv_csDB_…

@arXiv_csIR_bot@mastoxiv.page
2025-06-03 16:41:58

This arxiv.org/abs/2505.07155 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_csIR_bot@mastoxiv.page
2025-06-03 16:14:58

This arxiv.org/abs/2412.00639 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…

@arXiv_csIR_bot@mastoxiv.page
2025-06-13 07:38:00

Towards Understanding Bias in Synthetic Data for Evaluation
Hossein A. Rahmani, Varsha Ramineni, Nick Craswell, Bhaskar Mitra, Emine Yilmaz
arxiv.org/abs/2506.10301

@arXiv_csIR_bot@mastoxiv.page
2025-06-10 16:34:59

This arxiv.org/abs/2503.18941 has been replaced.
initial toot: mastoxiv.page/@arXiv_csIR_…