Tootfinder

Opt-in global Mastodon full text search. Join the index!

@arXiv_csNI_bot@mastoxiv.page
2025-09-25 08:26:12

Poster: ChatIYP: Enabling Natural Language Access to the Internet Yellow Pages Database
Vasilis Andritsoudis, Pavlos Sermpezis, Ilias Dimitriadis, Athena Vakali
arxiv.org/abs/2509.19411

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:39:09

QAgent: A modular Search Agent with Interactive Query Understanding
Yi Jiang, Lei Shen, Lujie Niu, Sendong Zhao, Wenbo Su, Bo Zheng
arxiv.org/abs/2510.08383

@fanf@mendeddrum.org
2025-11-05 18:42:03

from my link log —
Pipelined Relational Query Language, PRQL: a simple, powerful, pipelined SQL replacement.
prql-lang.org/book/
saved 2025-11-05

@arXiv_csLG_bot@mastoxiv.page
2025-10-14 13:39:18

Query-Specific GNN: A Comprehensive Graph Representation Learning Method for Retrieval Augmented Generation
Yuchen Yan, Zhihua Liu, Hao Wang, Weiming Li, Xiaoshuai Hao
arxiv.org/abs/2510.11541

@michabbb@social.vivaldi.net
2025-12-13 22:39:44

📈 #LogsQL query language provides fast full-text search, advanced analytics, and data extraction/transformation at query time. Can be combined with Unix tools like grep, less, sort, and jq for log analysis.
🎯 Optimized for high cardinality fields like trace_id, user_id, and ip addresses. Supports logs with hundreds of fields (wide events), multitenancy, out-of-order ingestion, live taili…

@arXiv_csIR_bot@mastoxiv.page
2025-10-14 09:03:48

CardRewriter: Leveraging Knowledge Cards for Long-Tail Query Rewriting on Short-Video Platforms
Peiyuan Gong, Feiran Zhu, Yaqi Yin, Chenglei Dai, Chao Zhang, Kai Zheng, Wentian Bao, Jiaxin Mao, Yi Zhang
arxiv.org/abs/2510.10095

@tiotasram@kolektiva.social
2025-11-09 12:09:40

Imagine ChatGPT but instead of predicting text it just linked you to the to 3 documents most-influential on the probabilities that would have been used to predict that text.
Could even generate some info about which parts of each would have been combined how.
There would still be issues with how training data is sourced and filtered, but these could be solved by crawling normally respecting robots.txt and by paying filterers a fair wage with a more relaxed work schedule and mental health support.
The energy issues are mainly about wild future investment and wasteful query spam, not optimized present-day per-query usage.
Is this "just search?"
Yes, but it would have some advantages for a lot of use cases, mainly in synthesizing results across multiple documents and in leveraging a language model more fully to find relevant stuff.
When we talk about the harms of current corporate LLMs, the opportunity cost of NOT building things like this is part of that.
The equivalent for art would have been so amazing too! "Here are some artists that can do what you want, with examples pulled from their portfolios."
It would be a really cool coding assistant that I'd actually encourage my students to use (with some guidelines).
#AI #GenAI #LLMs

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:14:51

Query-Centric Graph Retrieval Augmented Generation
Yaxiong Wu, Jianyuan Bo, Yongyue Zhang, Sheng Liang, Yong Liu
arxiv.org/abs/2509.21237 a…

@arXiv_csDC_bot@mastoxiv.page
2025-10-01 07:38:05

Accelerating LLM Inference with Precomputed Query Storage
Jay H. Park, Youngju Cho, Choungsol Lee, Moonwook Oh, Euiseong Seo
arxiv.org/abs/2509.25919

@arXiv_csDB_bot@mastoxiv.page
2025-10-10 08:45:19

Implementing Semantic Join Operators Efficiently
Immanuel Trummer
arxiv.org/abs/2510.08489 arxiv.org/pdf/2510.08489

@arXiv_csCR_bot@mastoxiv.page
2025-10-14 11:49:48

PrediQL: Automated Testing of GraphQL APIs with LLMs
Shaolun Liu, Sina Marefat, Omar Tsai, Yu Chen, Zecheng Deng, Jia Wang, Mohammad A. Tayebi
arxiv.org/abs/2510.10407

Unlike keyword search,
semantic search lets you search using natural language.
It looks beyond exact matches to understand the meaning and intent behind your query.
This means it can surface relevant precedents even when they're phrased differently
—something keyword searches often miss.
Semantic search is currently available through an API, but we're already working to bring it to the website—stay tuned!
And don't worry, keyword search isn…

@arXiv_csAI_bot@mastoxiv.page
2025-10-09 10:15:41

Agentic generative AI for media content discovery at the national football league
Henry Wang, Sirajus Salekin, Jake Lee, Ross Claytor, Shinan Zhang, Michael Chi
arxiv.org/abs/2510.07297

@arXiv_csSE_bot@mastoxiv.page
2025-10-08 09:48:19

Extending ResourceLink: Patterns for Large Dataset Processing in MCP Applications
Scott Frees
arxiv.org/abs/2510.05968 arxiv.org/pdf/2510.0…

@datascience@genomic.social
2025-11-04 11:00:01

Polars is a lightning fast DataFrame library/in-memory query engine with parallel execution and cache efficiency. And now you can use is with the tidyverse syntax: #rstats

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:53:17

Query-Kontext: An Unified Multimodal Model for Image Generation and Editing
Yuxin Song, Wenkai Dong, Shizun Wang, Qi Zhang, Song Xue, Tao Yuan, Hu Yang, Haocheng Feng, Hang Zhou, Xinyan Xiao, Jingdong Wang
arxiv.org/abs/2509.26641

@arXiv_csIR_bot@mastoxiv.page
2025-10-06 07:36:29

A Simple but Effective Elaborative Query Reformulation Approach for Natural Language Recommendation
Qianfeng Wen, Yifan Liu, Justin Cui, Joshua Zhang, Anton Korikov, George-Kirollos Saad, Scott Sanner
arxiv.org/abs/2510.02656

@arXiv_csHC_bot@mastoxiv.page
2025-10-09 09:29:31

RAVEN: Realtime Accessibility in Virtual ENvironments for Blind and Low-Vision People
Xinyun Cao, Kexin Phyllis Ju, Chenglin Li, Venkatesh Potluri, Dhruv Jain
arxiv.org/abs/2510.06573

@arXiv_csCL_bot@mastoxiv.page
2025-10-02 10:34:31

SAGE-LD: Towards Scalable and Generalizable End-to-End Language Diarization via Simulated Data Augmentation
Sangmin Lee, Woongjib Choi, Jihyun Kim, Hong-Goo Kang
arxiv.org/abs/2510.00582

@arXiv_csRO_bot@mastoxiv.page
2025-10-03 09:12:01

VL-KnG: Visual Scene Understanding for Navigation Goal Identification using Spatiotemporal Knowledge Graphs
Mohamad Al Mdfaa, Svetlana Lukina, Timur Akhtyamov, Arthur Nigmatzyanov, Dmitrii Nalberskii, Sergey Zagoruyko, Gonzalo Ferrer
arxiv.org/abs/2510.01483

@arXiv_csAI_bot@mastoxiv.page
2025-10-06 09:07:19

AutoMaAS: Self-Evolving Multi-Agent Architecture Search for Large Language Models
Bo Ma, Hang Li, ZeHua Hu, XiaoFan Gui, LuYao Liu, Simon Liu
arxiv.org/abs/2510.02669

@arXiv_csIT_bot@mastoxiv.page
2025-09-26 07:50:41

On Theoretical Interpretations of Concept-Based In-Context Learning
Huaze Tang, Tianren Peng, Shao-lun Huang
arxiv.org/abs/2509.20882 arxiv…

@arXiv_csCV_bot@mastoxiv.page
2025-10-01 11:42:57

SQUARE: Semantic Query-Augmented Fusion and Efficient Batch Reranking for Training-free Zero-Shot Composed Image Retrieval
Ren-Di Wu, Yu-Yen Lin, Huei-Fang Yang
arxiv.org/abs/2509.26330

@arXiv_csLG_bot@mastoxiv.page
2025-10-01 11:57:07

TASP: Topology-aware Sequence Parallelism
Yida Wang (Capital Normal University, Infinigence-AI), Ke Hong (Tsinghua University, Infinigence-AI), Xiuhong Li (Infinigence-AI), Yuanchao Xu (Capital Normal University), Wenxun Wang (Tsinghua University), Guohao Dai (Infinigence-AI, Shanghai Jiao Tong University), Yu Wang (Tsinghua University)

@arXiv_csDB_bot@mastoxiv.page
2025-10-14 08:38:58

Poseidon: A OneGraph Engine
Brad Bebee, \"Umit V. \c{C}ataly\"urek, Olaf Hartig, Ankesh Khandelwal, Simone Rondelli, Michael Schmidt, Lefteris Sidirourgos, Bryan Thompson
arxiv.org/abs/2510.11166

@arXiv_csCL_bot@mastoxiv.page
2025-10-06 13:02:20

Replaced article(s) found for cs.CL. arxiv.org/list/cs.CL/new
[2/3]:
- Query-Level Uncertainty in Large Language Models
Lihu Chen, Gerard de Melo, Fabian M. Suchanek, Ga\"el Varoquaux

@arXiv_csIR_bot@mastoxiv.page
2025-10-13 09:05:00

Doc2Query : Topic-Coverage based Document Expansion and its Application to Dense Retrieval via Dual-Index Fusion
Tzu-Lin Kuo, Wei-Ning Chiu, Wei-Yun Ma, Pu-Jen Cheng
arxiv.org/abs/2510.09557

@arXiv_csCR_bot@mastoxiv.page
2025-10-09 09:31:21

Differentially Private Synthetic Text Generation for Retrieval-Augmented Generation (RAG)
Junki Mori, Kazuya Kakizaki, Taiki Miyagawa, Jun Sakuma
arxiv.org/abs/2510.06719

@arXiv_csSD_bot@mastoxiv.page
2025-10-01 09:43:38

The silence of the weights: an investigation of structural pruning strategies for attention-based audio signal architectures
Andrea Diecidue, Carlo Alberto Barbano, Piero Fraternali, Mathieu Fontaine, Enzo Tartaglione
arxiv.org/abs/2509.26207

@arXiv_csCL_bot@mastoxiv.page
2025-10-03 10:54:01

ARUQULA -- An LLM based Text2SPARQL Approach using ReAct and Knowledge Graph Exploration Utilities
Felix Brei, Lorenz B\"uhmann, Johannes Frey, Daniel Gerber, Lars-Peter Meyer, Claus Stadler, Kirill Bulert
arxiv.org/abs/2510.02200

@arXiv_csIR_bot@mastoxiv.page
2025-10-14 11:02:49

From Reasoning LLMs to BERT: A Two-Stage Distillation Framework for Search Relevance
Runze Xia, Yupeng Ji, Yuxi Zhou, Haodong Liu, Teng Zhang, Piji Li
arxiv.org/abs/2510.11056

@arXiv_csAI_bot@mastoxiv.page
2025-10-10 10:17:59

TaoSR-SHE: Stepwise Hybrid Examination Reinforcement Learning Framework for E-commerce Search Relevance
Pengkun Jiao, Yiming Jin, Jianhui Yang, Chenhe Dong, Zerui Huang, Shaowei Yao, Xiaojiang Zhou, Dan Ou, Haihong Tang
arxiv.org/abs/2510.07972

@arXiv_csDB_bot@mastoxiv.page
2025-10-01 07:43:46

From NL2SQL to NL2GeoSQL: GeoSQL-Eval for automated evaluation of LLMs on PostGIS queries
Shuyang Hou, Haoyue Jiao, Ziqi Liu, Lutong Xie, Guanyu Chen, Shaowen Wu, Xuefeng Guan, Huayi Wu
arxiv.org/abs/2509.25264

@arXiv_csCL_bot@mastoxiv.page
2025-09-26 10:07:51

BESPOKE: Benchmark for Search-Augmented Large Language Model Personalization via Diagnostic Feedback
Hyunseo Kim, Sangam Lee, Kwangwook Seo, Dongha Lee
arxiv.org/abs/2509.21106

@arXiv_csIR_bot@mastoxiv.page
2025-10-02 09:48:21

Bridging Language Gaps: Advances in Cross-Lingual Information Retrieval with Multilingual LLMs
Roksana Goworek, Olivia Macmillan-Scott, Eda B. \"Ozyi\u{g}it
arxiv.org/abs/2510.00908

@arXiv_csLG_bot@mastoxiv.page
2025-09-26 10:29:31

Mixture of Thoughts: Learning to Aggregate What Experts Think, Not Just What They Say
Jacob Fein-Ashley, Dhruv Parikh, Rajgopal Kannan, Viktor Prasanna
arxiv.org/abs/2509.21164

@arXiv_csCL_bot@mastoxiv.page
2025-09-25 10:46:22

Play by the Type Rules: Inferring Constraints for LLM Functions in Declarative Programs
Parker Glenn, Alfy Samuel, Daben Liu
arxiv.org/abs/2509.20208

@arXiv_csAI_bot@mastoxiv.page
2025-10-06 09:55:29

CoDA: Agentic Systems for Collaborative Data Visualization
Zichen Chen, Jiefeng Chen, Sercan \"O. Arik, Misha Sra, Tomas Pfister, Jinsung Yoon
arxiv.org/abs/2510.03194

@arXiv_csDB_bot@mastoxiv.page
2025-10-09 08:35:41

Automated Discovery of Test Oracles for Database Management Systems Using LLMs
Qiuyang Mang, Runyuan He, Suyang Zhong, Xiaoxuan Liu, Huanchen Zhang, Alvin Cheung
arxiv.org/abs/2510.06663

@arXiv_csIR_bot@mastoxiv.page
2025-10-10 08:14:38

TaoSR-AGRL: Adaptive Guided Reinforcement Learning Framework for E-commerce Search Relevance
Jianhui Yang, Yiming Jin, Pengkun Jiao, Chenhe Dong, Zerui Huang, Shaowei Yao, Xiaojiang Zhou, Dan Ou, Haihong Tang
arxiv.org/abs/2510.08048

@arXiv_csDB_bot@mastoxiv.page
2025-09-29 07:44:07

QueryGym: Step-by-Step Interaction with Relational Databases
Haritha Ananthakrishanan, Harsha Kokel, Kelsey Sikes, Debarun Bhattacharjya, Michael Katz, Shirin Sohrabi, Kavitha Srinivas
arxiv.org/abs/2509.21674

@arXiv_csAI_bot@mastoxiv.page
2025-10-03 10:25:01

Learning Compact Representations of LLM Abilities via Item Response Theory
Jianhao Chen, Chenxu Wang, Gengrui Zhang, Peng Ye, Lei Bai, Wei Hu, Yuzhong Qu, Shuyue Hu
arxiv.org/abs/2510.00844

@arXiv_csIR_bot@mastoxiv.page
2025-10-10 07:35:58

Reasoning by Exploration: A Unified Approach to Retrieval and Generation over Graphs
Haoyu Han, Kai Guo, Harry Shomer, Yu Wang, Yucheng Chu, Hang Li, Li Ma, Jiliang Tang
arxiv.org/abs/2510.07484

@arXiv_csCL_bot@mastoxiv.page
2025-10-03 10:57:21

Drawing Conclusions from Draws: Rethinking Preference Semantics in Arena-Style LLM Evaluation
Raphael Tang, Crystina Zhang, Wenyan Li, Carmen Lai, Pontus Stenetorp, Yao Lu
arxiv.org/abs/2510.02306

@arXiv_csAI_bot@mastoxiv.page
2025-10-02 10:33:01

Learning Compact Representations of LLM Abilities via Item Response Theory
Jianhao Chen, Chenxu Wang, Gengrui Zhang, Peng Ye, Lei Bai, Wei Hu, Yuzhong Qu, Shuyue Hu
arxiv.org/abs/2510.00844

@arXiv_csCL_bot@mastoxiv.page
2025-10-03 10:57:01

F2LLM Technical Report: Matching SOTA Embedding Performance with 6 Million Open-Source Data
Ziyin Zhang, Zihan Liao, Hang Yu, Peng Di, Rui Wang
arxiv.org/abs/2510.02294

@arXiv_csDB_bot@mastoxiv.page
2025-09-30 07:42:22

PARROT: A Benchmark for Evaluating LLMs in Cross-System SQL Translation
Wei Zhou, Guoliang Li, Haoyu Wang, Yuxing Han, Xufei Wu, Fan Wu, Xuanhe Zhou
arxiv.org/abs/2509.23338

@arXiv_csIR_bot@mastoxiv.page
2025-10-03 09:20:01

Study on LLMs for Promptagator-Style Dense Retriever Training
Daniel Gwon, Nour Jedidi, Jimmy Lin
arxiv.org/abs/2510.02241 arxiv.org/pdf/25…