
2025-08-21 08:07:29
Confidence Estimation for Text-to-SQL in Large Language Models
Sepideh Entezari Maleki, Mohammadreza Pourreza, Davood Rafiei
https://arxiv.org/abs/2508.14056 https://
Confidence Estimation for Text-to-SQL in Large Language Models
Sepideh Entezari Maleki, Mohammadreza Pourreza, Davood Rafiei
https://arxiv.org/abs/2508.14056 https://
CRED-SQL: Enhancing Real-world Large Scale Database Text-to-SQL Parsing through Cluster Retrieval and Execution Description
Shaoming Duan, Zirui Wang, Chuanyi Liu, Zhibin Zhu, Yuhao Zhang, Peiyi Han, Liang Yan, Zewu Penge
https://arxiv.org/abs/2508.12769
@… Wait until you hear how some whackos pronounce SQL.
Crosslisted article(s) found for cs.AI. https://arxiv.org/list/cs.AI/new
[9/10]:
- CRED-SQL: Enhancing Real-world Large Scale Database Text-to-SQL Parsing through Cluster Retrieval...
Shaoming Duan, Zirui Wang, Chuanyi Liu, Zhibin Zhu, Yuhao Zhang, Peiyi Han, Liang Yan, Zewu Penge
Quantum Phase Estimation Beyond the Gaussian Limit
Kimin Park, Tanjung Krisnanda, Yvonne Gao, Radim Filip
https://arxiv.org/abs/2508.13046 https://arxiv.or…
Software developer Heval Hazal Kurt discusses the pros and cons of relational databases, in comparison with their document-based counterparts, in this July 2025 article. Polyglot systems combining both paradigms are discussed as an ideal solution to differing data access needs within a same project.
"When to Choose NoSQL Over SQL"
PSA for #mechanicalkeyboard enthusiasts who are also using #SQL:
group buy ≠ groupby
(SCNR)
LLM-Driven Data Generation and a Novel Soft Metric for Evaluating Text-to-SQL in Aviation MRO
Patrick Sutanto, Jonathan Kenrick, Max Lorenz, Joan Santoso
https://arxiv.org/abs/2506.13785
Datrics Text2SQL: A Framework for Natural Language to SQL Query Generation
Tetiana Gladkykh, Kyrylo Kirykov
https://arxiv.org/abs/2506.12234 https://
HI-SQL: Optimizing Text-to-SQL Systems through Dynamic Hint Integration
Ganesh Parab, Zishan Ahmad, Dagnachew Birru
https://arxiv.org/abs/2506.18916 https:…
Today’s ride was 8.3 miles… nothing too exciting to report. (In other news I’ve been relearning SQL to pull data from the RunGap SQLite database.)
#biking #bikeTooter #mke
This https://arxiv.org/abs/2505.14690 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csPL_…
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL
Yifu Liu, Yin Zhu, Yingqi Gao, Zhiling Luo, Xiaoxia Li, Xiaorong Shi, Yuntao Hong, Jinyang Gao, Yu Li, Bolin Ding, Jingren Zhou
https://arxiv.org/abs/2507.04701
Crosslisted article(s) found for cs.DB. https://arxiv.org/list/cs.DB/new
[1/1]:
- Confidence Estimation for Text-to-SQL in Large Language Models
Sepideh Entezari Maleki, Mohammadreza Pourreza, Davood Rafiei
Little #SQLServer Tricks: Fix a Database Stuck in Restoring Mode
https://improveandrepeat.com/2025/07/little-sql-server-tricks-fix-a-datab…
Leveraging large language models for SQL behavior-based database intrusion detection
Meital Shlezinger, Shay Akirav, Lei Zhou, Liang Guo, Avi Kessel, Guoliang Li
https://arxiv.org/abs/2508.05690
Don't you love when you're trying to pwn something and stumble across a different bug than the one you were trying to find?
Some years ago I was trying to XSS a webapp and was confronted with a SQL error.
Turns out it *was* XSS-able, but you had to SQL-escape your javascript first.
Note to self: Look up the tech advisor for the movie "Jason Bourne".
#jasonbourne #sql
Invocable APIs derived from NL2SQL datasets for LLM Tool-Calling Evaluation
Benjamin Elder, Anupama Murthi, Jungkoo Kang, Ankita Rajaram Naik, Kiran Kate, Kinjal Basu, Danish Contractor
https://arxiv.org/abs/2506.11266
🔧 Integrate tools to your agent in less than 10 lines of code - reuse between multiple agents or frameworks
💬 Query databases in plain English directly from your #IDE - no SQL writing needed
SQL-Exchange: Transforming SQL Queries Across Domains
Mohammadreza Daviran, Brian Lin, Davood Rafiei
https://arxiv.org/abs/2508.07087 https://arxiv.org/pdf…
Day 16
Just published a deep dive into building a secure login page with Next.js, NestJS, JWT, and PostgreSQL.
- Email verification
- Role-based access control
- Subscription enforcement
- Token decoding in frontend
- SQL-level inserts for system roles
Includes full code snippets and explanation of the entire flow.
Perfect if you're working on full-stack apps with JavaScript, TypeScript, and SQL.
Focus and Context and LLMs | Taras' Blog on AI, Perf, Hacks
#AI
Im using case_when() quite a lot, case_match() is new to me: #rstats
New chart library for SQL notebook https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/chart-library.json
Spark SQL pipe (|>) for Spark 4.0.0?!
https://issues.apache.org/jira/browse/SPARK-49555
https://
SLM-SQL: An Exploration of Small Language Models for Text-to-SQL
Lei Sheng, Shuai-Shuai Xu
https://arxiv.org/abs/2507.22478 https://arxiv.org/pdf/2507.2247…
@… This is very cool!
I did something similar for PostgreSQL last month.
https://codeberg.org/ooble/tremuloides
SQLord: A Robust Enterprise Text-to-SQL Solution via Reverse Data Generation and Workflow Decomposition
Song Cheng, Qiannan Cheng, Linbo Jin, Lei Yi, Guannan Zhang
https://arxiv.org/abs/2507.10629
Improving Table Retrieval with Question Generation from Partial Tables
Hsing-Ping Liang, Che-Wei Chang, Yao-Chung Fan
https://arxiv.org/abs/2508.06168 https://
It's the year 2025 and Microsoft have released version 21 (Twenty One!) of Sql Server Management Studio and apparently they've lost the secret to the technology that makes it so that when you double click a file to open it, it doesn't start a whole new instance but instead opens the file in the running instance
I mean you read the stories about how NASA forgot how to be able to read the tapes full of Mars data or whatever but you don't expect it to happen to something y…
This https://arxiv.org/abs/2412.05561 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
Replaced article(s) found for cs.PL. https://arxiv.org/list/cs.PL/new
[1/1]:
- MCTS-SQL: Light-Weight LLMs can Master the Text-to-SQL through Monte Carlo Tree Search
Shuozhi Yuan, Limin Chen, Miaomiao Yuan, Jin Zhao
My experience is limited to ASP.NET, SQL, and some Python at a Fortune 500 company, so take this with a grain of salt. When he talks about agents, it sounds like automation to me. I’ve been writing jobs (or agents) for decades to run automated tasks on a schedule. If you want to add another point of failure into your job, knock yourself out. Also, I’m wary of encouraging neophytes to outsource work they don’t know how to do. That’s a recipe for disaster.
SQLBarber: A System Leveraging Large Language Models to Generate Customized and Realistic SQL Workloads
Jiale Lao, Immanuel Trummer
https://arxiv.org/abs/2507.06192
SQL Notebook: Random cluster in PostScript
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/random-cluster.json
Lightweight Transformers for Zero-Shot and Fine-Tuned Text-to-SQL Generation Using Spider
Chirag Seth, Utkarsh Singh
https://arxiv.org/abs/2508.04623 https://
Qymera: Simulating Quantum Circuits using RDBMS
Tim Littau, Rihan Hai
https://arxiv.org/abs/2506.08759 https://arxiv.org/pdf/2506.087…
Interactive Text-to-SQL via Expected Information Gain for Disambiguation
Luyu Qiu, Jianing Li, Chi Su, Lei Chen
https://arxiv.org/abs/2507.06467 https://…
GPU Acceleration of SQL Analytics on Compressed Data
Zezhou Huang, Krystian Sakowski, Hans Lehnert, Wei Cui, Carlo Curino, Matteo Interlandi, Marius Dumitru, Rathijit Sen
https://arxiv.org/abs/2506.10092
Confidence Scoring for LLM-Generated SQL in Supply Chain Data Extraction
Jiekai Ma, Yikai Zhao
https://arxiv.org/abs/2506.17203 https://
E3-Rewrite: Learning to Rewrite SQL for Executability, Equivalence,and Efficiency
Dongjie Xu, Yue Cui, Weijie Shi, Qingzhi Ma, Hanghui Guo, Jiaming Li, Yao Zhao, Ruiyuan Zhang, Shimin Di, Jia Zhu, Kai Zheng, Jiajie Xu
https://arxiv.org/abs/2508.09023
This https://arxiv.org/abs/2506.03308 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csCR_…
Text-to-SQL Task-oriented Dialogue Ontology Construction
Renato Vukovic, Carel van Niekerk, Michael Heck, Benjamin Ruppik, Hsien-Chin Lin, Shutong Feng, Nurul Lubis, Milica Gasic
https://arxiv.org/abs/2507.23358
SWE-SQL: Illuminating LLM Pathways to Solve User SQL Issues in Real-World Applications
Jinyang Li, Xiaolong Li, Ge Qu, Per Jacobsson, Bowen Qin, Binyuan Hui, Shuzheng Si, Nan Huo, Xiaohan Xu, Yue Zhang, Ziwei Tang, Yuanshuai Li, Florensia Widjaja, Xintong Zhu, Feige Zhou, Yongfeng Huang, Yannis Papakonstantinou, Fatma Ozcan, Chenhao Ma, Reynold Cheng
Replaced article(s) found for cs.DB. https://arxiv.org/list/cs.DB/new
[1/1]:
- EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing
Yizhang Zhu, Runzhi Jiang, Boyan Li, Nan Tang, Yuyu Luo
SQL Notebook pattern fill
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/pattern-fill.json
Replaced article(s) found for cs.CL. https://arxiv.org/list/cs.CL/new
[3/3]:
- Structure Guided Large Language Model for SQL Generation
Qinggang Zhang, Hao Chen, Junnan Dong, Shengyuan Chen, Feiran Huang, Xiao Huang
A Learned Cost Model-based Cross-engine Optimizer for SQL Workloads
Andr\'as Strausz, Niels Pardon, Ioana Giurgiu
https://arxiv.org/abs/2506.02802 http…
eSapiens: A Real-World NLP Framework for Multimodal Document Understanding and Enterprise Knowledge Processing
Isaac Shi, Zeyuan Li, Wenli Wang, Lewei He, Yang Yang, Tianyu Shi
https://arxiv.org/abs/2506.16768
Raqlet: Cross-Paradigm Compilation for Recursive Queries
Amir Shaikhha, Youning Xia, Meisam Tarabkhah, Jazal Saleem, Anna Herlihy
https://arxiv.org/abs/2508.03978 https://
This https://arxiv.org/abs/2505.19988 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
A Functional Data Model and Query Language is All You Need
Jens Dittrich
https://arxiv.org/abs/2507.20671 https://arxiv.org/pdf/2507.20671
Reasoning-Table: Exploring Reinforcement Learning for Table Reasoning
Fangyu Lei, Jinxiang Meng, Yiming Huang, Tinghong Chen, Yun Zhang, Shizhu He, Jun Zhao, Kang Liu
https://arxiv.org/abs/2506.01710
Plots in SQL Notebook
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/plots.json
TailorSQL: An NL2SQL System Tailored to Your Query Workload
Kapil Vaidya, Jialin Ding, Sebastian Kosak, David Kernert, Chuan Lei, Xiao Qin, Abhinav Tripathy, Ramesh Balan, Balakrishnan Narayanaswamy, Tim Kraska
https://arxiv.org/abs/2505.23039
SQL notebook can now also do mathematical notation
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/mathematical-notation.json
Query, Don't Train: Privacy-Preserving Tabular Prediction from EHR Data via SQL Queries
Josefa Lia Stoisser, Marc Boubnovski Martell, Kaspar M\"artens, Lawrence Phillips, Stephen Michael Town, Rory Donovan-Maiye, Julien Fauqueur
https://arxiv.org/abs/2505.21801
Beyond Natural Language Plans: Structure-Aware Planning for Query-Focused Table Summarization
Weijia Zhang, Songgaojun Deng, Evangelos Kanoulas
https://arxiv.org/abs/2507.22829 …
How to use Linux on iPad to convert PNG to GIF
This is a proof of concept. When you create animations in PostScript on SQL Notebook, you can download the PNG images as ZIP file.
Now, If I installed Linux and ffmpeg on iPad, would that work? As a matter of fact, I will show you that it is possible, but it is very slow.
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/how-to-use-linux-on-ipad-to-convert-png-to-gif.json
Replaced article(s) found for cs.DB. https://arxiv.org/list/cs.DB/new
[1/1]:
- MCTS-SQL: Light-Weight LLMs can Master the Text-to-SQL through Monte Carlo Tree Search
Shuozhi Yuan, Limin Chen, Miaomiao Yuan, Jin Zhao
While I was browsing for LaTeX samples of charts, I found this periodic table of elements and was quite surprised looking at the complexity of the source code. I thought PostScript could do more readable code.
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/periodic-table-of-elements.json
LINEAGEX: A Column Lineage Extraction System for SQL
Shi Heng Zhang, Zhengjie Miao, Jiannan Wang
https://arxiv.org/abs/2505.23133 https://
SQL notebook: rounded polygons
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/round-polygons.json
QUITE: A Query Rewrite System Beyond Rules with LLM Agents
Yuyang Song, Hanxu Yan, Jiale Lao, Yibo Wang, Yufei Li, Yuanchun Zhou, Jianguo Wang, Mingjie Tang
https://arxiv.org/abs/2506.07675
Replaced article(s) found for cs.DB. https://arxiv.org/list/cs.DB/new
[1/1]:
- Structure Guided Large Language Model for SQL Generation
Qinggang Zhang, Hao Chen, Junnan Dong, Shengyuan Chen, Feiran Huang, Xiao Huang
SQL notebook 3D rendering
https://www.belle-nuit.com/sql-notebook/index.html?url=../site/files/3d-rendering.json
This https://arxiv.org/abs/2505.21801 has been replaced.
initial toot: https://mastoxiv.page/@arXiv_csDB_…
Training-Free Query Optimization via LLM-Based Plan Similarity
Nikita Vasilenko, Alexander Demin, Vladimir Boorlakov
https://arxiv.org/abs/2506.05853 https…
LLM4Hint: Leveraging Large Language Models for Hint Recommendation in Offline Query Optimization
Suchen Liu, Jun Gao, Yinjun Han, Yang Lin
https://arxiv.org/abs/2507.03384
Rethinking Analytical Processing in the GPU Era
Bobbi Yogatama, Yifei Yang, Kevin Kristensen, Devesh Sarda, Abigale Kim, Adrian Cockcroft, Yu Teng, Joshua Patterson, Gregory Kimball, Wes McKinney, Weiwei Gong, Xiangyao Yu
https://arxiv.org/abs/2508.04701
SSCard: Substring Cardinality Estimation using Suffix Tree-Guided Learned FM-Index
Yirui Zhan, Wen Nie, Jun Gao
https://arxiv.org/abs/2505.24312 https://…
GaussMaster: An LLM-based Database Copilot System
Wei Zhou, Ji Sun, Xuanhe Zhou, Guoliang Li, Luyang Liu, Hao Wu, Tianyuan Wang
https://arxiv.org/abs/2506.23322
An advanced AI driven database system
M. Tedeschi, S. Rizwan, C. Shringi, V. Devram Chandgir, S. Belich
https://arxiv.org/abs/2507.17778 https://arxiv.org/…
PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking
Yan Zhou, Chunwei Liu, Bhuvan Urgaonkar, Zhengle Wang, Magnus Mueller, Chao Zhang, Songyue Zhang, Pascal Pfeil, Dominik Horn, Zhengchun Liu, Davide Pagano, Tim Kraska, Samuel Madden, Ju Fan
https://arxiv.org/abs/2506.16379