
2025-07-22 07:41:00
Schemora: schema matching via multi-stage recommendation and metadata enrichment using off-the-shelf llms
Osman Erman Gungor, Derak Paulsen, William Kang
https://arxiv.org/abs/2507.14376
Schemora: schema matching via multi-stage recommendation and metadata enrichment using off-the-shelf llms
Osman Erman Gungor, Derak Paulsen, William Kang
https://arxiv.org/abs/2507.14376
TASER: Table Agents for Schema-guided Extraction and Recommendation
Nicole Cho, Kirsty Fielding, William Watson, Sumitra Ganesh, Manuela Veloso
https://arxiv.org/abs/2508.13404 …
Conceptual and Design Principles for a Self-Referential Algorithm Mimicking Neuronal Assembly Functions
Paolo Totaro, Alberto Mangiante
https://arxiv.org/abs/2507.14011
On Website Technicals (2021-11) - Tech updates: more meters, reutils tweets upgrade, DB-based Event and Product schema.org. - https://www.earth.org.uk/note-on-site-technicals-54.html
CRED-SQL: Enhancing Real-world Large Scale Database Text-to-SQL Parsing through Cluster Retrieval and Execution Description
Shaoming Duan, Zirui Wang, Chuanyi Liu, Zhibin Zhu, Yuhao Zhang, Peiyi Han, Liang Yan, Zewu Penge
https://arxiv.org/abs/2508.12769
LLMATCH: A Unified Schema Matching Framework with Large Language Models
Sha Wang, Yuchen Li, Hanhua Xiao, Bing Tian Dai, Roy Ka-Wei Lee, Yanfei Dong, Lambert Deng
https://arxiv.org/abs/2507.10897
A Comparative Study of Delta Parquet, Iceberg, and Hudi for Automotive Data Engineering Use Cases
Dinesh Eswararaj, Ajay Babu Nellipudi, Vandana Kollati
https://arxiv.org/abs/2508.13396
Maya-Tupi graphs: a generalization of split graphs
J\'ulio Ara\'ujo, C\'esar Hern\'andez-Cruz, Cl\'audia Linhares
https://arxiv.org/abs/2508.13424 https://…
Evaluating ASR robustness to spontaneous speech errors: A study of WhisperX using a Speech Error Database
John Alderete, Macarious Kin Fung Hui, Aanchan Mohan
https://arxiv.org/abs/2508.13060
Human-AI Schema Discovery and Application for Creative Problem Solving
Sitong Wang
https://arxiv.org/abs/2508.05045 https://arxiv.org/pdf/2508.05045…
oh, i almost forgot:
- #PHP is great
- #JSON schema validation is great: https://github.com/opis/json-schema
A Schema.org Mapping for Brazilian Legal Norms: Toward Interoperable Legal Graphs and Open Government Data
Hudson de Martim
https://arxiv.org/abs/2508.00827 https://
PIMBS: Efficient Body Schema Learning for Musculoskeletal Humanoids with Physics-Informed Neural Networks
Kento Kawaharazuka, Takahiro Hattori, Keita Yoneda, Kei Okada
https://arxiv.org/abs/2506.20343
AI-assisted JSON Schema Creation and Mapping
Felix Neubauer, J\"urgen Pleiss, Benjamin Uekermann
https://arxiv.org/abs/2508.05192 https://arxiv.org/pd…
XML Prompting as Grammar-Constrained Interaction: Fixed-Point Semantics, Convergence Guarantees, and Human-AI Protocols
Faruk Alpay, Taylan Alpay
https://arxiv.org/abs/2509.08182
Disentangling the schema turn: Restoring the information base to conceptual modelling
Chris Partridge, Andrew Mitchell, Sergio de Cesare, Oscar Xiberta Soto
https://arxiv.org/abs/2509.01617
FIRESPARQL: A LLM-based Framework for SPARQL Query Generation over Scholarly Knowledge Graphs
Xueli Pan, Victor de Boer, Jacco van Ossenbruggen
https://arxiv.org/abs/2508.10467 …
An Agentic Toolkit for Adaptive Information Extraction from Regulatory Documents
Gaye Colakoglu, G\"urkan Solmaz, Jonathan F\"urst
https://arxiv.org/abs/2509.11773 htt…
Schema-Guided Response Generation using Multi-Frame Dialogue State for Motivational Interviewing Systems
Jie Zeng, Yukiko I. Nakano
https://arxiv.org/abs/2508.20635 https://
Cross-Dataset Semantic Segmentation Performance Analysis: Unifying NIST Point Cloud City Datasets for 3D Deep Learning
Alexander Nikitas Dimopoulos, Joseph Grasso
https://arxiv.org/abs/2508.00822
Automated Creation and Enrichment Framework for Improved Invocation of Enterprise APIs as Tools
Prerna Agarwal, Himanshu Gupta, Soujanya Soni, Rohith Vallam, Renuka Sindhgatta, Sameep Mehta
https://arxiv.org/abs/2509.11626
A Unified Ontology for Scalable Knowledge Graph-Driven Operational Data Analytics in High-Performance Computing Systems
Junaid Ahmed Khan, Andrea Bartolini
https://arxiv.org/abs/2507.06107
BEAVR: Bimanual, multi-Embodiment, Accessible, Virtual Reality Teleoperation System for Robots
Alejandro Posadas-Nava, Alejandro Carrasco, Richard Linares
https://arxiv.org/abs/2508.09606
Template-Based Schema Matching of Multi-Layout Tenancy Schedules:A Comparative Study of a Template-Based Hybrid Matcher and the ALITE Full Disjunction Model
Tim Uilkema, Yao Ma, Seyed Sahand Mohammadi Ziabari, Joep van Vliet
https://arxiv.org/abs/2507.02020
Schema sorted finally!
#11ty
Went for a modular set of includes in the end.
from my link log —
Row polymorphic programming.
https://www.stranger.systems/posts/by-slug/row-polymorphic-programming.html
saved 2025-07-14
The Open mulTiwavelength Transient Event Repository (OTTER): Infrastructure Release and Tidal Disruption Event Catalog
Noah Franz, Kate D Alexander, Sebastian Gomez, Collin T Christy, Tanmoy Laskar, Sjoert van Velzen, Nicholas Earl, Suvi Gezari, Mitchell Karmen, Raffaella Margutti, Jeniveve Pearson, V. Ashley Villar, Ann I Zabludoff
https://
In PostgreSQL, managing schema changes without experiencing downtime can be quite challenging. At this year’s Berlin Buzzwords, Gülçin Yıldırım Jelínek explored locking mechanisms in PostgreSQL, specifically focusing on table-level locks acquired through Data Definition Language (DDL) operations, and discussed various tools and techniques that can help minimise the impact of locking.
Watch the full session:
Evaluating NL2SQL via SQL2NL
Mohammadtaher Safarzadeh, Afshin Oroojlooyjadid, Dan Roth
https://arxiv.org/abs/2509.04657 https://arxiv.org/pdf/2509.04657
🤖 Automate database management tasks - let AI handle queries, table creation & indexing
📊 Generate context-aware application code & tests with real-time database schema understanding
🔄 Dynamic reloading support - update tools without redeploying your application
SecureFed: A Two-Phase Framework for Detecting Malicious Clients in Federated Learning
Likhitha Annapurna Kavuri, Akshay Mhatre, Akarsh K Nair, Deepti Gupta
https://arxiv.org/abs/2506.16458
BDIViz: An Interactive Visualization System for Biomedical Schema Matching with LLM-Powered Validation
Eden Wu, Dishita G Turakhia, Guande Wu, Christos Koutras, Sarah Keegan, Wenke Liu, Beata Szeitz, David Fenyo, Cl\'audio T. Silva, Juliana Freire
https://arxiv.org/abs/2507.16117
Multimodal Information Retrieval for Open World with Edit Distance Weak Supervision
KMA Solaiman, Bharat Bhargava
https://arxiv.org/abs/2506.20070 https://…
The KG-ER Conceptual Schema Language
Enrico Franconi, Beno\^it Groz, Jan Hidders, Nina Pardal, S{\l}awek Staworko, Jan Van den Bussche, Piotr Wieczorek
https://arxiv.org/abs/2508.02548
Setting The Table with Intent: Intent-aware Schema Generation and Editing for Literature Review Tables
Vishakh Padmakumar, Joseph Chee Chang, Kyle Lo, Doug Downey, Aakanksha Naik
https://arxiv.org/abs/2507.19521
From Provable Correctness to Probabilistic Generation: A Comparative Review of Program Synthesis Paradigms
Zurabi Kobaladze, Anna Arnania, Tamar Sanikidze
https://arxiv.org/abs/2508.00013
Fuzzy, Symbolic, and Contextual: Enhancing LLM Instruction via Cognitive Scaffolding
Vanessa Figueiredo
https://arxiv.org/abs/2508.21204 https://arxiv.org/…
MR-UIE: Multi-Perspective Reasoning with Reinforcement Learning for Universal Information Extraction
Zhongqiu Li, Shiquan Wang, Ruiyu Fang, Mengjiao Bao, Zhenhe Wu, Shuangyong Song, Yongxiang Li, Zhongjiang He
https://arxiv.org/abs/2509.09082
Improving Knowledge Graph Understanding with Contextual Views
Antrea Christou, Cogan Shimizu
https://arxiv.org/abs/2508.02413 https://arxiv.org/pdf/2508.02…
Illuminating Patterns of Divergence: DataDios SmartDiff for Large-Scale Data Difference Analysis
Aryan Poduri, Yashwant Tailor
https://arxiv.org/abs/2509.00293 https://
GLiNER2: An Efficient Multi-Task Information Extraction System with Schema-Driven Interface
Urchade Zaratiana, Gil Pasternak, Oliver Boyd, George Hurn-Maloney, Ash Lewis
https://arxiv.org/abs/2507.18546
Cost for research -- how cost data of research can be included in open metadata to be reused and evaluated
Julia Bartlewski, Christoph Broschinski, Gernot Deinzer, Cornelia Lang, Dirk Pieper, Bianca Schweighofer, Colin Sippl, Lisa-Marie Stein, Alexander Wagner, Silke Weisheit
https://arxiv.org/abs/2506.18517
SQL-Exchange: Transforming SQL Queries Across Domains
Mohammadreza Daviran, Brian Lin, Davood Rafiei
https://arxiv.org/abs/2508.07087 https://arxiv.org/pdf…
XiYan-SQL: A Novel Multi-Generator Framework For Text-to-SQL
Yifu Liu, Yin Zhu, Yingqi Gao, Zhiling Luo, Xiaoxia Li, Xiaorong Shi, Yuntao Hong, Jinyang Gao, Yu Li, Bolin Ding, Jingren Zhou
https://arxiv.org/abs/2507.04701
🔧 Strict-Mode Function Calling
Beta API now supports strict-mode for function calling, ensuring the output always complies with the defined JSON schema.
🔗 Anthropic API Compatibility
The API is now compatible with Anthropic API, allowing seamless integration with Claude Code.
💰 Pricing Adjustment (Effective Sept. 5, 2025, 16:00 UTC)
Input tokens: $ 0.07/million (cache hit) / $ 0.56/million (cache miss)
Output tokens: $ 1.68/million
The off-peak discount e…
KERAG: Knowledge-Enhanced Retrieval-Augmented Generation for Advanced Question Answering
Yushi Sun, Kai Sun, Yifan Ethan Xu, Xiao Yang, Xin Luna Dong, Nan Tang, Lei Chen
https://arxiv.org/abs/2509.04716
Making REST APIs Agent-Ready: From OpenAPI to Model Context Protocol Servers for Tool-Augmented LLMs
Meriem Mastouri, Emna Ksontini, Wael Kessentini
https://arxiv.org/abs/2507.16044
Lightweight Transformers for Zero-Shot and Fine-Tuned Text-to-SQL Generation Using Spider
Chirag Seth, Utkarsh Singh
https://arxiv.org/abs/2508.04623 https://
HARPT: A Corpus for Analyzing Consumers' Trust and Privacy Concerns in Mobile Health Apps
Timoteo Kelly, Abdulkadir Korkmaz, Samuel Mallet, Connor Souders, Sadra Aliakbarpour, Praveen Rao
https://arxiv.org/abs/2506.19268
Data-Aware Socratic Query Refinement in Database Systems
Ruiyuan Zhang, Chrysanthi Kosyfaki, Xiaofang Zhou
https://arxiv.org/abs/2508.05061 https://arxiv.o…
Replaced article(s) found for cs.DB. https://arxiv.org/list/cs.DB/new
[1/1]:
- Enhancing Text2Cypher with Schema Filtering
Makbule Gulcin Ozsoy
https://
Crosslisted article(s) found for cs.DB. https://arxiv.org/list/cs.DB/new
[1/1]:
- Robust Detection of Synthetic Tabular Data under Schema Variability
G. Charbel N. Kindji (MALT), Elisa Fromont (MALT), Lina Maria Rojas-Barahona, Tanguy Urvoy
SQL-of-Thought: Multi-agentic Text-to-SQL with Guided Error Correction
Saumya Chaturvedi, Aman Chadha, Laurent Bindschaedler
https://arxiv.org/abs/2509.00581 https://
Evaluating Structured Decoding for Text-to-Table Generation: Evidence from Three Datasets
Julian Oestreich, Lydia M\"uller
https://arxiv.org/abs/2508.15910 https://
AI-Driven Generation of Data Contracts in Modern Data Engineering Systems
Harshraj Bhoite
https://arxiv.org/abs/2507.21056 https://arxiv.org/pdf/2507.21056…