Saved in:
| Main Authors: | Jiang, Wenjia, Wang, Yiwei, Han, Boyan, Zhou, Joey Tianyi, Zhang, Chi |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01952 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DeepEye-SQL: A Software-Engineering-Inspired Text-to-SQL Framework
by: Li, Boyan, et al.
Published: (2025)
by: Li, Boyan, et al.
Published: (2025)
A Plug-and-Play Natural Language Rewriter for Natural Language to SQL
by: Ma, Peixian, et al.
Published: (2024)
by: Ma, Peixian, et al.
Published: (2024)
Towards Next Generation Data Engineering Pipelines
by: Kramer, Kevin M., et al.
Published: (2025)
by: Kramer, Kevin M., et al.
Published: (2025)
Learning to Be A Doctor: Searching for Effective Medical Agent Architectures
by: Zhuang, Yangyang, et al.
Published: (2025)
by: Zhuang, Yangyang, et al.
Published: (2025)
DeepEye: A Steerable Self-driving Data Agent System
by: Li, Boyan, et al.
Published: (2026)
by: Li, Boyan, et al.
Published: (2026)
AegisTS: A Hierarchical Agent System with Reinforcement Learning for Multivariate Time Series Data Cleaning
by: Shi, Yuhan, et al.
Published: (2026)
by: Shi, Yuhan, et al.
Published: (2026)
ROSE: An Intent-Centered Evaluation Metric for NL2SQL
by: Pei, Wenqi, et al.
Published: (2026)
by: Pei, Wenqi, et al.
Published: (2026)
EllieSQL: Cost-Efficient Text-to-SQL with Complexity-Aware Routing
by: Zhu, Yizhang, et al.
Published: (2025)
by: Zhu, Yizhang, et al.
Published: (2025)
DPC: Training-Free Text-to-SQL Candidate Selection via Dual-Paradigm Consistency
by: Li, Boyan, et al.
Published: (2026)
by: Li, Boyan, et al.
Published: (2026)
The Dawn of Natural Language to SQL: Are We Fully Ready?
by: Li, Boyan, et al.
Published: (2024)
by: Li, Boyan, et al.
Published: (2024)
NL2SQL-BUGs: A Benchmark for Detecting Semantic Errors in NL2SQL Translation
by: Liu, Xinyu, et al.
Published: (2025)
by: Liu, Xinyu, et al.
Published: (2025)
UniDataBench: Evaluating Data Analytics Agents Across Structured and Unstructured Data
by: Weng, Han, et al.
Published: (2025)
by: Weng, Han, et al.
Published: (2025)
Exploring the Heterogeneity of Tabular Data: A Diversity-aware Data Generator via LLMs
by: Tang, Yafeng, et al.
Published: (2025)
by: Tang, Yafeng, et al.
Published: (2025)
AI-Driven Generation of Data Contracts in Modern Data Engineering Systems
by: Bhoite, Harshraj
Published: (2025)
by: Bhoite, Harshraj
Published: (2025)
Alpha-SQL: Zero-Shot Text-to-SQL using Monte Carlo Tree Search
by: Li, Boyan, et al.
Published: (2025)
by: Li, Boyan, et al.
Published: (2025)
Aixel: A Unified, Adaptive and Extensible System for AI-powered Data Analysis
by: Zhang, Meihui, et al.
Published: (2025)
by: Zhang, Meihui, et al.
Published: (2025)
Crane: An Accurate and Scalable Neural Sketch for Graph Stream Summarization
by: Wang, Boyan, et al.
Published: (2026)
by: Wang, Boyan, et al.
Published: (2026)
Honesty-Aware Multi-Agent Framework for High-Fidelity Synthetic Data Generation in Digital Psychiatric Intake Doctor-Patient Interactions
by: Zhang, Xinyuan, et al.
Published: (2026)
by: Zhang, Xinyuan, et al.
Published: (2026)
Towards Automated Cross-domain Exploratory Data Analysis through Large Language Models
by: Zhu, Jun-Peng, et al.
Published: (2024)
by: Zhu, Jun-Peng, et al.
Published: (2024)
DSL-R1: From SQL to DSL for Training Retrieval Agents across Structured and Unstructured Data with Reinforcement Learning
by: Hu, Yunhai, et al.
Published: (2026)
by: Hu, Yunhai, et al.
Published: (2026)
PV-SQL: Synergizing Database Probing and Rule-based Verification for Text-to-SQL Agents
by: Tian, Yuan, et al.
Published: (2026)
by: Tian, Yuan, et al.
Published: (2026)
Efficient Data Ingestion in Cloud-based architecture: a Data Engineering Design Pattern Proposal
by: Rucco, Chiara, et al.
Published: (2025)
by: Rucco, Chiara, et al.
Published: (2025)
Unveiling Challenges for LLMs in Enterprise Data Engineering
by: Bodensohn, Jan-Micha, et al.
Published: (2025)
by: Bodensohn, Jan-Micha, et al.
Published: (2025)
Replacing Multi-Step Assembly of Data Preparation Pipelines with One-Step LLM Pipeline Generation for Table QA
by: Li, Fengyu, et al.
Published: (2026)
by: Li, Fengyu, et al.
Published: (2026)
Towards Autonomous Graph Data Analytics with Analytics-Augmented Generation
by: Wang, Qiange, et al.
Published: (2026)
by: Wang, Qiange, et al.
Published: (2026)
A Survey of Text-to-SQL in the Era of LLMs: Where are we, and where are we going?
by: Liu, Xinyu, et al.
Published: (2024)
by: Liu, Xinyu, et al.
Published: (2024)
AppAgentX: Evolving GUI Agents as Proficient Smartphone Users
by: Jiang, Wenjia, et al.
Published: (2025)
by: Jiang, Wenjia, et al.
Published: (2025)
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
by: Zhu, Yizhang, et al.
Published: (2025)
by: Zhu, Yizhang, et al.
Published: (2025)
MQRLD: A Multimodal Data Retrieval Platform with Query-aware Feature Representation and Learned Index Based on Data Lake
by: Sheng, Ming, et al.
Published: (2024)
by: Sheng, Ming, et al.
Published: (2024)
ByteCard: Enhancing ByteDance's Data Warehouse with Learned Cardinality Estimation
by: Han, Yuxing, et al.
Published: (2024)
by: Han, Yuxing, et al.
Published: (2024)
Exploring Distance Query Processing in Edge Computing Environments
by: Zhang, Xiubo, et al.
Published: (2024)
by: Zhang, Xiubo, et al.
Published: (2024)
TiInsight: A SQL-based Automated Exploratory Data Analysis System through Large Language Models
by: Zhu, Jun-Peng, et al.
Published: (2026)
by: Zhu, Jun-Peng, et al.
Published: (2026)
Toward a Cognitive Data Model: Exploring a Mind-Inspired Approach to Database Design
by: Pieris, Dhammika
Published: (2025)
by: Pieris, Dhammika
Published: (2025)
First Tree-like Quantum Data Structure: Quantum B+ Tree
by: Liu, Hao, et al.
Published: (2024)
by: Liu, Hao, et al.
Published: (2024)
DeepMapping: Learned Data Mapping for Lossless Compression and Efficient Lookup
by: Zhou, Lixi, et al.
Published: (2023)
by: Zhou, Lixi, et al.
Published: (2023)
OneDB: A Distributed Multi-Metric Data Similarity Search System
by: Qian, Tang, et al.
Published: (2025)
by: Qian, Tang, et al.
Published: (2025)
Algorithmic Complexity Attacks on All Learned Cardinality Estimators: A Data-centric Approach
by: Li, Yingze, et al.
Published: (2025)
by: Li, Yingze, et al.
Published: (2025)
A Database Engineered System for Big Data Analytics on Tornado Climatology
by: Bian, Fengfan, et al.
Published: (2024)
by: Bian, Fengfan, et al.
Published: (2024)
Hard to Read, Easy to Jailbreak: How Visual Degradation Bypasses MLLM Safety Alignment
by: Song, Zhixue, et al.
Published: (2026)
by: Song, Zhixue, et al.
Published: (2026)
BMTree: Designing, Learning, and Updating Piecewise Space-Filling Curves for Multi-Dimensional Data Indexing
by: Li, Jiangneng, et al.
Published: (2025)
by: Li, Jiangneng, et al.
Published: (2025)
Similar Items
-
DeepEye-SQL: A Software-Engineering-Inspired Text-to-SQL Framework
by: Li, Boyan, et al.
Published: (2025) -
A Plug-and-Play Natural Language Rewriter for Natural Language to SQL
by: Ma, Peixian, et al.
Published: (2024) -
Towards Next Generation Data Engineering Pipelines
by: Kramer, Kevin M., et al.
Published: (2025) -
Learning to Be A Doctor: Searching for Effective Medical Agent Architectures
by: Zhuang, Yangyang, et al.
Published: (2025) -
DeepEye: A Steerable Self-driving Data Agent System
by: Li, Boyan, et al.
Published: (2026)