Saved in:
| Main Authors: | Zhang, Wenxiao, Liu, Yu, sun, Qiang, Ding, Yihao, Li, Sirui, Liu, Yanbing, Hong, Jin B., Liu, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.08597 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
by: Zhang, Hengrui, et al.
Published: (2025)
by: Zhang, Hengrui, et al.
Published: (2025)
Invoice Information Extraction: Methods and Performance Evaluation
by: Yashwant, Sai, et al.
Published: (2025)
by: Yashwant, Sai, et al.
Published: (2025)
Schema Lineage Extraction at Scale: Multilingual Pipelines, Composite Evaluation, and Language-Model Benchmarks
by: Yin, Jiaqi, et al.
Published: (2025)
by: Yin, Jiaqi, et al.
Published: (2025)
Towards Controllable Time Series Generation
by: Bao, Yifan, et al.
Published: (2024)
by: Bao, Yifan, et al.
Published: (2024)
Can Language Models Enable In-Context Database?
by: Pan, Yu, et al.
Published: (2024)
by: Pan, Yu, et al.
Published: (2024)
Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks
by: Chen, Junlan, et al.
Published: (2025)
by: Chen, Junlan, et al.
Published: (2025)
Is Long Context All You Need? Leveraging LLM's Extended Context for NL2SQL
by: Chung, Yeounoh, et al.
Published: (2025)
by: Chung, Yeounoh, et al.
Published: (2025)
Docs2Synth: A Synthetic Data Trained Retriever Framework for Scanned Visually Rich Documents Understanding
by: Ding, Yihao, et al.
Published: (2026)
by: Ding, Yihao, et al.
Published: (2026)
Transforming Football Data into Object-centric Event Logs with Spatial Context Information
by: Chan, Vito, et al.
Published: (2025)
by: Chan, Vito, et al.
Published: (2025)
LEDD: Large Language Model-Empowered Data Discovery in Data Lakes
by: An, Qi, et al.
Published: (2025)
by: An, Qi, et al.
Published: (2025)
OpenOmni: A Collaborative Open Source Tool for Building Future-Ready Multimodal Conversational Agents
by: Sun, Qiang, et al.
Published: (2024)
by: Sun, Qiang, et al.
Published: (2024)
LLMIA: An Out-of-the-Box Index Advisor via In-Context Learning with LLMs
by: Zhao, Xinxin, et al.
Published: (2025)
by: Zhao, Xinxin, et al.
Published: (2025)
Enhancing Knowledge Graph Completion with Entity Neighborhood and Relation Context
by: Chen, Jianfang, et al.
Published: (2025)
by: Chen, Jianfang, et al.
Published: (2025)
TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes
by: Zhang, Chao, et al.
Published: (2025)
by: Zhang, Chao, et al.
Published: (2025)
A2RAG: Adaptive Agentic Graph Retrieval for Cost-Aware and Reliable Reasoning
by: Liu, Jiate, et al.
Published: (2026)
by: Liu, Jiate, et al.
Published: (2026)
ARCADE: A Real-Time Data System for Hybrid and Continuous Query Processing across Diverse Data Modalities
by: Yang, Jingyi, et al.
Published: (2025)
by: Yang, Jingyi, et al.
Published: (2025)
PRISMA: Reinforcement Learning Guided Two-Stage Policy Optimization in Multi-Agent Architecture for Open-Domain Multi-Hop Question Answering
by: Liu, Yu, et al.
Published: (2026)
by: Liu, Yu, et al.
Published: (2026)
Beyond Relational: Semantic-Aware Multi-Modal Analytics with LLM-Native Query Optimization
by: Zhu, Junhao, et al.
Published: (2025)
by: Zhu, Junhao, et al.
Published: (2025)
Category-Aware Semantic Caching for Heterogeneous LLM Workloads
by: Wang, Chen, et al.
Published: (2025)
by: Wang, Chen, et al.
Published: (2025)
MINT: Multi-Vector Search Index Tuning
by: Zhu, Jiongli, et al.
Published: (2025)
by: Zhu, Jiongli, et al.
Published: (2025)
QDA-SQL: Questions Enhanced Dialogue Augmentation for Multi-Turn Text-to-SQL
by: Sun, Yinggang, et al.
Published: (2024)
by: Sun, Yinggang, et al.
Published: (2024)
Data Quality Awareness: A Journey from Traditional Data Management to Data Science Systems
by: Dong, Sijie, et al.
Published: (2024)
by: Dong, Sijie, et al.
Published: (2024)
STGformer: Efficient Spatiotemporal Graph Transformer for Traffic Forecasting
by: Wang, Hongjun, et al.
Published: (2024)
by: Wang, Hongjun, et al.
Published: (2024)
From Lossy to Verified: A Provenance-Aware Tiered Memory for Agents
by: Zhu, Qiming, et al.
Published: (2026)
by: Zhu, Qiming, et al.
Published: (2026)
PSM-SQL: Progressive Schema Learning with Multi-granularity Semantics for Text-to-SQL
by: Yang, Zhuopan, et al.
Published: (2025)
by: Yang, Zhuopan, et al.
Published: (2025)
CONCERTO: Complex Query Execution Mechanism-Aware Learned Cost Estimation
by: Zhang, Kaixin, et al.
Published: (2024)
by: Zhang, Kaixin, et al.
Published: (2024)
OneKE: A Dockerized Schema-Guided LLM Agent-based Knowledge Extraction System
by: Luo, Yujie, et al.
Published: (2024)
by: Luo, Yujie, et al.
Published: (2024)
CSI-Bench: A Large-Scale In-the-Wild Dataset for Multi-task WiFi Sensing
by: Zhu, Guozhen, et al.
Published: (2025)
by: Zhu, Guozhen, et al.
Published: (2025)
OCPM$^2$: Extending the Process Mining Methodology for Object-Centric Event Data Extraction
by: Miri, Najmeh, et al.
Published: (2025)
by: Miri, Najmeh, et al.
Published: (2025)
Chain-of-Query: Unleashing the Power of LLMs in SQL-Aided Table Understanding via Multi-Agent Collaboration
by: Sui, Songyuan, et al.
Published: (2025)
by: Sui, Songyuan, et al.
Published: (2025)
In-Context Adaptation to Concept Drift for Learned Database Operations
by: Zhu, Jiaqi, et al.
Published: (2025)
by: Zhu, Jiaqi, et al.
Published: (2025)
A Multi-Agent System for Semantic Mapping of Relational Data to Knowledge Graphs
by: Trajanoska, Milena, et al.
Published: (2025)
by: Trajanoska, Milena, et al.
Published: (2025)
Cardinality Estimation for High Dimensional Similarity Queries with Adaptive Bucket Probing
by: Chen, Zhonghan, et al.
Published: (2026)
by: Chen, Zhonghan, et al.
Published: (2026)
Schema-Aware Multi-Task Learning for Complex Text-to-SQL
by: Wu, Yangjun, et al.
Published: (2024)
by: Wu, Yangjun, et al.
Published: (2024)
Position: Foundation Models for Tabular Data within Systemic Contexts Need Grounding
by: Klein, Tassilo, et al.
Published: (2025)
by: Klein, Tassilo, et al.
Published: (2025)
Thucy: An LLM-based Multi-Agent System for Claim Verification across Relational Databases
by: Theologitis, Michael, et al.
Published: (2025)
by: Theologitis, Michael, et al.
Published: (2025)
Conflict Detection for Temporal Knowledge Graphs:A Fast Constraint Mining Algorithm and New Benchmarks
by: Chen, Jianhao, et al.
Published: (2023)
by: Chen, Jianhao, et al.
Published: (2023)
CTBench: Cryptocurrency Time Series Generation Benchmark
by: Ang, Yihao, et al.
Published: (2025)
by: Ang, Yihao, et al.
Published: (2025)
Xling: A Learned Filter Framework for Accelerating High-Dimensional Approximate Similarity Join
by: Wang, Yifan, et al.
Published: (2024)
by: Wang, Yifan, et al.
Published: (2024)
Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First
by: Liu, Shu, et al.
Published: (2025)
by: Liu, Shu, et al.
Published: (2025)
Similar Items
-
ScaleDoc: Scaling LLM-based Predicates over Large Document Collections
by: Zhang, Hengrui, et al.
Published: (2025) -
Invoice Information Extraction: Methods and Performance Evaluation
by: Yashwant, Sai, et al.
Published: (2025) -
Schema Lineage Extraction at Scale: Multilingual Pipelines, Composite Evaluation, and Language-Model Benchmarks
by: Yin, Jiaqi, et al.
Published: (2025) -
Towards Controllable Time Series Generation
by: Bao, Yifan, et al.
Published: (2024) -
Can Language Models Enable In-Context Database?
by: Pan, Yu, et al.
Published: (2024)