| Main Authors: | Wang, Mengying, Ma, Hanchao, Bian, Yiyang, Fan, Yangxin, Wu, Yinghui |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.11262 |
Similar Items
Graph Query Generation with Constraint-guided Large Language Agents
by: Wang, Mengying, et al.
Published: (2026)
ML-Asset Management: Curation, Discovery, and Utilization
by: Wang, Mengying, et al.
Published: (2025)
Interpreting Graph Inference with Skyline Explanations
by: Qiu, Dazhuo, et al.
Published: (2025)
DeepAnalyze: Agentic Large Language Models for Autonomous Data Science
by: Zhang, Shaolei, et al.
Published: (2025)
FlexiDataGen: An Adaptive LLM Framework for Dynamic Semantic Dataset Generation in Sensitive Domains
by: Jelodar, Hamed, et al.
Published: (2025)
Conceptual Schema Inference for Tabular Datasets using Large Language Models
by: Wu, Zhenyu, et al.
Published: (2025)
Data Quality Awareness: A Journey from Traditional Data Management to Data Science Systems
by: Dong, Sijie, et al.
Published: (2024)
Enhancing Crash Frequency Modeling Based on Augmented Multi-Type Data by Hybrid VAE-Diffusion-Based Generative Neural Networks
by: Chen, Junlan, et al.
Published: (2025)
Towards Automated Data Sciences with Natural Language and SageCopilot: Practices and Lessons Learned
by: Liao, Yuan, et al.
Published: (2024)
Generating Robust Counterfactual Witnesses for Graph Neural Networks
by: Qiu, Dazhuo, et al.
Published: (2024)
Compliance Rating Scheme: A Data Provenance Framework for Generative AI Datasets
by: Bohacek, Matyas, et al.
Published: (2025)
NFDI4DSO: Towards a BFO Compliant Ontology for Data Science
by: Gesese, Genet Asefa, et al.
Published: (2024)
Data Science: a Natural Ecosystem
by: Porcu, Emilio, et al.
Published: (2025)
Efficient Dynamic Attributed Graph Generation
by: Li, Fan, et al.
Published: (2024)
AgenticData: An Agentic Data Analytics System for Heterogeneous Data
by: Sun, Ji, et al.
Published: (2025)
LEDD: Large Language Model-Empowered Data Discovery in Data Lakes
by: An, Qi, et al.
Published: (2025)
The FormAI Dataset: Generative AI in Software Security Through the Lens of Formal Verification
by: Tihanyi, Norbert, et al.
Published: (2023)
TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes
by: Zhang, Chao, et al.
Published: (2025)
LLM/Agent-as-Data-Analyst: A Survey
by: Tang, Zirui, et al.
Published: (2025)
Powering In-Database Dynamic Model Slicing for Structured Data Analytics
by: Zeng, Lingze, et al.
Published: (2024)
EPIC: Generative AI Platform for Accelerating HPC Operational Data Analytics
by: Karimi, Ahmad Maroof, et al.
Published: (2025)
AI-Driven Generation of Data Contracts in Modern Data Engineering Systems
by: Bhoite, Harshraj
Published: (2025)
Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query Systems for Multi-Modal Data
by: Li, Ruyu, et al.
Published: (2026)
Towards Automated Cross-domain Exploratory Data Analysis through Large Language Models
by: Zhu, Jun-Peng, et al.
Published: (2024)
Searching Clinical Data Using Generative AI
by: Hanswadkar, Karan, et al.
Published: (2025)
Metasql: A Generate-then-Rank Framework for Natural Language to SQL Translation
by: Fan, Yuankai, et al.
Published: (2024)
Exploiting Formal Concept Analysis for Data Modeling in Data Lakes
by: Bendimerad, Anes, et al.
Published: (2024)
Evaluating LLMs for Text-to-SQL Generation With Complex SQL Workload
by: Ma, Limin, et al.
Published: (2024)
LaDe: The First Comprehensive Last-mile Delivery Dataset from Industry
by: Wu, Lixia, et al.
Published: (2023)
Large Language Models as Data Preprocessors
by: Zhang, Haochen, et al.
Published: (2023)
A Survey of Data Agents: Emerging Paradigm or Overstated Hype?
by: Zhu, Yizhang, et al.
Published: (2025)
Generating the Traces You Need: A Conditional Generative Model for Process Mining Data
by: Graziosi, Riccardo, et al.
Published: (2024)
RetClean: Retrieval-Based Data Cleaning Using Foundation Models and Data Lakes
by: Naeem, Zan Ahmad, et al.
Published: (2023)
LaPuda: LLM-Enabled Policy-Based Query Optimizer for Multi-modal Data
by: Wang, Yifan, et al.
Published: (2024)
Robo-DM: Data Management For Large Robot Datasets
by: Chen, Kaiyuan, et al.
Published: (2025)
SING-SQL: A Synthetic Data Generation Framework for In-Domain Text-to-SQL Translation
by: Caferoğlu, Hasan Alp, et al.
Published: (2025)
CoddLLM: Empowering Large Language Models for Data Analytics
by: Zhang, Jiani, et al.
Published: (2025)
Quality Assessment of Tabular Data using Large Language Models and Code Generation
by: Akella, Ashlesha, et al.
Published: (2025)
CubeGraph: Efficient Retrieval-Augmented Generation for Spatial and Temporal Data
by: Yang, Mingyu, et al.
Published: (2026)
Towards Improving Interpretability of Language Model Generation through a Structured Knowledge Discovery Approach
by: Liu, Shuqi, et al.
Published: (2025)