Saved in:
| Main Authors: | Cappuzzo, Riccardo, Coelho, Aimee, Lefebvre, Felix, Papotti, Paolo, Varoquaux, Gael |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.06282 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning
by: Lefebvre, Félix, et al.
Published: (2025)
by: Lefebvre, Félix, et al.
Published: (2025)
Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL
by: Papicchio, Simone, et al.
Published: (2025)
by: Papicchio, Simone, et al.
Published: (2025)
Table Foundation Models: on knowledge pre-training for tabular learning
by: Kim, Myung Jun, et al.
Published: (2025)
by: Kim, Myung Jun, et al.
Published: (2025)
The Stretto Execution Engine for LLM-Augmented Data Systems
by: Sanmartino, Gabriele, et al.
Published: (2026)
by: Sanmartino, Gabriele, et al.
Published: (2026)
Leveraging Approximate Caching for Faster Retrieval-Augmented Generation
by: Bergman, Shai, et al.
Published: (2025)
by: Bergman, Shai, et al.
Published: (2025)
Graph-Based Feature Augmentation for Predictive Tasks on Relational Datasets
by: Qiao, Lianpeng, et al.
Published: (2025)
by: Qiao, Lianpeng, et al.
Published: (2025)
FeatNavigator: Automatic Feature Augmentation on Tabular Data
by: Liang, Jiaming, et al.
Published: (2024)
by: Liang, Jiaming, et al.
Published: (2024)
Pruning Minimal Reasoning Graphs for Efficient Retrieval-Augmented Generation
by: Wang, Ning, et al.
Published: (2026)
by: Wang, Ning, et al.
Published: (2026)
Synthesize, Retrieve, and Propagate: A Unified Predictive Modeling Framework for Relational Databases
by: Li, Ning, et al.
Published: (2025)
by: Li, Ning, et al.
Published: (2025)
GRAM: Generative Retrieval Augmented Matching of Data Schemas in the Context of Data Security
by: Liu, Xuanqing, et al.
Published: (2024)
by: Liu, Xuanqing, et al.
Published: (2024)
FeatAug: Automatic Feature Augmentation From One-to-Many Relationship Tables
by: Qi, Danrui, et al.
Published: (2024)
by: Qi, Danrui, et al.
Published: (2024)
MedPix 2.0: A Comprehensive Multimodal Biomedical Data set for Advanced AI Applications with Retrieval Augmented Generation and Knowledge Graphs
by: Siragusa, Irene, et al.
Published: (2024)
by: Siragusa, Irene, et al.
Published: (2024)
RelationalFactQA: A Benchmark for Evaluating Tabular Fact Retrieval from Large Language Models
by: Satriani, Dario, et al.
Published: (2025)
by: Satriani, Dario, et al.
Published: (2025)
Imputation for prediction: beware of diminishing returns
by: Morvan, Marine Le, et al.
Published: (2024)
by: Morvan, Marine Le, et al.
Published: (2024)
Hippasus: Effective and Efficient Automatic Feature Augmentation for Machine Learning Tasks on Relational Data
by: Papadias, Serafeim, et al.
Published: (2026)
by: Papadias, Serafeim, et al.
Published: (2026)
Predictive Query-based Pipeline for Graph Data
by: Neto, Plácido A Souza
Published: (2024)
by: Neto, Plácido A Souza
Published: (2024)
Table Integration in Data Lakes Unleashed: Pairwise Integrability Judgment, Integrable Set Discovery, and Multi-Tuple Conflict Resolution
by: Ji, Daomin, et al.
Published: (2024)
by: Ji, Daomin, et al.
Published: (2024)
Combining the Strengths of Dutch Survey and Register Data in a Data Challenge to Predict Fertility (PreFer)
by: Sivak, Elizaveta, et al.
Published: (2024)
by: Sivak, Elizaveta, et al.
Published: (2024)
Efficient and Reproducible Biomedical Question Answering using Retrieval Augmented Generation
by: Stuhlmann, Linus, et al.
Published: (2025)
by: Stuhlmann, Linus, et al.
Published: (2025)
TabSketchFM: Sketch-based Tabular Representation Learning for Data Discovery over Data Lakes
by: Khatiwada, Aamod, et al.
Published: (2024)
by: Khatiwada, Aamod, et al.
Published: (2024)
SQL-GEN: Bridging the Dialect Gap for Text-to-SQL Via Synthetic Data And Model Merging
by: Pourreza, Mohammadreza, et al.
Published: (2024)
by: Pourreza, Mohammadreza, et al.
Published: (2024)
Towards Pattern-aware Data Augmentation for Temporal Knowledge Graph Completion
by: Zhang, Jiasheng, et al.
Published: (2024)
by: Zhang, Jiasheng, et al.
Published: (2024)
Relational Database Distillation: From Structured Tables to Condensed Graph Data
by: Gao, Xinyi, et al.
Published: (2025)
by: Gao, Xinyi, et al.
Published: (2025)
A Unified Replay-based Continuous Learning Framework for Spatio-Temporal Prediction on Streaming Data
by: Miao, Hao, et al.
Published: (2024)
by: Miao, Hao, et al.
Published: (2024)
MechDetect: Detecting Data-Dependent Errors
by: Jung, Philipp, et al.
Published: (2025)
by: Jung, Philipp, et al.
Published: (2025)
Tabular Data Augmentation for Machine Learning: Progress and Prospects of Embracing Generative AI
by: Cui, Lingxi, et al.
Published: (2024)
by: Cui, Lingxi, et al.
Published: (2024)
DobLIX: A Dual-Objective Learned Index for Log-Structured Merge Trees
by: Heidari, Alireza, et al.
Published: (2025)
by: Heidari, Alireza, et al.
Published: (2025)
CARTE: Pretraining and Transfer for Tabular Learning
by: Kim, Myung Jun, et al.
Published: (2024)
by: Kim, Myung Jun, et al.
Published: (2024)
Pre-Execution Query Slot-Time Prediction in Cloud Data Warehouses: A Feature-Scoped Machine Learning Approach
by: Pathak, Prashant Kumar
Published: (2026)
by: Pathak, Prashant Kumar
Published: (2026)
Chameleon: Foundation Models for Fairness-aware Multi-modal Data Augmentation to Enhance Coverage of Minorities
by: Erfanian, Mahdi, et al.
Published: (2024)
by: Erfanian, Mahdi, et al.
Published: (2024)
TabICL: A Tabular Foundation Model for In-Context Learning on Large Data
by: Qu, Jingang, et al.
Published: (2025)
by: Qu, Jingang, et al.
Published: (2025)
HedraRAG: Coordinating LLM Generation and Database Retrieval in Heterogeneous RAG Serving
by: Hu, Zhengding, et al.
Published: (2025)
by: Hu, Zhengding, et al.
Published: (2025)
Generating the Traces You Need: A Conditional Generative Model for Process Mining Data
by: Graziosi, Riccardo, et al.
Published: (2024)
by: Graziosi, Riccardo, et al.
Published: (2024)
Database Entity Recognition with Data Augmentation and Deep Learning
by: Fu, Zikun, et al.
Published: (2025)
by: Fu, Zikun, et al.
Published: (2025)
Something's Fishy In The Data Lake: A Critical Re-evaluation of Table Union Search Benchmarks
by: Boutaleb, Allaa, et al.
Published: (2025)
by: Boutaleb, Allaa, et al.
Published: (2025)
ReCellTy: Domain-Specific Knowledge Graph Retrieval-Augmented LLMs Reasoning Workflow for Single-Cell Annotation
by: Han, Dezheng, et al.
Published: (2025)
by: Han, Dezheng, et al.
Published: (2025)
Data Augmentation in Graph Neural Networks: The Role of Generated Synthetic Graphs
by: Bas, Sumeyye, et al.
Published: (2024)
by: Bas, Sumeyye, et al.
Published: (2024)
Data Driven Decision Making with Time Series and Spatio-temporal Data
by: Yang, Bin, et al.
Published: (2025)
by: Yang, Bin, et al.
Published: (2025)
In-Database Data Imputation
by: Perini, Massimo, et al.
Published: (2024)
by: Perini, Massimo, et al.
Published: (2024)
Aegis: A Correlation-Based Data Masking Advisor for Data Sharing Ecosystems
by: Laskar, Omar Islam, et al.
Published: (2025)
by: Laskar, Omar Islam, et al.
Published: (2025)
Similar Items
-
Scalable Feature Learning on Huge Knowledge Graphs for Downstream Machine Learning
by: Lefebvre, Félix, et al.
Published: (2025) -
Think2SQL: Reinforce LLM Reasoning Capabilities for Text2SQL
by: Papicchio, Simone, et al.
Published: (2025) -
Table Foundation Models: on knowledge pre-training for tabular learning
by: Kim, Myung Jun, et al.
Published: (2025) -
The Stretto Execution Engine for LLM-Augmented Data Systems
by: Sanmartino, Gabriele, et al.
Published: (2026) -
Leveraging Approximate Caching for Faster Retrieval-Augmented Generation
by: Bergman, Shai, et al.
Published: (2025)