| Main Authors: | Rucco, Chiara, Longo, Antonella, Saad, Motaz |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2503.16079 |
Similar Items
Formalizing ETLT and ELTL Design Patterns and Proposing Enhanced Variants: A Systematic Framework for Modern Data Engineering
by: Rucco, Chiara, et al.
Published: (2025)
Designing Data Spaces: Navigating the European Initiatives Along Technical Specifications
by: Martella, Angelo, et al.
Published: (2025)
MatrixGate: A High-performance Data Ingestion Tool for Time-series Databases
by: Wang, Shuhui, et al.
Published: (2024)
CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion
by: Zeng, Xianzhi, et al.
Published: (2024)
Automatic String Data Validation with Pattern Discovery
by: Lin, Xinwei, et al.
Published: (2024)
RADx Data Hub: A Cloud Platform for FAIR, Harmonized COVID-19 Data
by: Martinez-Romero, Marcos, et al.
Published: (2025)
Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD -- Extended Version
by: Yu, Geoffrey X., et al.
Published: (2024)
Intra-Query Runtime Elasticity for Cloud-Native Data Analysis
by: Zhang, Xukang, et al.
Published: (2025)
Skyrise: Exploiting Serverless Cloud Infrastructure for Elastic Data Processing
by: Bodner, Thomas, et al.
Published: (2025)
ByteHouse: ByteDance's Cloud-Native Data Warehouse for Real-Time Multimodal Data Analytics
by: Han, Yuxing, et al.
Published: (2026)
SQLAgent: Learning to Explore Before Generating as a Data Engineer
by: Jiang, Wenjia, et al.
Published: (2026)
Unveiling Challenges for LLMs in Enterprise Data Engineering
by: Bodensohn, Jan-Micha, et al.
Published: (2025)
Towards Next Generation Data Engineering Pipelines
by: Kramer, Kevin M., et al.
Published: (2025)
Designing a Secure, Scalable, and Cost-Effective Cloud Storage Solution: A Novel Approach to Data Management using NextCloud, TrueNAS, and QEMU/KVM
by: Aryan, Prakash, et al.
Published: (2024)
An Empirical Evaluation of Serverless Cloud Infrastructure for Large-Scale Data Processing
by: Bodner, Thomas, et al.
Published: (2025)
Brame: Hierarchical Data Management Framework for Cloud-Edge-Device Collaboration
by: Liu, Xianglong, et al.
Published: (2025)
SafeLoad: Efficient Admission Control Framework for Identifying Memory-Overloading Queries in Cloud Data Warehouses
by: Wu, Yifan, et al.
Published: (2026)
AI-Driven Generation of Data Contracts in Modern Data Engineering Systems
by: Bhoite, Harshraj
Published: (2025)
Evaluating Continuous Basic Graph Patterns over Dynamic Link Data Graphs
by: Gergatsoulis, Manolis, et al.
Published: (2022)
A Comprehensive Scalable Framework for Cloud-Native Pattern Detection with Enhanced Expressiveness
by: Mavroudopoulos, Ioannis, et al.
Published: (2024)
TCDRM: A Tenant Budget-Aware Data Replication Framework for Multi-Cloud Computing
by: Bernardin, Santatra Hagamalala, et al.
Published: (2025)
Proposal for a National Serials Data System.
by: Adams, Scott
Published: (1969)
Task Cascades for Efficient Unstructured Data Processing
by: Shankar, Shreya, et al.
Published: (2026)
FREYJA: Efficient Join Discovery in Data Lakes
by: Maynou, Marc, et al.
Published: (2024)
A Database Engineered System for Big Data Analytics on Tornado Climatology
by: Bian, Fengfan, et al.
Published: (2024)
Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores
by: Petrescu, Diana, et al.
Published: (2022)
Research on the efficiency of data loading and storage in Data Lakehouse architectures for the formation of analytical data systems
by: Borodii, Ivan, et al.
Published: (2026)
HistogramTools for Efficient Data Analysis and Distribution Representation in Large Data Sets
by: Malhotra, Shubham
Published: (2025)
LASER: A Data-Centric Method for Low-Cost and Efficient SQL Rewriting based on SQL-GRPO
by: Li, Jiahui, et al.
Published: (2026)
FairDAG: Consensus Fairness over Multi-Proposer Causal Design
by: Kang, Dakai, et al.
Published: (2025)
Efficient Data Valuation Approximation in Federated Learning: A Sampling-based Approach
by: Wei, Shuyue, et al.
Published: (2025)
Efficient Mining of Low-Utility Sequential Patterns
by: Zhu, Jian, et al.
Published: (2025)
Enzyme: Incremental View Maintenance for Data Engineering
by: Yadav, Ritwik, et al.
Published: (2026)
Spezi Data Pipeline: Streamlining FHIR-based Interoperable Digital Health Data Workflows
by: Bikia, Vasiliki, et al.
Published: (2025)
Prompt Engineering Techniques for Context-dependent Text-to-SQL in Arabic
by: Almohaimeed, Saleh, et al.
Published: (2025)
Validating Temporal Compliance Patterns: A Unified Approach with $MTL_f$ over various Data Models
by: Zaki, Nesma M., et al.
Published: (2024)
Toward a Cognitive Data Model: Exploring a Mind-Inspired Approach to Database Design
by: Pieris, Dhammika
Published: (2025)
Push Down Optimization for Distributed Multi Cloud Data Integration
by: Kodali, Ravi Kiran, et al.
Published: (2026)
Utility-based Privacy Preserving Data Mining
by: Zhou, Qingfeng, et al.
Published: (2025)
Enabling Data Dependency-based Query Optimization
by: Lindner, Daniel, et al.
Published: (2024)