Saved in:
| Main Authors: | Mahadevan, Ananth, Mathioudakis, Michael, Mäkelä, Eetu, Tolonen, Mikko |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2401.07290 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke
by: Wu, Yu, et al.
Published: (2026)
by: Wu, Yu, et al.
Published: (2026)
Error Patterns in Historical OCR: A Comparative Analysis of TrOCR and a Vision-Language Model
by: Vesalainen, Ari, et al.
Published: (2026)
by: Vesalainen, Ari, et al.
Published: (2026)
Detecting Latin in Historical Books with Large Language Models: A Multimodal Benchmark
by: Wu, Yu, et al.
Published: (2025)
by: Wu, Yu, et al.
Published: (2025)
WaZI: A Learned and Workload-aware Z-Index
by: Pai, Sachith, et al.
Published: (2023)
by: Pai, Sachith, et al.
Published: (2023)
CSQL: Mapping Documents into Causal Databases
by: Mahadevan, Sridhar
Published: (2026)
by: Mahadevan, Sridhar
Published: (2026)
Optimizing Big Active Data Management Systems
by: Shirazi, Shahrzad Haji Amin, et al.
Published: (2024)
by: Shirazi, Shahrzad Haji Amin, et al.
Published: (2024)
Enabling the Reuse of Personal Data in Research: A Classification Model for Legal Compliance
by: Noguera, Eduard Mata i, et al.
Published: (2025)
by: Noguera, Eduard Mata i, et al.
Published: (2025)
EmpireDB: Data System to Accelerate Computational Sciences
by: Alabi, Daniel, et al.
Published: (2024)
by: Alabi, Daniel, et al.
Published: (2024)
The State of Scientific Poster Sharing and Reuse
by: Gasimova, Aydan, et al.
Published: (2026)
by: Gasimova, Aydan, et al.
Published: (2026)
Energy Profiling of Data-Sharing Pipelines: Modeling, Estimation, and Reuse Strategies
by: Masoudi, Sepideh, et al.
Published: (2025)
by: Masoudi, Sepideh, et al.
Published: (2025)
Dataversifying Natural Sciences: Pioneering a Data Lake Architecture for Curated Data-Centric Experiments in Life \& Earth Sciences
by: Vargas-Solar, Genoveva, et al.
Published: (2024)
by: Vargas-Solar, Genoveva, et al.
Published: (2024)
Understanding and Reusing Test Suites Across Database Systems
by: Zhong, Suyang, et al.
Published: (2024)
by: Zhong, Suyang, et al.
Published: (2024)
Data Quality Awareness: A Journey from Traditional Data Management to Data Science Systems
by: Dong, Sijie, et al.
Published: (2024)
by: Dong, Sijie, et al.
Published: (2024)
MLego: Interactive and Scalable Topic Exploration Through Model Reuse
by: Ye, Fei, et al.
Published: (2025)
by: Ye, Fei, et al.
Published: (2025)
HI-SQL: Optimizing Text-to-SQL Systems through Dynamic Hint Integration
by: Parab, Ganesh, et al.
Published: (2025)
by: Parab, Ganesh, et al.
Published: (2025)
Optimizing Relational Queries over Array-Valued Data in Columnar Systems
by: Zeblah, Maroua, et al.
Published: (2026)
by: Zeblah, Maroua, et al.
Published: (2026)
Local Shapley: Model-Induced Locality and Optimal Reuse in Data Valuation
by: Yang, Xuan, et al.
Published: (2026)
by: Yang, Xuan, et al.
Published: (2026)
Learning Lineage Constraints for Data Science Operations
by: Zhao, Jinjin
Published: (2025)
by: Zhao, Jinjin
Published: (2025)
Query Optimization Beyond Data Systems: The Case for Multi-Agent Systems
by: Kaoudi, Zoi, et al.
Published: (2025)
by: Kaoudi, Zoi, et al.
Published: (2025)
Enterprise Data Science Platform: A Unified Architecture for Federated Data Access
by: Miyamoto, Ryoto, et al.
Published: (2025)
by: Miyamoto, Ryoto, et al.
Published: (2025)
LDI: Localized Data Imputation for Text-Rich Tables
by: Omidvartehrani, Soroush, et al.
Published: (2025)
by: Omidvartehrani, Soroush, et al.
Published: (2025)
Experiversum: an Ecosystem for Curating and Enhancing Data-Driven Experimental Science
by: Vargas-Solar, Genoveva, et al.
Published: (2025)
by: Vargas-Solar, Genoveva, et al.
Published: (2025)
Data Science: a Natural Ecosystem
by: Porcu, Emilio, et al.
Published: (2025)
by: Porcu, Emilio, et al.
Published: (2025)
Optimizing Data Lakes' Queries
by: Gregory, et al.
Published: (2025)
by: Gregory, et al.
Published: (2025)
HEXGEN-FLOW: Optimizing LLM Inference Request Scheduling for Agentic Text-to-SQL
by: Peng, You, et al.
Published: (2025)
by: Peng, You, et al.
Published: (2025)
Same Data, Different Schemas: Robustness of LLM-based Text-to-SQL
by: Kanchinadam, Nitin, et al.
Published: (2026)
by: Kanchinadam, Nitin, et al.
Published: (2026)
Generating Skyline Datasets for Data Science Models
by: Wang, Mengying, et al.
Published: (2025)
by: Wang, Mengying, et al.
Published: (2025)
Towards Responsible and Fair Data Science: Resource Allocation for Inclusive and Sustainable Analytics
by: Vargas-Solar, Genoveva
Published: (2025)
by: Vargas-Solar, Genoveva
Published: (2025)
Data and Text Processing for Health and Life Sciences
by: Couto, Francisco M.
Published: (2020)
by: Couto, Francisco M.
Published: (2020)
Enabling Data Dependency-based Query Optimization
by: Lindner, Daniel, et al.
Published: (2024)
by: Lindner, Daniel, et al.
Published: (2024)
Modeling and Optimization for Massive Data Allocation in Database
by: Niu, Panpan, et al.
Published: (2026)
by: Niu, Panpan, et al.
Published: (2026)
Data-CASE: Grounding Data Regulations for Compliant Data Processing Systems
by: Chakraborty, Vishal, et al.
Published: (2023)
by: Chakraborty, Vishal, et al.
Published: (2023)
A Description of the Text Data Base System TDBS. Stockholm Papers in Library and Information Science.
by: Lofstrom, Mats
Published: (1982)
by: Lofstrom, Mats
Published: (1982)
Evaluating the Data Model Robustness of Text-to-SQL Systems Based on Real User Queries
by: Fürst, Jonathan, et al.
Published: (2024)
by: Fürst, Jonathan, et al.
Published: (2024)
NFDI4DSO: Towards a BFO Compliant Ontology for Data Science
by: Gesese, Genet Asefa, et al.
Published: (2024)
by: Gesese, Genet Asefa, et al.
Published: (2024)
SynSQL: Synthesizing Relational Databases for Robust Evaluation of Text-to-SQL Systems
by: Habibollah, Mohammadamin, et al.
Published: (2026)
by: Habibollah, Mohammadamin, et al.
Published: (2026)
A Datalake for Data-driven Social Science Research
by: Arya, Puneet, et al.
Published: (2025)
by: Arya, Puneet, et al.
Published: (2025)
QUEST: Query Optimization in Unstructured Document Analysis
by: Sun, Zhaoze, et al.
Published: (2025)
by: Sun, Zhaoze, et al.
Published: (2025)
Optimizing Dataflow Systems for Scalable Interactive Visualization
by: Yang, Junran, et al.
Published: (2024)
by: Yang, Junran, et al.
Published: (2024)
Text2SQL-Flow: A Robust SQL-Aware Data Augmentation Framework for Text-to-SQL
by: Cai, Qifeng, et al.
Published: (2025)
by: Cai, Qifeng, et al.
Published: (2025)
Similar Items
-
Matching Meaning at Scale: Evaluating Semantic Search for 18th-Century Intellectual History through the Case of Locke
by: Wu, Yu, et al.
Published: (2026) -
Error Patterns in Historical OCR: A Comparative Analysis of TrOCR and a Vision-Language Model
by: Vesalainen, Ari, et al.
Published: (2026) -
Detecting Latin in Historical Books with Large Language Models: A Multimodal Benchmark
by: Wu, Yu, et al.
Published: (2025) -
WaZI: A Learned and Workload-aware Z-Index
by: Pai, Sachith, et al.
Published: (2023) -
CSQL: Mapping Documents into Causal Databases
by: Mahadevan, Sridhar
Published: (2026)