:: Library Catalog

Bibliographic Details
Main Authors:	Lei, Hongqin, Tang, Haowei, Zhang, Zhe
Format:	Preprint
Published:	2025
Subjects:	Databases
Online Access:	https://arxiv.org/abs/2508.06077

Similar Items

LakeHopper: Cross Data Lakes Column Type Annotation through Model Adaptation
by: Sun, Yushi, et al.
Published: (2026)

Cross-domain-aware Worker Selection with Training for Crowdsourced Annotation
by: Sun, Yushi, et al.
Published: (2024)

GRACE: A Dynamic Coreset Selection Framework for Large Language Model Optimization
by: Tang, Tianhao, et al.
Published: (2026)

ABase: the Multi-Tenant NoSQL Serverless Database for Diverse and Dynamic Workloads in Large-scale Cloud Environments
by: Kang, Rong, et al.
Published: (2025)

StraTyper: Automated Semantic Type Discovery and Multi-Type Annotation for Dataset Collections
by: Koutras, Christos, et al.
Published: (2026)

VIDEX: A Disaggregated and Extensible Virtual Index for the Cloud and AI Era
by: Kang, Rong, et al.
Published: (2025)

Descriptor: Multi-Regional Cloud Honeypot Dataset (MURHCAD)
by: Feito-Casares, Enrique, et al.
Published: (2026)

OSM+: Billion-Level OpenStreetMap Dataset for City-wide Experiments
by: Zheng, Guanjie, et al.
Published: (2025)

LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation
by: Teng, Fei, et al.
Published: (2025)

Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores
by: Petrescu, Diana, et al.
Published: (2022)

LSMGraph: A High-Performance Dynamic Graph Storage System with Multi-Level CSR
by: Yu, Song, et al.
Published: (2024)

UniEntrezDB: Large-scale Gene Ontology Annotation Dataset and Evaluation Benchmarks with Unified Entrez Gene Identifiers
by: Miao, Yuwei, et al.
Published: (2024)

A Survey on Open Dataset Search in the LLM Era: Retrospectives and Perspectives
by: Li, Pengyue, et al.
Published: (2025)

DPCD: A Quality Assessment Database for Dynamic Point Clouds
by: Liu, Yating, et al.
Published: (2025)

Detecting Dynamic Relationships in Object-Centric Event Logs
by: Gianola, Alessandro, et al.
Published: (2026)

Automatic Configuration Tuning on Cloud Database: A Survey
by: Zhang, Limeng, et al.
Published: (2024)

Stateful Entities: Object-oriented Cloud Applications as Distributed Dataflows
by: Psarakis, Kyriakos, et al.
Published: (2021)

Dirigo: A Method to Extract Event Logs for Object-Centric Processes
by: Wei, Jia, et al.
Published: (2024)

Data Agents: Levels, State of the Art, and Open Problems
by: Luo, Yuyu, et al.
Published: (2026)

MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
by: Lin, Teng, et al.
Published: (2025)

Dynamic and Scalable Data Preparation for Object-Centric Process Mining
by: Bosmans, Lien, et al.
Published: (2024)

DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the Cloud
by: Wang, Qinlong, et al.
Published: (2023)

Orchestration for Domain-specific Edge-Cloud Language Models
by: Patidar, Prasoon, et al.
Published: (2025)

Couler: Unified Machine Learning Workflow Optimization in Cloud
by: Wang, Xiaoda, et al.
Published: (2024)

Intra-Query Runtime Elasticity for Cloud-Native Data Analysis
by: Zhang, Xukang, et al.
Published: (2025)

Efficient Cloud-edge Collaborative Approaches to SPARQL Queries over Large RDF graphs
by: Ma, Shidan, et al.
Published: (2026)

ConStruM: A Structure-Guided LLM Framework for Context-Aware Schema Matching
by: Chen, Houming, et al.
Published: (2026)

LAKEGEN: A LLM-based Tabular Corpus Generator for Evaluating Dataset Discovery in Data Lakes
by: Dai, Zhenwei, et al.
Published: (2025)

High Throughput Shortest Distance Query Processing on Large Dynamic Road Networks
by: Zhou, Xinjie, et al.
Published: (2024)

Distributed Processing of kNN Queries over Moving Objects on Dynamic Road Networks
by: Tao, Mingjin, et al.
Published: (2025)

AutoDDG: Automated Dataset Description Generation using Large Language Models
by: Zhang, Haoxiang, et al.
Published: (2025)

A Universal Scheme for Dynamic Partitioned Shortest Path Index: Survey, Improvement, and Experiments
by: Zhang, Mengxuan, et al.
Published: (2023)

Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD -- Extended Version
by: Yu, Geoffrey X., et al.
Published: (2024)

Retrieve-and-Verify: A Table Context Selection Framework for Accurate Column Annotations
by: Ding, Zhihao, et al.
Published: (2025)

Gamma Acyclicity, Annotated Relations, and Consistency Witness Functions
by: Atserias, Albert, et al.
Published: (2025)

PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking
by: Zhou, Yan, et al.
Published: (2025)

Towards Cross-Model Efficiency in SQL/PGQ
by: Rotschield, Hadar, et al.
Published: (2025)

Cloud-Native Vector Search: A Comprehensive Performance Analysis
by: Li, Zhaoheng, et al.
Published: (2025)

OpenGLT: A Comprehensive Benchmark of Graph Neural Networks for Graph-Level Tasks
by: Li, Haoyang, et al.
Published: (2025)

Saving Money for Analytical Workloads in the Cloud
by: Srivastava, Tapan, et al.
Published: (2024)