| Main Authors: | Lei, Hongqin, Tang, Haowei, Zhang, Zhe |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.06077 |
Similar Items
LakeHopper: Cross Data Lakes Column Type Annotation through Model Adaptation
by: Sun, Yushi, et al.
Published: (2026)
Cross-domain-aware Worker Selection with Training for Crowdsourced Annotation
by: Sun, Yushi, et al.
Published: (2024)
GRACE: A Dynamic Coreset Selection Framework for Large Language Model Optimization
by: Tang, Tianhao, et al.
Published: (2026)
ABase: the Multi-Tenant NoSQL Serverless Database for Diverse and Dynamic Workloads in Large-scale Cloud Environments
by: Kang, Rong, et al.
Published: (2025)
StraTyper: Automated Semantic Type Discovery and Multi-Type Annotation for Dataset Collections
by: Koutras, Christos, et al.
Published: (2026)
VIDEX: A Disaggregated and Extensible Virtual Index for the Cloud and AI Era
by: Kang, Rong, et al.
Published: (2025)
Descriptor: Multi-Regional Cloud Honeypot Dataset (MURHCAD)
by: Feito-Casares, Enrique, et al.
Published: (2026)
OSM+: Billion-Level OpenStreetMap Dataset for City-wide Experiments
by: Zheng, Guanjie, et al.
Published: (2025)
LLMLog: Advanced Log Template Generation via LLM-driven Multi-Round Annotation
by: Teng, Fei, et al.
Published: (2025)
Accelerating Transfer Learning with Near-Data Computation on Cloud Object Stores
by: Petrescu, Diana, et al.
Published: (2022)
LSMGraph: A High-Performance Dynamic Graph Storage System with Multi-Level CSR
by: Yu, Song, et al.
Published: (2024)
UniEntrezDB: Large-scale Gene Ontology Annotation Dataset and Evaluation Benchmarks with Unified Entrez Gene Identifiers
by: Miao, Yuwei, et al.
Published: (2024)
A Survey on Open Dataset Search in the LLM Era: Retrospectives and Perspectives
by: Li, Pengyue, et al.
Published: (2025)
DPCD: A Quality Assessment Database for Dynamic Point Clouds
by: Liu, Yating, et al.
Published: (2025)
Detecting Dynamic Relationships in Object-Centric Event Logs
by: Gianola, Alessandro, et al.
Published: (2026)
Automatic Configuration Tuning on Cloud Database: A Survey
by: Zhang, Limeng, et al.
Published: (2024)
Stateful Entities: Object-oriented Cloud Applications as Distributed Dataflows
by: Psarakis, Kyriakos, et al.
Published: (2021)
$\textit{Dirigo}$: A Method to Extract Event Logs for Object-Centric Processes
by: Wei, Jia, et al.
Published: (2024)
Data Agents: Levels, State of the Art, and Open Problems
by: Luo, Yuyu, et al.
Published: (2026)
MEBench: Benchmarking Large Language Models for Cross-Document Multi-Entity Question Answering
by: Lin, Teng, et al.
Published: (2025)
Dynamic and Scalable Data Preparation for Object-Centric Process Mining
by: Bosmans, Lien, et al.
Published: (2024)
DLRover-RM: Resource Optimization for Deep Recommendation Models Training in the Cloud
by: Wang, Qinlong, et al.
Published: (2023)
Orchestration for Domain-specific Edge-Cloud Language Models
by: Patidar, Prasoon, et al.
Published: (2025)
Couler: Unified Machine Learning Workflow Optimization in Cloud
by: Wang, Xiaoda, et al.
Published: (2024)
Intra-Query Runtime Elasticity for Cloud-Native Data Analysis
by: Zhang, Xukang, et al.
Published: (2025)
Efficient Cloud-edge Collaborative Approaches to SPARQL Queries over Large RDF graphs
by: Ma, Shidan, et al.
Published: (2026)
ConStruM: A Structure-Guided LLM Framework for Context-Aware Schema Matching
by: Chen, Houming, et al.
Published: (2026)
LAKEGEN: A LLM-based Tabular Corpus Generator for Evaluating Dataset Discovery in Data Lakes
by: Dai, Zhenwei, et al.
Published: (2025)
High Throughput Shortest Distance Query Processing on Large Dynamic Road Networks
by: Zhou, Xinjie, et al.
Published: (2024)
Distributed Processing of kNN Queries over Moving Objects on Dynamic Road Networks
by: Tao, Mingjin, et al.
Published: (2025)
AutoDDG: Automated Dataset Description Generation using Large Language Models
by: Zhang, Haoxiang, et al.
Published: (2025)
A Universal Scheme for Dynamic Partitioned Shortest Path Index: Survey, Improvement, and Experiments
by: Zhang, Mengxuan, et al.
Published: (2023)
Blueprinting the Cloud: Unifying and Automatically Optimizing Cloud Data Infrastructures with BRAD -- Extended Version
by: Yu, Geoffrey X., et al.
Published: (2024)
Retrieve-and-Verify: A Table Context Selection Framework for Accurate Column Annotations
by: Ding, Zhihao, et al.
Published: (2025)
Gamma Acyclicity, Annotated Relations, and Consistency Witness Functions
by: Atserias, Albert, et al.
Published: (2025)
PBench: Workload Synthesizer with Real Statistics for Cloud Analytics Benchmarking
by: Zhou, Yan, et al.
Published: (2025)
Towards Cross-Model Efficiency in SQL/PGQ
by: Rotschield, Hadar, et al.
Published: (2025)
Cloud-Native Vector Search: A Comprehensive Performance Analysis
by: Li, Zhaoheng, et al.
Published: (2025)
OpenGLT: A Comprehensive Benchmark of Graph Neural Networks for Graph-Level Tasks
by: Li, Haoyang, et al.
Published: (2025)
Saving Money for Analytical Workloads in the Cloud
by: Srivastava, Tapan, et al.
Published: (2024)