Saved in:
| Main Authors: | Sun, Yongkang, Ding, Zhihao, Wang, Huiqiang, Cheng, Reynold, Shi, Jieming |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.17298 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Retrieve-and-Verify: A Table Context Selection Framework for Accurate Column Annotations
by: Ding, Zhihao, et al.
Published: (2025)
by: Ding, Zhihao, et al.
Published: (2025)
EasyTUS: A Comprehensive Framework for Fast and Accurate Table Union Search across Data Lakes
by: Otto, Tim
Published: (2025)
by: Otto, Tim
Published: (2025)
An Efficient Proximity Graph-based Approach to Table Union Search
by: Xie, Yiming, et al.
Published: (2025)
by: Xie, Yiming, et al.
Published: (2025)
Fast Maximum Common Subgraph Search: A Redundancy-Reduced Backtracking Approach
by: Yu, Kaiqiang, et al.
Published: (2025)
by: Yu, Kaiqiang, et al.
Published: (2025)
Metadata-driven Table Union Search: Leveraging Semantics for Restricted Access Data Integration
by: Martorana, Margherita, et al.
Published: (2025)
by: Martorana, Margherita, et al.
Published: (2025)
Something's Fishy In The Data Lake: A Critical Re-evaluation of Table Union Search Benchmarks
by: Boutaleb, Allaa, et al.
Published: (2025)
by: Boutaleb, Allaa, et al.
Published: (2025)
Tursio for Credit Unions: Structured Data Search with Automated Context Graphs
by: Tripathi, Shivani, et al.
Published: (2026)
by: Tripathi, Shivani, et al.
Published: (2026)
Efficient Methods for Accurate Sparse Trajectory Recovery and Map Matching
by: Tian, Wei, et al.
Published: (2025)
by: Tian, Wei, et al.
Published: (2025)
On Efficient Approximate Aggregate Nearest Neighbor Queries over Learned Representations
by: Wang, Carrie, et al.
Published: (2025)
by: Wang, Carrie, et al.
Published: (2025)
Multi-granularity Spatiotemporal Flow Patterns
by: Kosyfaki, Chrysanthi, et al.
Published: (2025)
by: Kosyfaki, Chrysanthi, et al.
Published: (2025)
Gen-T: Table Reclamation in Data Lakes
by: Fan, Grace, et al.
Published: (2024)
by: Fan, Grace, et al.
Published: (2024)
A Sampling-based Framework for Hypothesis Testing on Large Attributed Graphs
by: Wang, Yun, et al.
Published: (2024)
by: Wang, Yun, et al.
Published: (2024)
Fuzzy Integration of Data Lake Tables
by: Khatiwada, Aamod, et al.
Published: (2025)
by: Khatiwada, Aamod, et al.
Published: (2025)
Finding Locally Densest Subgraphs: Convex Programming with Edge and Triangle Density
by: Yang, Yi, et al.
Published: (2025)
by: Yang, Yi, et al.
Published: (2025)
AutoComp: Automated Data Compaction for Log-Structured Tables in Data Lakes
by: Gruenheid, Anja, et al.
Published: (2025)
by: Gruenheid, Anja, et al.
Published: (2025)
Filter-Centric Vector Indexing: Geometric Transformation for Efficient Filtered Vector Search
by: Heidari, Alireza, et al.
Published: (2025)
by: Heidari, Alireza, et al.
Published: (2025)
Retrieve, Merge, Predict: Augmenting Tables with Data Lakes
by: Cappuzzo, Riccardo, et al.
Published: (2024)
by: Cappuzzo, Riccardo, et al.
Published: (2024)
Dataversifying Natural Sciences: Pioneering a Data Lake Architecture for Curated Data-Centric Experiments in Life \& Earth Sciences
by: Vargas-Solar, Genoveva, et al.
Published: (2024)
by: Vargas-Solar, Genoveva, et al.
Published: (2024)
BEACON: A Benchmark for Efficient and Accurate Counting of Subgraphs
by: Najafi, Mohammad Matin, et al.
Published: (2025)
by: Najafi, Mohammad Matin, et al.
Published: (2025)
FREYJA: Efficient Join Discovery in Data Lakes
by: Maynou, Marc, et al.
Published: (2024)
by: Maynou, Marc, et al.
Published: (2024)
SIEVE: Effective Filtered Vector Search with Collection of Indexes
by: Li, Zhaoheng, et al.
Published: (2025)
by: Li, Zhaoheng, et al.
Published: (2025)
Novel Table Search [Technical Report]
by: Kassaie, Besat, et al.
Published: (2026)
by: Kassaie, Besat, et al.
Published: (2026)
Diverse Unionable Tuple Search: Novelty-Driven Discovery in Data Lakes [Technical Report]
by: Khatiwada, Aamod, et al.
Published: (2025)
by: Khatiwada, Aamod, et al.
Published: (2025)
DataFactory: Collaborative Multi-Agent Framework for Advanced Table Question Answering
by: Wang, Tong, et al.
Published: (2026)
by: Wang, Tong, et al.
Published: (2026)
LASER: A Data-Centric Method for Low-Cost and Efficient SQL Rewriting based on SQL-GRPO
by: Li, Jiahui, et al.
Published: (2026)
by: Li, Jiahui, et al.
Published: (2026)
Effective and Efficient Conductance-based Community Search at Billion Scale
by: Lin, Longlong, et al.
Published: (2025)
by: Lin, Longlong, et al.
Published: (2025)
BPI: A Novel Efficient and Reliable Search Structure for Hybrid Storage Blockchain
by: Zhao, Xinkui, et al.
Published: (2025)
by: Zhao, Xinkui, et al.
Published: (2025)
Effective and General Distance Computation for Approximate Nearest Neighbor Search
by: Yang, Mingyu, et al.
Published: (2024)
by: Yang, Mingyu, et al.
Published: (2024)
LakeVisage: Towards Scalable, Flexible and Interactive Visualization Recommendation for Data Discovery over Data Lakes
by: Hu, Yihao, et al.
Published: (2025)
by: Hu, Yihao, et al.
Published: (2025)
QCFuse: Query-Centric Cache Fusion for Efficient RAG Inference
by: Yan, Jianxin, et al.
Published: (2026)
by: Yan, Jianxin, et al.
Published: (2026)
Dynamic and Scalable Data Preparation for Object-Centric Process Mining
by: Bosmans, Lien, et al.
Published: (2024)
by: Bosmans, Lien, et al.
Published: (2024)
HCT-QA: A Benchmark for Question Answering on Human-Centric Tables
by: Ahmad, Mohammad S., et al.
Published: (2025)
by: Ahmad, Mohammad S., et al.
Published: (2025)
CANDY: A Benchmark for Continuous Approximate Nearest Neighbor Search with Dynamic Data Ingestion
by: Zeng, Xianzhi, et al.
Published: (2024)
by: Zeng, Xianzhi, et al.
Published: (2024)
DEG: Efficient Hybrid Vector Search Using the Dynamic Edge Navigation Graph
by: Yin, Ziqi, et al.
Published: (2025)
by: Yin, Ziqi, et al.
Published: (2025)
Time and Relations into Focus: Ontological Foundations of Object-Centric Event Data
by: Hooshyar, Hosna, et al.
Published: (2025)
by: Hooshyar, Hosna, et al.
Published: (2025)
Advancing Object-Centric Process Mining with Multi-Dimensional Data Operations
by: Khayatbashi, Shahrzad, et al.
Published: (2024)
by: Khayatbashi, Shahrzad, et al.
Published: (2024)
SINDI: an Efficient Index for Approximate Maximum Inner Product Search on Sparse Vectors
by: Li, Ruoxuan, et al.
Published: (2025)
by: Li, Ruoxuan, et al.
Published: (2025)
LakeHopper: Cross Data Lakes Column Type Annotation through Model Adaptation
by: Sun, Yushi, et al.
Published: (2026)
by: Sun, Yushi, et al.
Published: (2026)
Humans, Machine Learning, and Language Models in Union: A Cognitive Study on Table Unionability
by: Marimuthu, Sreeram, et al.
Published: (2025)
by: Marimuthu, Sreeram, et al.
Published: (2025)
Create Benchmarks for Data Lakes
by: Lyu, Yi, et al.
Published: (2026)
by: Lyu, Yi, et al.
Published: (2026)
Similar Items
-
Retrieve-and-Verify: A Table Context Selection Framework for Accurate Column Annotations
by: Ding, Zhihao, et al.
Published: (2025) -
EasyTUS: A Comprehensive Framework for Fast and Accurate Table Union Search across Data Lakes
by: Otto, Tim
Published: (2025) -
An Efficient Proximity Graph-based Approach to Table Union Search
by: Xie, Yiming, et al.
Published: (2025) -
Fast Maximum Common Subgraph Search: A Redundancy-Reduced Backtracking Approach
by: Yu, Kaiqiang, et al.
Published: (2025) -
Metadata-driven Table Union Search: Leveraging Semantics for Restricted Access Data Integration
by: Martorana, Margherita, et al.
Published: (2025)