Saved in:
| Main Authors: | Karimi, Ahmad Maroof, Shin, Woong, Hines, Jesse, Ghosal, Tirthankar, Sattar, Naw Safrin, Wang, Feiyi |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.16212 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Profiling and Modeling of Power Characteristics of Leadership-Scale HPC System Workloads
by: Karimi, Ahmad Maroof, et al.
Published: (2024)
by: Karimi, Ahmad Maroof, et al.
Published: (2024)
Exploring the Frontiers of Energy Efficiency using Power Management at System Scale
by: Karimi, Ahmad Maroof, et al.
Published: (2024)
by: Karimi, Ahmad Maroof, et al.
Published: (2024)
Integrating Multi-Agent Simulation, Behavioral Forensics, and Trust-Aware Machine Learning for Adaptive Insider Threat Detection
by: Kausar, Firdous, et al.
Published: (2026)
by: Kausar, Firdous, et al.
Published: (2026)
Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems
by: Karimi, Ahmad Maroof, et al.
Published: (2026)
by: Karimi, Ahmad Maroof, et al.
Published: (2026)
SiriusHelper: An LLM Agent-Based Operations Assistant for Big Data Platforms
by: Shen, Yu, et al.
Published: (2026)
by: Shen, Yu, et al.
Published: (2026)
AgenticData: An Agentic Data Analytics System for Heterogeneous Data
by: Sun, Ji, et al.
Published: (2025)
by: Sun, Ji, et al.
Published: (2025)
Searching Clinical Data Using Generative AI
by: Hanswadkar, Karan, et al.
Published: (2025)
by: Hanswadkar, Karan, et al.
Published: (2025)
AI-Driven Generation of Data Contracts in Modern Data Engineering Systems
by: Bhoite, Harshraj
Published: (2025)
by: Bhoite, Harshraj
Published: (2025)
TAIJI: MCP-based Multi-Modal Data Analytics on Data Lakes
by: Zhang, Chao, et al.
Published: (2025)
by: Zhang, Chao, et al.
Published: (2025)
RetClean: Retrieval-Based Data Cleaning Using Foundation Models and Data Lakes
by: Naeem, Zan Ahmad, et al.
Published: (2023)
by: Naeem, Zan Ahmad, et al.
Published: (2023)
VERSA: Verified Event Data Format for Reliable Soccer Analytics
by: Jo, Geonhee, et al.
Published: (2026)
by: Jo, Geonhee, et al.
Published: (2026)
Powering In-Database Dynamic Model Slicing for Structured Data Analytics
by: Zeng, Lingze, et al.
Published: (2024)
by: Zeng, Lingze, et al.
Published: (2024)
CoddLLM: Empowering Large Language Models for Data Analytics
by: Zhang, Jiani, et al.
Published: (2025)
by: Zhang, Jiani, et al.
Published: (2025)
Semantic Operators: A Declarative Model for Rich, AI-based Data Processing
by: Patel, Liana, et al.
Published: (2024)
by: Patel, Liana, et al.
Published: (2024)
Enhancing Pavement Sensor Data Acquisition for AI-Driven Transportation Research
by: Gowda, Manish Kumar Krishne, et al.
Published: (2025)
by: Gowda, Manish Kumar Krishne, et al.
Published: (2025)
GEE-OPs: An Operator Knowledge Base for Geospatial Code Generation on the Google Earth Engine Platform Powered by Large Language Models
by: Hou, Shuyang, et al.
Published: (2024)
by: Hou, Shuyang, et al.
Published: (2024)
Common Data Format (CDF): A Standardized Format for Match-Data in Football (Soccer)
by: Anzer, Gabriel, et al.
Published: (2025)
by: Anzer, Gabriel, et al.
Published: (2025)
Beyond Single-Modal Analytics: A Framework for Integrating Heterogeneous LLM-Based Query Systems for Multi-Modal Data
by: Li, Ruyu, et al.
Published: (2026)
by: Li, Ruyu, et al.
Published: (2026)
The FormAI Dataset: Generative AI in Software Security Through the Lens of Formal Verification
by: Tihanyi, Norbert, et al.
Published: (2023)
by: Tihanyi, Norbert, et al.
Published: (2023)
Capturing and Anticipating User Intents in Data Analytics via Knowledge Graphs
by: Pons, Gerard, et al.
Published: (2024)
by: Pons, Gerard, et al.
Published: (2024)
DataLab: A Unified Platform for LLM-Powered Business Intelligence
by: Weng, Luoxuan, et al.
Published: (2024)
by: Weng, Luoxuan, et al.
Published: (2024)
Compliance Rating Scheme: A Data Provenance Framework for Generative AI Datasets
by: Bohacek, Matyas, et al.
Published: (2025)
by: Bohacek, Matyas, et al.
Published: (2025)
Tabular Data Augmentation for Machine Learning: Progress and Prospects of Embracing Generative AI
by: Cui, Lingxi, et al.
Published: (2024)
by: Cui, Lingxi, et al.
Published: (2024)
Supporting Our AI Overlords: Redesigning Data Systems to be Agent-First
by: Liu, Shu, et al.
Published: (2025)
by: Liu, Shu, et al.
Published: (2025)
AI-Driven Frameworks for Enhancing Data Quality in Big Data Ecosystems: Error_Detection, Correction, and Metadata Integration
by: Elouataoui, Widad
Published: (2024)
by: Elouataoui, Widad
Published: (2024)
Beyond Pixels: Introspective and Interactive Grounding for Visualization Agents
by: Lu, Yiyang, et al.
Published: (2026)
by: Lu, Yiyang, et al.
Published: (2026)
A Generative AI Framework for Intelligent Utility Billing CO 2 Analytics and Sustainable Resource Optimisation
by: Manjunath, Pavan, et al.
Published: (2026)
by: Manjunath, Pavan, et al.
Published: (2026)
Generating Skyline Datasets for Data Science Models
by: Wang, Mengying, et al.
Published: (2025)
by: Wang, Mengying, et al.
Published: (2025)
Aixel: A Unified, Adaptive and Extensible System for AI-powered Data Analysis
by: Zhang, Meihui, et al.
Published: (2025)
by: Zhang, Meihui, et al.
Published: (2025)
CMDBench: A Benchmark for Coarse-to-fine Multimodal Data Discovery in Compound AI Systems
by: Feng, Yanlin, et al.
Published: (2024)
by: Feng, Yanlin, et al.
Published: (2024)
A Survey of Large Language Models on Generative Graph Analytics: Query, Learning, and Applications
by: Shang, Wenbo, et al.
Published: (2024)
by: Shang, Wenbo, et al.
Published: (2024)
Is Agent Memory a Database? Rethinking Data Foundations for Long-Term AI Agent Memory
by: Orogat, Abdelghny, et al.
Published: (2026)
by: Orogat, Abdelghny, et al.
Published: (2026)
GPU-Accelerated ANNS: Quantized for Speed, Built for Change
by: McCoy, Hunter, et al.
Published: (2026)
by: McCoy, Hunter, et al.
Published: (2026)
Text-to-SQL for Enterprise Data Analytics
by: Chen, Albert, et al.
Published: (2025)
by: Chen, Albert, et al.
Published: (2025)
Beyond Relational: Semantic-Aware Multi-Modal Analytics with LLM-Native Query Optimization
by: Zhu, Junhao, et al.
Published: (2025)
by: Zhu, Junhao, et al.
Published: (2025)
A Demonstration of SQLyzr: A Platform for Fine-Grained Text-to-SQL Evaluation and Analysis
by: Abedini, Sepideh, et al.
Published: (2026)
by: Abedini, Sepideh, et al.
Published: (2026)
In-Context Adaptation to Concept Drift for Learned Database Operations
by: Zhu, Jiaqi, et al.
Published: (2025)
by: Zhu, Jiaqi, et al.
Published: (2025)
Integrating Meteorological and Operational Data: A Novel Approach to Understanding Railway Delays in Finland
by: Borin, Vinicius Pozzobon, et al.
Published: (2026)
by: Borin, Vinicius Pozzobon, et al.
Published: (2026)
PETLP: A Privacy-by-Design Pipeline for Social Media Data in AI Research
by: Oh, Nick, et al.
Published: (2025)
by: Oh, Nick, et al.
Published: (2025)
SING-SQL: A Synthetic Data Generation Framework for In-Domain Text-to-SQL Translation
by: Caferoğlu, Hasan Alp, et al.
Published: (2025)
by: Caferoğlu, Hasan Alp, et al.
Published: (2025)
Similar Items
-
Profiling and Modeling of Power Characteristics of Leadership-Scale HPC System Workloads
by: Karimi, Ahmad Maroof, et al.
Published: (2024) -
Exploring the Frontiers of Energy Efficiency using Power Management at System Scale
by: Karimi, Ahmad Maroof, et al.
Published: (2024) -
Integrating Multi-Agent Simulation, Behavioral Forensics, and Trust-Aware Machine Learning for Adaptive Insider Threat Detection
by: Kausar, Firdous, et al.
Published: (2026) -
Instruction-Tuned LLMs for Parsing and Mining Unstructured Logs on Leadership HPC Systems
by: Karimi, Ahmad Maroof, et al.
Published: (2026) -
SiriusHelper: An LLM Agent-Based Operations Assistant for Big Data Platforms
by: Shen, Yu, et al.
Published: (2026)