Saved in:
| Main Authors: | Kon, Patrick Tser Jern, Liu, Jiachen, Ding, Qiuyi, Qiu, Yiming, Yang, Zhenning, Huang, Yibo, Srinivasa, Jayanth, Lee, Myungjin, Chowdhury, Mosharaf, Chen, Ang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.16069 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EXP-Bench: Can AI Conduct AI Research Experiments?
by: Kon, Patrick Tser Jern, et al.
Published: (2025)
by: Kon, Patrick Tser Jern, et al.
Published: (2025)
Cloud Infrastructure Management in the Age of AI Agents
by: Yang, Zhenning, et al.
Published: (2025)
by: Yang, Zhenning, et al.
Published: (2025)
Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery
by: Yang, Zhenning, et al.
Published: (2026)
by: Yang, Zhenning, et al.
Published: (2026)
Ambig-IaC: Multi-level Disambiguation for Interactive Cloud Infrastructure-as-Code Synthesis
by: Yang, Zhenning, et al.
Published: (2026)
by: Yang, Zhenning, et al.
Published: (2026)
SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2026)
by: Kon, Patrick Tser Jern, et al.
Published: (2026)
Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services
by: Liu, Jiachen, et al.
Published: (2024)
by: Liu, Jiachen, et al.
Published: (2024)
Software-Defined Agentic Serving
by: Agarwal, Saurabh, et al.
Published: (2026)
by: Agarwal, Saurabh, et al.
Published: (2026)
Model-Based Diagnosis: Automating End-to-End Diagnosis of Network Failures
by: Wu, Changrong, et al.
Published: (2025)
by: Wu, Changrong, et al.
Published: (2025)
Venn: Resource Management for Collaborative Learning Jobs
by: Liu, Jiachen, et al.
Published: (2023)
by: Liu, Jiachen, et al.
Published: (2023)
Toward Cross-Layer Energy Optimizations in AI Systems
by: Chung, Jae-Won, et al.
Published: (2024)
by: Chung, Jae-Won, et al.
Published: (2024)
FedTrans: Efficient Federated Learning via Multi-Model Transformation
by: Zhu, Yuxuan, et al.
Published: (2024)
by: Zhu, Yuxuan, et al.
Published: (2024)
Dora: QoE-Aware Hybrid Parallelism for Distributed Edge AI
by: Jin, Jianli, et al.
Published: (2025)
by: Jin, Jianli, et al.
Published: (2025)
Cornfigurator: Automated Planning for Any-to-Any Multimodal Model Serving
by: Ma, Jeff J., et al.
Published: (2025)
by: Ma, Jeff J., et al.
Published: (2025)
Mordal: Automated Pretrained Model Selection for Vision Language Models
by: He, Shiqi, et al.
Published: (2025)
by: He, Shiqi, et al.
Published: (2025)
Nalar: An agent serving framework
by: Laju, Marco, et al.
Published: (2026)
by: Laju, Marco, et al.
Published: (2026)
The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization
by: Chung, Jae-Won, et al.
Published: (2025)
by: Chung, Jae-Won, et al.
Published: (2025)
Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents
by: Yang, Zhenning, et al.
Published: (2025)
by: Yang, Zhenning, et al.
Published: (2025)
Addressing Variable Heterogeneity in Distributed Multimodal Training with Entrain
by: Jang, Insu, et al.
Published: (2026)
by: Jang, Insu, et al.
Published: (2026)
Efficient Distributed MLLM Training with Cornstarch
by: Jang, Insu, et al.
Published: (2025)
by: Jang, Insu, et al.
Published: (2025)
Disaggregating Embedding Recommendation Systems with FlexEMR
by: Huang, Yibo, et al.
Published: (2024)
by: Huang, Yibo, et al.
Published: (2024)
Cornserve: A Distributed Serving System for Any-to-Any Multimodal Models
by: Chung, Jae-Won, et al.
Published: (2026)
by: Chung, Jae-Won, et al.
Published: (2026)
KAIROS: Stateful, Context-Aware Power-Efficient Agentic Inference Serving
by: Yuan, Yichao, et al.
Published: (2026)
by: Yuan, Yichao, et al.
Published: (2026)
Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension
by: Yin, Fan, et al.
Published: (2024)
by: Yin, Fan, et al.
Published: (2024)
AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite
by: Bragg, Jonathan, et al.
Published: (2025)
by: Bragg, Jonathan, et al.
Published: (2025)
ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
by: Chen, Ziru, et al.
Published: (2024)
by: Chen, Ziru, et al.
Published: (2024)
RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees
by: Xian, Xun, et al.
Published: (2024)
by: Xian, Xun, et al.
Published: (2024)
Kareus: Joint Reduction of Dynamic and Static Energy in Large Model Training
by: Wu, Ruofan, et al.
Published: (2026)
by: Wu, Ruofan, et al.
Published: (2026)
SciIF: Benchmarking Scientific Instruction Following Towards Rigorous Scientific Intelligence
by: Su, Encheng, et al.
Published: (2026)
by: Su, Encheng, et al.
Published: (2026)
SQUiD: Synthesizing Relational Databases from Unstructured Text
by: Sadia, Mushtari, et al.
Published: (2025)
by: Sadia, Mushtari, et al.
Published: (2025)
Diverse Score Distillation
by: Xu, Yanbo, et al.
Published: (2024)
by: Xu, Yanbo, et al.
Published: (2024)
Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval
by: Zhang, Yuwei, et al.
Published: (2025)
by: Zhang, Yuwei, et al.
Published: (2025)
Sphinx: Efficiently Serving Novel View Synthesis using Regression-Guided Selective Refinement
by: Xia, Yuchen, et al.
Published: (2025)
by: Xia, Yuchen, et al.
Published: (2025)
Branch-and-Browse: Efficient and Controllable Web Exploration with Tree-Structured Reasoning and Action Memory
by: He, Shiqi, et al.
Published: (2025)
by: He, Shiqi, et al.
Published: (2025)
Agent-Q: Fine-Tuning Large Language Models for Quantum Circuit Generation and Optimization
by: Jern, Linus, et al.
Published: (2025)
by: Jern, Linus, et al.
Published: (2025)
Manifesto for Scientifically Sound Artificial Intelligence Towards an Artificial Intelligence Serving Scientific Rigor
by: Febba, Michel
Published: (2025)
by: Febba, Michel
Published: (2025)
An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems
by: Tian, Fangqiao, et al.
Published: (2025)
by: Tian, Fangqiao, et al.
Published: (2025)
OpenG2G: A Simulation Platform for AI Datacenter-Grid Runtime Coordination
by: Chung, Jae-Won, et al.
Published: (2026)
by: Chung, Jae-Won, et al.
Published: (2026)
A Retrieve-and-Read Framework for Knowledge Graph Link Prediction
by: Pahuja, Vardaan, et al.
Published: (2022)
by: Pahuja, Vardaan, et al.
Published: (2022)
TetriServe: Efficient DiT Serving for Heterogeneous Image Generation
by: Lu, Runyu, et al.
Published: (2025)
by: Lu, Runyu, et al.
Published: (2025)
Where Do the Joules Go? Diagnosing Inference Energy Consumption
by: Chung, Jae-Won, et al.
Published: (2026)
by: Chung, Jae-Won, et al.
Published: (2026)
Similar Items
-
EXP-Bench: Can AI Conduct AI Research Experiments?
by: Kon, Patrick Tser Jern, et al.
Published: (2025) -
Cloud Infrastructure Management in the Age of AI Agents
by: Yang, Zhenning, et al.
Published: (2025) -
Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery
by: Yang, Zhenning, et al.
Published: (2026) -
Ambig-IaC: Multi-level Disambiguation for Interactive Cloud Infrastructure-as-Code Synthesis
by: Yang, Zhenning, et al.
Published: (2026) -
SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2026)