:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Kon, Patrick Tser Jern, Liu, Jiachen, Ding, Qiuyi, Qiu, Yiming, Yang, Zhenning, Huang, Yibo, Srinivasa, Jayanth, Lee, Myungjin, Chowdhury, Mosharaf, Chen, Ang
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence Machine Learning
Online Access:	https://arxiv.org/abs/2502.16069
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

EXP-Bench: Can AI Conduct AI Research Experiments?
by: Kon, Patrick Tser Jern, et al.
Published: (2025)

Cloud Infrastructure Management in the Age of AI Agents
by: Yang, Zhenning, et al.
Published: (2025)

Experiment-as-Code Labs: A Declarative Stack for AI-Driven Scientific Discovery
by: Yang, Zhenning, et al.
Published: (2026)

Ambig-IaC: Multi-level Disambiguation for Interactive Cloud Infrastructure-as-Code Synthesis
by: Yang, Zhenning, et al.
Published: (2026)

SWE-Protégé: Learning to Selectively Collaborate With an Expert Unlocks Small Language Models as Software Engineering Agents
by: Kon, Patrick Tser Jern, et al.
Published: (2026)

Andes: Defining and Enhancing Quality-of-Experience in LLM-Based Text Streaming Services
by: Liu, Jiachen, et al.
Published: (2024)

Software-Defined Agentic Serving
by: Agarwal, Saurabh, et al.
Published: (2026)

Model-Based Diagnosis: Automating End-to-End Diagnosis of Network Failures
by: Wu, Changrong, et al.
Published: (2025)

Venn: Resource Management for Collaborative Learning Jobs
by: Liu, Jiachen, et al.
Published: (2023)

Toward Cross-Layer Energy Optimizations in AI Systems
by: Chung, Jae-Won, et al.
Published: (2024)

FedTrans: Efficient Federated Learning via Multi-Model Transformation
by: Zhu, Yuxuan, et al.
Published: (2024)

Dora: QoE-Aware Hybrid Parallelism for Distributed Edge AI
by: Jin, Jianli, et al.
Published: (2025)

Cornfigurator: Automated Planning for Any-to-Any Multimodal Model Serving
by: Ma, Jeff J., et al.
Published: (2025)

Mordal: Automated Pretrained Model Selection for Vision Language Models
by: He, Shiqi, et al.
Published: (2025)

Nalar: An agent serving framework
by: Laju, Marco, et al.
Published: (2026)

The ML.ENERGY Benchmark: Toward Automated Inference Energy Measurement and Optimization
by: Chung, Jae-Won, et al.
Published: (2025)

Automated Cloud Infrastructure-as-Code Reconciliation with AI Agents
by: Yang, Zhenning, et al.
Published: (2025)

Addressing Variable Heterogeneity in Distributed Multimodal Training with Entrain
by: Jang, Insu, et al.
Published: (2026)

Efficient Distributed MLLM Training with Cornstarch
by: Jang, Insu, et al.
Published: (2025)

Disaggregating Embedding Recommendation Systems with FlexEMR
by: Huang, Yibo, et al.
Published: (2024)

Cornserve: A Distributed Serving System for Any-to-Any Multimodal Models
by: Chung, Jae-Won, et al.
Published: (2026)

KAIROS: Stateful, Context-Aware Power-Efficient Agentic Inference Serving
by: Yuan, Yichao, et al.
Published: (2026)

Characterizing Truthfulness in Large Language Model Generations with Local Intrinsic Dimension
by: Yin, Fan, et al.
Published: (2024)

AstaBench: Rigorous Benchmarking of AI Agents with a Scientific Research Suite
by: Bragg, Jonathan, et al.
Published: (2025)

ScienceAgentBench: Toward Rigorous Assessment of Language Agents for Data-Driven Scientific Discovery
by: Chen, Ziru, et al.
Published: (2024)

RAW: A Robust and Agile Plug-and-Play Watermark Framework for AI-Generated Images with Provable Guarantees
by: Xian, Xun, et al.
Published: (2024)

Kareus: Joint Reduction of Dynamic and Static Energy in Large Model Training
by: Wu, Ruofan, et al.
Published: (2026)

SciIF: Benchmarking Scientific Instruction Following Towards Rigorous Scientific Intelligence
by: Su, Encheng, et al.
Published: (2026)

SQUiD: Synthesizing Relational Databases from Unstructured Text
by: Sadia, Mushtari, et al.
Published: (2025)

Diverse Score Distillation
by: Xu, Yanbo, et al.
Published: (2024)

Attention Reveals More Than Tokens: Training-Free Long-Context Reasoning with Attention-guided Retrieval
by: Zhang, Yuwei, et al.
Published: (2025)

Sphinx: Efficiently Serving Novel View Synthesis using Regression-Guided Selective Refinement
by: Xia, Yuchen, et al.
Published: (2025)

Branch-and-Browse: Efficient and Controllable Web Exploration with Tree-Structured Reasoning and Action Memory
by: He, Shiqi, et al.
Published: (2025)

Agent-Q: Fine-Tuning Large Language Models for Quantum Circuit Generation and Optimization
by: Jern, Linus, et al.
Published: (2025)

Manifesto for Scientifically Sound Artificial Intelligence Towards an Artificial Intelligence Serving Scientific Rigor
by: Febba, Michel
Published: (2025)

An Outlook on the Opportunities and Challenges of Multi-Agent AI Systems
by: Tian, Fangqiao, et al.
Published: (2025)

OpenG2G: A Simulation Platform for AI Datacenter-Grid Runtime Coordination
by: Chung, Jae-Won, et al.
Published: (2026)

A Retrieve-and-Read Framework for Knowledge Graph Link Prediction
by: Pahuja, Vardaan, et al.
Published: (2022)

TetriServe: Efficient DiT Serving for Heterogeneous Image Generation
by: Lu, Runyu, et al.
Published: (2025)

Where Do the Joules Go? Diagnosing Inference Energy Consumption
by: Chung, Jae-Won, et al.
Published: (2026)