:: Library Catalog

Bibliographic Details
Main Authors:	Ye, Fanjiang; Zhao, Zepeng; Mu, Yi; Shen, Jucheng; Li, Renjie; Wang, Kaijian; Agarwal, Saurabh; Lee, Myungjin; Cao, Triston; Akella, Aditya; Krishnamurthy, Arvind; Ng, T. S. Eugene; Tu, Zhengzhong; Wang, Yuke
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Systems and Control
Online Access:	https://arxiv.org/abs/2508.17756

Similar Items

GENSERVE: Efficient Co-Serving of Heterogeneous Diffusion Model Workloads
by: Ye, Fanjiang, et al.
Published: (2026)

Software-Defined Agentic Serving
by: Agarwal, Saurabh, et al.
Published: (2026)

An Efficient and Adaptive Watermark Detection System with Tile-based Error Correction
by: Zhong, Xinrui, et al.
Published: (2025)

Nalar: An agent serving framework
by: Laju, Marco, et al.
Published: (2026)

HeadsUp! High-Fidelity Portrait Image Super-Resolution
by: Li, Renjie, et al.
Published: (2025)

Patchwork: A Unified Framework for RAG Serving
by: Hu, Bodun, et al.
Published: (2025)

SYMPHONY: Improving Memory Management for LLM Inference Workloads
by: Agarwal, Saurabh, et al.
Published: (2024)

ALTO: Adaptive LoRA Tuning and Orchestration for Heterogeneous LoRA Training Workloads
by: Zuo, Jingwei, et al.
Published: (2026)

CUCo: An Agentic Framework for Compute and Communication Co-design
by: Hu, Bodun, et al.
Published: (2026)

Improving the Throughput of Diffusion-based Large Language Models via a Training-Free Confidence-Aware Calibration
by: Shen, Jucheng, et al.
Published: (2025)

PANDORA: Diffusion Policy Learning for Dexterous Robotic Piano Playing
by: Huang, Yanjia, et al.
Published: (2025)

GuideSR: Rethinking Guidance for One-Step High-Fidelity Diffusion-Based Super-Resolution
by: Arora, Aditya, et al.
Published: (2025)

On the Fundamental Limitations of Decentralized Learnable Reward Shaping in Cooperative Multi-Agent Reinforcement Learning
by: Akella, Aditya
Published: (2025)

PATCHEDSERVE: A Patch Management Framework for SLO-Optimized Hybrid Resolution Diffusion Serving
by: Sun, Desen, et al.
Published: (2025)

VISTA: Generative Visual Imagination for Vision-and-Language Navigation
by: Huang, Yanjia, et al.
Published: (2025)

Line Coverage with Multiple Robots: Algorithms and Experiments
by: Agarwal, Saurav, et al.
Published: (2022)

Comparative Analysis of Time Series Foundation Models for Demographic Forecasting: Enhancing Predictive Accuracy in US Population Dynamics
by: Akella, Aditya, et al.
Published: (2025)

CAST: Contrastive Adaptation and Distillation for Semi-Supervised Instance Segmentation
by: Taghavi, Pardis, et al.
Published: (2025)

Training a Student Expert via Semi-Supervised Foundation Model Distillation
by: Taghavi, Pardis, et al.
Published: (2026)

BlockLLM: Multi-tenant Finer-grained Serving for Large Language Models
by: Hu, Bodun, et al.
Published: (2024)

Reducing the GPU Memory Bottleneck with Lossless Compression for ML -- Extended
by: Kamath, Aditya K, et al.
Published: (2026)

Mechanistic Interpretability of Reinforcement Learning Agents
by: Trim, Tristan, et al.
Published: (2024)

Empowering Distributed Training with Sparsity-driven Data Synchronization
by: Wang, Zhuang, et al.
Published: (2023)

4KAgent: Agentic Any Image to 4K Super-Resolution
by: Zuo, Yushen, et al.
Published: (2025)

DriveGen: Towards Infinite Diverse Traffic Scenarios with Large Models
by: Zhang, Shenyu, et al.
Published: (2025)

SuperFedNAS: Cost-Efficient Federated Neural Architecture Search for On-Device Inference
by: Khare, Alind, et al.
Published: (2023)

Beyond Static Cutoffs: One-Shot Dynamic Thresholding for Diffusion Language Models
by: Shen, Jucheng, et al.
Published: (2025)

Characterization-Guided GPU Fault Resilience in NVIDIA MPS
by: Liu, Rixin, et al.
Published: (2026)

SparkVSR: Interactive Video Super-Resolution via Sparse Keyframe Propagation
by: Yu, Jiongze, et al.
Published: (2026)

Vanishing layer thickness limit of convection in multilayer porous media
by: Sha, Kaijian, et al.
Published: (2025)

Vanishing permeability limit of convection in multilayer porous media
by: Sha, Kaijian, et al.
Published: (2026)

Fast Video Generation with Sliding Tile Attention
by: Zhang, Peiyuan, et al.
Published: (2025)

A sufficient condition for the height function to be constant in $ I_g\times_ρ\mathbb{P}^n $
by: Cao, Kaijian
Published: (2024)

Clinical Median Images and Deep Learning: Advancing Automated Detection of Ultrasound Transducer Uniformity Artifacts
by: Yang Kaijian
Published: (2026)

Region-R1: Reinforcing Query-Side Region Cropping for Multi-Modal Re-Ranking
by: Hu, Chan-Wei, et al.
Published: (2026)

PISCO: Precise Video Instance Insertion with Sparse Control
by: Gao, Xiangbo, et al.
Published: (2026)

On-the-Fly Adaptive Distillation of Transformer to Dual-State Linear Attention
by: Ro, Yeonju, et al.
Published: (2025)

CWebGen -- A tool to study colour structure of scattering amplitudes in IR limit
by: Agarwal, Neelima, et al.
Published: (2024)

3D Hand Pose Estimation in Everyday Egocentric Images
by: Prakash, Aditya, et al.
Published: (2023)

$Δ$-AttnMask: Attention-Guided Masked Hidden States for Efficient Data Selection and Augmentation
by: Hu, Jucheng, et al.
Published: (2025)