:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Ruhan, Wang, Zhiyong, Huang, Chengkai, Wang, Rui, Yu, Tong, Yao, Lina, Lui, John C. S., Zhou, Dongruo
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2506.07440
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

FERA: Uncertainty-Aware Federated Reasoning for Large Language Models
by: Wang, Ruhan, et al.
Published: (2026)

Provable Zero-Shot Generalization in Offline Reinforcement Learning
by: Wang, Zhiyong, et al.
Published: (2025)

Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
by: Wang, Zhiyong, et al.
Published: (2024)

How to Provably Improve Return Conditioned Supervised Learning?
by: Liu, Zhishuai, et al.
Published: (2025)

Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
by: Zhao, Runze, et al.
Published: (2025)

Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
by: Wang, Zhiyong, et al.
Published: (2024)

Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning
by: Wang, Ruhan, et al.
Published: (2024)

DPAdapter: Improving Differentially Private Deep Learning through Noise Tolerance Pre-training
by: Wang, Zihao, et al.
Published: (2024)

Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning
by: Li, Shangzhe, et al.
Published: (2025)

Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
by: Yang, Hantao, et al.
Published: (2024)

Diffusion Policies for Risk-Averse Behavior Modeling in Offline Reinforcement Learning
by: Chen, Xiaocong, et al.
Published: (2024)

Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes
by: Su, Xiaoxin, et al.
Published: (2024)

Breaking the $\log(1/Δ_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids
by: Jin, Tianyuan, et al.
Published: (2025)

Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
by: Di, Qiwei, et al.
Published: (2024)

On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
by: Yu, Yue, et al.
Published: (2025)

Refiner: Data Refining against Gradient Leakage Attacks in Federated Learning
by: Fan, Mingyuan, et al.
Published: (2022)

Federated Large Language Models: Current Progress and Future Directions
by: Yao, Yuhang, et al.
Published: (2024)

Quantum Diffusion Models for Few-Shot Learning
by: Wang, Ruhan, et al.
Published: (2024)

Iterative Refinement of Flow Policies in Probability Space for Online Reinforcement Learning
by: Sun, Mingyang, et al.
Published: (2025)

Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
by: Zhao, Runze, et al.
Published: (2025)

Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
by: Liu, Xutong, et al.
Published: (2024)

Large Language Model-Enhanced Multi-Armed Bandits
by: Sun, Jiahang, et al.
Published: (2025)

Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models
by: Huang, Chengkai, et al.
Published: (2025)

Federated Linear Dueling Bandits
by: Huang, Xuhan, et al.
Published: (2025)

iFlip: Iterative Feedback-driven Counterfactual Example Refinement
by: Wang, Yilong, et al.
Published: (2026)

FedConPE: Efficient Federated Conversational Bandits with Heterogeneous Clients
by: Li, Zhuohua, et al.
Published: (2024)

ToolACE-R: Model-aware Iterative Training and Adaptive Refinement for Tool Learning
by: Zeng, Xingshan, et al.
Published: (2025)

Independence Constrained Disentangled Representation Learning from Epistemological Perspective
by: Wang, Ruoyu, et al.
Published: (2024)

Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins
by: Zuo, Qian, et al.
Published: (2026)

CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
by: Yang, Chen, et al.
Published: (2024)

Uncertainty-Aware Reward-Free Exploration with General Function Approximation
by: Zhang, Junkai, et al.
Published: (2024)

On ADMM in Heterogeneous Federated Learning: Personalization, Robustness, and Fairness
by: Zhu, Shengkun, et al.
Published: (2024)

Diffusion Model-Based Data Synthesis Aided Federated Semi-Supervised Learning
by: Wang, Zhongwei, et al.
Published: (2025)

Online Clustering of Dueling Bandits
by: Wang, Zhiyong, et al.
Published: (2025)

IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Experts
by: Xue, Eric, et al.
Published: (2025)

Mitigating Modality Quantity and Quality Imbalance in Multimodal Online Federated Learning
by: Wang, Heqiang, et al.
Published: (2025)

Graph Federated Learning Based Proactive Content Caching in Edge Computing
by: Wang, Rui
Published: (2025)

Iterative Refinement Improves Compositional Image Generation
by: Jaiswal, Shantanu, et al.
Published: (2026)

Are LLMs Better GNN Helpers? Rethinking Robust Graph Learning under Deficiencies with Iterative Refinement
by: Wang, Zhaoyan, et al.
Published: (2025)

Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
by: Li, Zhuohua, et al.
Published: (2025)