Saved in:
| Main Authors: | Wang, Ruhan, Wang, Zhiyong, Huang, Chengkai, Wang, Rui, Yu, Tong, Yao, Lina, Lui, John C. S., Zhou, Dongruo |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.07440 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
FERA: Uncertainty-Aware Federated Reasoning for Large Language Models
by: Wang, Ruhan, et al.
Published: (2026)
by: Wang, Ruhan, et al.
Published: (2026)
Provable Zero-Shot Generalization in Offline Reinforcement Learning
by: Wang, Zhiyong, et al.
Published: (2025)
by: Wang, Zhiyong, et al.
Published: (2025)
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
by: Wang, Zhiyong, et al.
Published: (2024)
by: Wang, Zhiyong, et al.
Published: (2024)
How to Provably Improve Return Conditioned Supervised Learning?
by: Liu, Zhishuai, et al.
Published: (2025)
by: Liu, Zhishuai, et al.
Published: (2025)
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
by: Zhao, Runze, et al.
Published: (2025)
by: Zhao, Runze, et al.
Published: (2025)
Variance-Dependent Regret Bounds for Non-stationary Linear Bandits
by: Wang, Zhiyong, et al.
Published: (2024)
by: Wang, Zhiyong, et al.
Published: (2024)
Return Augmented Decision Transformer for Off-Dynamics Reinforcement Learning
by: Wang, Ruhan, et al.
Published: (2024)
by: Wang, Ruhan, et al.
Published: (2024)
DPAdapter: Improving Differentially Private Deep Learning through Noise Tolerance Pre-training
by: Wang, Zihao, et al.
Published: (2024)
by: Wang, Zihao, et al.
Published: (2024)
Near-Optimal Second-Order Guarantees for Model-Based Adversarial Imitation Learning
by: Li, Shangzhe, et al.
Published: (2025)
by: Li, Shangzhe, et al.
Published: (2025)
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
by: Yang, Hantao, et al.
Published: (2024)
by: Yang, Hantao, et al.
Published: (2024)
Diffusion Policies for Risk-Averse Behavior Modeling in Offline Reinforcement Learning
by: Chen, Xiaocong, et al.
Published: (2024)
by: Chen, Xiaocong, et al.
Published: (2024)
Fed-CVLC: Compressing Federated Learning Communications with Variable-Length Codes
by: Su, Xiaoxin, et al.
Published: (2024)
by: Su, Xiaoxin, et al.
Published: (2024)
Breaking the $\log(1/Δ_2)$ Barrier: Better Batched Best Arm Identification with Adaptive Grids
by: Jin, Tianyuan, et al.
Published: (2025)
by: Jin, Tianyuan, et al.
Published: (2025)
Nearly Minimax Optimal Regret for Learning Linear Mixture Stochastic Shortest Path
by: Di, Qiwei, et al.
Published: (2024)
by: Di, Qiwei, et al.
Published: (2024)
On the Limits of Test-Time Compute: Sequential Reward Filtering for Better Inference
by: Yu, Yue, et al.
Published: (2025)
by: Yu, Yue, et al.
Published: (2025)
Refiner: Data Refining against Gradient Leakage Attacks in Federated Learning
by: Fan, Mingyuan, et al.
Published: (2022)
by: Fan, Mingyuan, et al.
Published: (2022)
Federated Large Language Models: Current Progress and Future Directions
by: Yao, Yuhang, et al.
Published: (2024)
by: Yao, Yuhang, et al.
Published: (2024)
Quantum Diffusion Models for Few-Shot Learning
by: Wang, Ruhan, et al.
Published: (2024)
by: Wang, Ruhan, et al.
Published: (2024)
Iterative Refinement of Flow Policies in Probability Space for Online Reinforcement Learning
by: Sun, Mingyang, et al.
Published: (2025)
by: Sun, Mingyang, et al.
Published: (2025)
Sample and Computationally Efficient Continuous-Time Reinforcement Learning with General Function Approximation
by: Zhao, Runze, et al.
Published: (2025)
by: Zhao, Runze, et al.
Published: (2025)
Combinatorial Multivariant Multi-Armed Bandits with Applications to Episodic Reinforcement Learning and Beyond
by: Liu, Xutong, et al.
Published: (2024)
by: Liu, Xutong, et al.
Published: (2024)
Large Language Model-Enhanced Multi-Armed Bandits
by: Sun, Jiahang, et al.
Published: (2025)
by: Sun, Jiahang, et al.
Published: (2025)
Towards Agentic Recommender Systems in the Era of Multimodal Large Language Models
by: Huang, Chengkai, et al.
Published: (2025)
by: Huang, Chengkai, et al.
Published: (2025)
Federated Linear Dueling Bandits
by: Huang, Xuhan, et al.
Published: (2025)
by: Huang, Xuhan, et al.
Published: (2025)
iFlip: Iterative Feedback-driven Counterfactual Example Refinement
by: Wang, Yilong, et al.
Published: (2026)
by: Wang, Yilong, et al.
Published: (2026)
FedConPE: Efficient Federated Conversational Bandits with Heterogeneous Clients
by: Li, Zhuohua, et al.
Published: (2024)
by: Li, Zhuohua, et al.
Published: (2024)
ToolACE-R: Model-aware Iterative Training and Adaptive Refinement for Tool Learning
by: Zeng, Xingshan, et al.
Published: (2025)
by: Zeng, Xingshan, et al.
Published: (2025)
Independence Constrained Disentangled Representation Learning from Epistemological Perspective
by: Wang, Ruoyu, et al.
Published: (2024)
by: Wang, Ruoyu, et al.
Published: (2024)
Near-Constant Strong Violation and Last-Iterate Convergence for Online CMDPs via Decaying Safety Margins
by: Zuo, Qian, et al.
Published: (2026)
by: Zuo, Qian, et al.
Published: (2026)
CoPS: Empowering LLM Agents with Provable Cross-Task Experience Sharing
by: Yang, Chen, et al.
Published: (2024)
by: Yang, Chen, et al.
Published: (2024)
Uncertainty-Aware Reward-Free Exploration with General Function Approximation
by: Zhang, Junkai, et al.
Published: (2024)
by: Zhang, Junkai, et al.
Published: (2024)
On ADMM in Heterogeneous Federated Learning: Personalization, Robustness, and Fairness
by: Zhu, Shengkun, et al.
Published: (2024)
by: Zhu, Shengkun, et al.
Published: (2024)
Diffusion Model-Based Data Synthesis Aided Federated Semi-Supervised Learning
by: Wang, Zhongwei, et al.
Published: (2025)
by: Wang, Zhongwei, et al.
Published: (2025)
Online Clustering of Dueling Bandits
by: Wang, Zhiyong, et al.
Published: (2025)
by: Wang, Zhiyong, et al.
Published: (2025)
IMPROVE: Iterative Model Pipeline Refinement and Optimization Leveraging LLM Experts
by: Xue, Eric, et al.
Published: (2025)
by: Xue, Eric, et al.
Published: (2025)
Mitigating Modality Quantity and Quality Imbalance in Multimodal Online Federated Learning
by: Wang, Heqiang, et al.
Published: (2025)
by: Wang, Heqiang, et al.
Published: (2025)
Graph Federated Learning Based Proactive Content Caching in Edge Computing
by: Wang, Rui
Published: (2025)
by: Wang, Rui
Published: (2025)
Iterative Refinement Improves Compositional Image Generation
by: Jaiswal, Shantanu, et al.
Published: (2026)
by: Jaiswal, Shantanu, et al.
Published: (2026)
Are LLMs Better GNN Helpers? Rethinking Robust Graph Learning under Deficiencies with Iterative Refinement
by: Wang, Zhaoyan, et al.
Published: (2025)
by: Wang, Zhaoyan, et al.
Published: (2025)
Demystifying Online Clustering of Bandits: Enhanced Exploration Under Stochastic and Smoothed Adversarial Contexts
by: Li, Zhuohua, et al.
Published: (2025)
by: Li, Zhuohua, et al.
Published: (2025)
Similar Items
-
FERA: Uncertainty-Aware Federated Reasoning for Large Language Models
by: Wang, Ruhan, et al.
Published: (2026) -
Provable Zero-Shot Generalization in Offline Reinforcement Learning
by: Wang, Zhiyong, et al.
Published: (2025) -
Model-based RL as a Minimalist Approach to Horizon-Free and Second-Order Bounds
by: Wang, Zhiyong, et al.
Published: (2024) -
How to Provably Improve Return Conditioned Supervised Learning?
by: Liu, Zhishuai, et al.
Published: (2025) -
Instance-Dependent Continuous-Time Reinforcement Learning via Maximum Likelihood Estimation
by: Zhao, Runze, et al.
Published: (2025)