Saved in:
| Main Authors: | Guo, Wei, Lu, Siyuan, Ran, Xiangdong, Tong, Yiqi, Ban, Yikun, Xu, Zelong, Fan, Jing, Huang, Zixuan, Zhang, Xiao, Hu, Zhaojun, Zhuang, Fuzhen |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.18749 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
H2Tune: Federated Foundation Model Fine-Tuning with Hybrid Heterogeneity
by: Guo, Wei, et al.
Published: (2025)
by: Guo, Wei, et al.
Published: (2025)
UniFAR: A Unified Facet-Aware Retrieval Framework for Scientific Documents
by: Dou, Zheng, et al.
Published: (2026)
by: Dou, Zheng, et al.
Published: (2026)
A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications
by: Guo, Wei, et al.
Published: (2024)
by: Guo, Wei, et al.
Published: (2024)
Proto-EVFL: Enhanced Vertical Federated Learning via Dual Prototype with Extremely Unaligned Data
by: Guo, Wei, et al.
Published: (2025)
by: Guo, Wei, et al.
Published: (2025)
Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
by: Li, Zhongyi, et al.
Published: (2026)
by: Li, Zhongyi, et al.
Published: (2026)
Does Your Reasoning Model Implicitly Know When to Stop Thinking?
by: Huang, Zixuan, et al.
Published: (2026)
by: Huang, Zixuan, et al.
Published: (2026)
Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
by: Chen, Zehao, et al.
Published: (2026)
by: Chen, Zehao, et al.
Published: (2026)
Real-Time Aligned Reward Model beyond Semantics
by: Huang, Zixuan, et al.
Published: (2026)
by: Huang, Zixuan, et al.
Published: (2026)
Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
by: Lu, Xiaodong, et al.
Published: (2026)
by: Lu, Xiaodong, et al.
Published: (2026)
Heterogeneous Agent Collaborative Reinforcement Learning
by: Zhang, Zhixia, et al.
Published: (2026)
by: Zhang, Zhixia, et al.
Published: (2026)
CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval
by: Sun, Zelong, et al.
Published: (2025)
by: Sun, Zelong, et al.
Published: (2025)
AgriCHN: A Comprehensive Cross-domain Resource for Chinese Agricultural Named Entity Recognition
by: Zeng, Lingxiao, et al.
Published: (2025)
by: Zeng, Lingxiao, et al.
Published: (2025)
Novel reinforced wood material with a biomimetic hierarchical square honeycomb structure under quasi‐static loading: Simulation and experimental study
by: Zixuan Fan, et al.
Published: (2025)
by: Zixuan Fan, et al.
Published: (2025)
Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation
by: Xie, Hongyan, et al.
Published: (2025)
by: Xie, Hongyan, et al.
Published: (2025)
Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment
by: Chen, Wei, et al.
Published: (2025)
by: Chen, Wei, et al.
Published: (2025)
Adaptive Robust Estimator for Multi-Agent Reinforcement Learning
by: Li, Zhongyi, et al.
Published: (2026)
by: Li, Zhongyi, et al.
Published: (2026)
Electrochemical Regioselective C(sp 2 )–H Selenylation of Pyrrolo[2,3‐ d ]pyrimidine Derivatives With Diselenides
by: Zixuan Liu, et al.
Published: (2026)
by: Zixuan Liu, et al.
Published: (2026)
Learnable Sampler Distillation for Discrete Diffusion Models
by: Fu, Feiyang, et al.
Published: (2025)
by: Fu, Feiyang, et al.
Published: (2025)
Your Group-Relative Advantage Is Biased
by: Yang, Fengkai, et al.
Published: (2026)
by: Yang, Fengkai, et al.
Published: (2026)
GCL-OT: Graph Contrastive Learning with Optimal Transport for Heterophilic Text-Attributed Graphs
by: Ren, Yating, et al.
Published: (2025)
by: Ren, Yating, et al.
Published: (2025)
A Market-Clearing-based Sensitivity Model for Locational Marginal and Average Carbon Emission
by: Lu, Zelong
Published: (2024)
by: Lu, Zelong
Published: (2024)
Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization
by: Huang, Zixuan, et al.
Published: (2025)
by: Huang, Zixuan, et al.
Published: (2025)
CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning
by: Wu, Siye, et al.
Published: (2026)
by: Wu, Siye, et al.
Published: (2026)
HESTIA: A Hessian-Guided Differentiable Quantization-Aware Training Framework for Extremely Low-Bit LLMs
by: Wang, Guoan, et al.
Published: (2026)
by: Wang, Guoan, et al.
Published: (2026)
BiFedKD: Bidirectional Federated Knowledge Distillation Framework for Non-IID and Long-Tailed ECG Monitoring
by: Shu, Zixuan, et al.
Published: (2026)
by: Shu, Zixuan, et al.
Published: (2026)
STRIDE: Learnable Stepwise Language Feedback for LLM Reasoning
by: Zhang, Junjie, et al.
Published: (2026)
by: Zhang, Junjie, et al.
Published: (2026)
Skill-Aware Data Selection and Fine-Tuning for Data-Efficient Reasoning Distillation
by: Zhang, Lechen, et al.
Published: (2026)
by: Zhang, Lechen, et al.
Published: (2026)
On Multilinear Forms for Mod $p$ Representations of $\mathrm{GL}_2(\mathbb{Q}_p)$
by: Fan, Yikun
Published: (2026)
by: Fan, Yikun
Published: (2026)
Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
by: Hu, Xiao, et al.
Published: (2025)
by: Hu, Xiao, et al.
Published: (2025)
A General Deep Learning Framework for Wireless Resource Allocation under Discrete Constraints
by: Wang, Yikun, et al.
Published: (2026)
by: Wang, Yikun, et al.
Published: (2026)
FLeW: Facet-Level and Adaptive Weighted Representation Learning of Scientific Documents
by: Dou, Zheng, et al.
Published: (2025)
by: Dou, Zheng, et al.
Published: (2025)
FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
by: Lu, Pingchen, et al.
Published: (2025)
by: Lu, Pingchen, et al.
Published: (2025)
Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification
by: Yan, Jintao, et al.
Published: (2025)
by: Yan, Jintao, et al.
Published: (2025)
A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
by: Tong, Jingwen, et al.
Published: (2024)
by: Tong, Jingwen, et al.
Published: (2024)
Learnability-Guided Diffusion for Dataset Distillation
by: Chan-Santiago, Jeffrey A., et al.
Published: (2026)
by: Chan-Santiago, Jeffrey A., et al.
Published: (2026)
SynGR: Unleashing the Potential of Cross-Modal Synergy for Generative Recommendation
by: Chen, Wei, et al.
Published: (2026)
by: Chen, Wei, et al.
Published: (2026)
Small-Scale-Fading-Aware Resource Allocation in Wireless Federated Learning
by: Wang, Jiacheng, et al.
Published: (2025)
by: Wang, Jiacheng, et al.
Published: (2025)
Policy Improvement Reinforcement Learning
by: Wang, Huaiyang, et al.
Published: (2026)
by: Wang, Huaiyang, et al.
Published: (2026)
LLMBoost: Make Large Language Models Stronger with Boosting
by: Chen, Zehao, et al.
Published: (2025)
by: Chen, Zehao, et al.
Published: (2025)
Neural Exploitation and Exploration of Contextual Bandits
by: Ban, Yikun, et al.
Published: (2023)
by: Ban, Yikun, et al.
Published: (2023)
Similar Items
-
H2Tune: Federated Foundation Model Fine-Tuning with Hybrid Heterogeneity
by: Guo, Wei, et al.
Published: (2025) -
UniFAR: A Unified Facet-Aware Retrieval Framework for Scientific Documents
by: Dou, Zheng, et al.
Published: (2026) -
A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications
by: Guo, Wei, et al.
Published: (2024) -
Proto-EVFL: Enhanced Vertical Federated Learning via Dual Prototype with Extremely Unaligned Data
by: Guo, Wei, et al.
Published: (2025) -
Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
by: Li, Zhongyi, et al.
Published: (2026)