:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Guo, Wei, Lu, Siyuan, Ran, Xiangdong, Tong, Yiqi, Ban, Yikun, Xu, Zelong, Fan, Jing, Huang, Zixuan, Zhang, Xiao, Hu, Zhaojun, Zhuang, Fuzhen
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.18749
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

H2Tune: Federated Foundation Model Fine-Tuning with Hybrid Heterogeneity
by: Guo, Wei, et al.
Published: (2025)

UniFAR: A Unified Facet-Aware Retrieval Framework for Scientific Documents
by: Dou, Zheng, et al.
Published: (2026)

A Comprehensive Survey of Federated Transfer Learning: Challenges, Methods and Applications
by: Guo, Wei, et al.
Published: (2024)

Proto-EVFL: Enhanced Vertical Federated Learning via Dual Prototype with Extremely Unaligned Data
by: Guo, Wei, et al.
Published: (2025)

Counterfactual Credit Policy Optimization for Multi-Agent Collaboration
by: Li, Zhongyi, et al.
Published: (2026)

Does Your Reasoning Model Implicitly Know When to Stop Thinking?
by: Huang, Zixuan, et al.
Published: (2026)

Weak-Driven Learning: How Weak Agents make Strong Agents Stronger
by: Chen, Zehao, et al.
Published: (2026)

Real-Time Aligned Reward Model beyond Semantics
by: Huang, Zixuan, et al.
Published: (2026)

Contextual Rollout Bandits for Reinforcement Learning with Verifiable Rewards
by: Lu, Xiaodong, et al.
Published: (2026)

Heterogeneous Agent Collaborative Reinforcement Learning
by: Zhang, Zhixia, et al.
Published: (2026)

CoTMR: Chain-of-Thought Multi-Scale Reasoning for Training-Free Zero-Shot Composed Image Retrieval
by: Sun, Zelong, et al.
Published: (2025)

AgriCHN: A Comprehensive Cross-domain Resource for Chinese Agricultural Named Entity Recognition
by: Zeng, Lingxiao, et al.
Published: (2025)

Novel reinforced wood material with a biomimetic hierarchical square honeycomb structure under quasi‐static loading: Simulation and experimental study
by: Zixuan Fan, et al.
Published: (2025)

Mitigating Spurious Correlations Between Question and Answer via Chain-of-Thought Correctness Perception Distillation
by: Xie, Hongyan, et al.
Published: (2025)

Bridging Social Psychology and LLM Reasoning: Conflict-Aware Meta-Review Generation via Cognitive Alignment
by: Chen, Wei, et al.
Published: (2025)

Adaptive Robust Estimator for Multi-Agent Reinforcement Learning
by: Li, Zhongyi, et al.
Published: (2026)

Electrochemical Regioselective C(sp 2 )–H Selenylation of Pyrrolo[2,3‐ d ]pyrimidine Derivatives With Diselenides
by: Zixuan Liu, et al.
Published: (2026)

Learnable Sampler Distillation for Discrete Diffusion Models
by: Fu, Feiyang, et al.
Published: (2025)

Your Group-Relative Advantage Is Biased
by: Yang, Fengkai, et al.
Published: (2026)

GCL-OT: Graph Contrastive Learning with Optimal Transport for Heterophilic Text-Attributed Graphs
by: Ren, Yating, et al.
Published: (2025)

A Market-Clearing-based Sensitivity Model for Locational Marginal and Average Carbon Emission
by: Lu, Zelong
Published: (2024)

Adaptive Batch-Wise Sample Scheduling for Direct Preference Optimization
by: Huang, Zixuan, et al.
Published: (2025)

CODA: Difficulty-Aware Compute Allocation for Adaptive Reasoning
by: Wu, Siye, et al.
Published: (2026)

HESTIA: A Hessian-Guided Differentiable Quantization-Aware Training Framework for Extremely Low-Bit LLMs
by: Wang, Guoan, et al.
Published: (2026)

BiFedKD: Bidirectional Federated Knowledge Distillation Framework for Non-IID and Long-Tailed ECG Monitoring
by: Shu, Zixuan, et al.
Published: (2026)

STRIDE: Learnable Stepwise Language Feedback for LLM Reasoning
by: Zhang, Junjie, et al.
Published: (2026)

Skill-Aware Data Selection and Fine-Tuning for Data-Efficient Reasoning Distillation
by: Zhang, Lechen, et al.
Published: (2026)

On Multilinear Forms for Mod $p$ Representations of $\mathrm{GL}_2(\mathbb{Q}_p)$
by: Fan, Yikun
Published: (2026)

Why Distillation can Outperform Zero-RL: The Role of Flexible Reasoning
by: Hu, Xiao, et al.
Published: (2025)

A General Deep Learning Framework for Wireless Resource Allocation under Discrete Constraints
by: Wang, Yikun, et al.
Published: (2026)

FLeW: Facet-Level and Adaptive Weighted Representation Learning of Scientific Documents
by: Dou, Zheng, et al.
Published: (2025)

FedPOB: Sample-Efficient Federated Prompt Optimization via Bandits
by: Lu, Pingchen, et al.
Published: (2025)

Mobility-Aware Asynchronous Federated Learning with Dynamic Sparsification
by: Yan, Jintao, et al.
Published: (2025)

A Federated Online Restless Bandit Framework for Cooperative Resource Allocation
by: Tong, Jingwen, et al.
Published: (2024)

Learnability-Guided Diffusion for Dataset Distillation
by: Chan-Santiago, Jeffrey A., et al.
Published: (2026)

SynGR: Unleashing the Potential of Cross-Modal Synergy for Generative Recommendation
by: Chen, Wei, et al.
Published: (2026)

Small-Scale-Fading-Aware Resource Allocation in Wireless Federated Learning
by: Wang, Jiacheng, et al.
Published: (2025)

Policy Improvement Reinforcement Learning
by: Wang, Huaiyang, et al.
Published: (2026)

LLMBoost: Make Large Language Models Stronger with Boosting
by: Chen, Zehao, et al.
Published: (2025)

Neural Exploitation and Exploration of Contextual Bandits
by: Ban, Yikun, et al.
Published: (2023)