Saved in:
| Main Authors: | Shi, Boyu, Zhou, Junbo, Liu, Chang, Yang, Xu, Wang, Qiufeng, Geng, Xin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.08209 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
by: Xia, Shi-Yu, et al.
Published: (2024)
by: Xia, Shi-Yu, et al.
Published: (2024)
Transferring Core Knowledge via Learngenes
by: Feng, Fu, et al.
Published: (2024)
by: Feng, Fu, et al.
Published: (2024)
Chain-based Distillation for Effective Initialization of Variable-Sized Small Language Models
by: Shi, Boyu, et al.
Published: (2026)
by: Shi, Boyu, et al.
Published: (2026)
GENE-FL: Gene-Driven Parameter-Efficient Dynamic Federated Learning
by: Guo, Shunxin, et al.
Published: (2025)
by: Guo, Shunxin, et al.
Published: (2025)
NASH: Neural Architecture and Accelerator Search for Multiplication-Reduced Hybrid Models
by: Xu, Yang, et al.
Published: (2024)
by: Xu, Yang, et al.
Published: (2024)
Towards Understanding Feature Learning in Parameter Transfer
by: Yuan, Hua, et al.
Published: (2025)
by: Yuan, Hua, et al.
Published: (2025)
When Forgetting Builds Reliability: LLM Unlearning for Reliable Hardware Code Generation
by: Liang, Yiwen, et al.
Published: (2025)
by: Liang, Yiwen, et al.
Published: (2025)
WAVE: Weight Templates for Adaptive Initialization of Variable-sized Models
by: Feng, Fu, et al.
Published: (2024)
by: Feng, Fu, et al.
Published: (2024)
Extracting Multimodal Learngene in CLIP: Unveiling the Multimodal Generalizable Knowledge
by: Chen, Ruiming, et al.
Published: (2025)
by: Chen, Ruiming, et al.
Published: (2025)
From Isolation to Integration: Building an Adaptive Expert Forest for Pre-Trained Model-based Class-Incremental Learning
by: Liu, Ruiqi, et al.
Published: (2026)
by: Liu, Ruiqi, et al.
Published: (2026)
Efficient Deployment of Deep MIMO Detection Using Learngene
by: Zhang, Jinya, et al.
Published: (2025)
by: Zhang, Jinya, et al.
Published: (2025)
DiffCPS: Diffusion Model based Constrained Policy Search for Offline Reinforcement Learning
by: He, Longxiang, et al.
Published: (2023)
by: He, Longxiang, et al.
Published: (2023)
Learning to Search for Vehicle Routing with Multiple Time Windows
by: Xu, Kuan, et al.
Published: (2025)
by: Xu, Kuan, et al.
Published: (2025)
Diagnosing Retrieval Bias Under Multiple In-Context Knowledge Updates in Large Language Models
by: Qiao, Boyu, et al.
Published: (2026)
by: Qiao, Boyu, et al.
Published: (2026)
XPERT: Expert Knowledge Transfer for Effective Training of Language Models
by: Liu, Chang, et al.
Published: (2026)
by: Liu, Chang, et al.
Published: (2026)
Revisiting Randomization in Greedy Model Search
by: Chen, Xin, et al.
Published: (2025)
by: Chen, Xin, et al.
Published: (2025)
Heterogeneous Data Game: Characterizing the Model Competition Across Multiple Data Sources
by: Xu, Renzhe, et al.
Published: (2025)
by: Xu, Renzhe, et al.
Published: (2025)
Can Class-Priors Help Single-Positive Multi-Label Learning?
by: Liu, Biao, et al.
Published: (2023)
by: Liu, Biao, et al.
Published: (2023)
Constraint-based Pre-training: From Structured Constraints to Scalable Model Initialization
by: Feng, Fu, et al.
Published: (2026)
by: Feng, Fu, et al.
Published: (2026)
Adaptive Federated LoRA in Heterogeneous Wireless Networks with Independent Sampling
by: Hou, Yanzhao, et al.
Published: (2025)
by: Hou, Yanzhao, et al.
Published: (2025)
Un-mixing Test-time Adaptation under Heterogeneous Data Streams
by: Su, Zixian, et al.
Published: (2024)
by: Su, Zixian, et al.
Published: (2024)
DeepVision-103K: A Visually Diverse, Broad-Coverage, and Verifiable Mathematical Dataset for Multimodal Reasoning
by: Sun, Haoxiang, et al.
Published: (2026)
by: Sun, Haoxiang, et al.
Published: (2026)
Exploring the Impact of Dataset Statistical Effect Size on Model Performance and Data Sample Size Sufficiency
by: Hatamian, Arya, et al.
Published: (2025)
by: Hatamian, Arya, et al.
Published: (2025)
Saving for the future: Enhancing generalization via partial logic regularization
by: Tan, Zhaorui, et al.
Published: (2025)
by: Tan, Zhaorui, et al.
Published: (2025)
Towards Unified Task Embeddings Across Multiple Models: Bridging the Gap for Prompt-Based Large Language Models and Beyond
by: Wang, Xinyu, et al.
Published: (2024)
by: Wang, Xinyu, et al.
Published: (2024)
Multi-Study R-Learner for Estimating Heterogeneous Treatment Effects Across Studies Using Statistical Machine Learning
by: Shyr, Cathy, et al.
Published: (2023)
by: Shyr, Cathy, et al.
Published: (2023)
ASSEMBLAGE-DEEPHISTORY: A Cross-Build Binary Dataset with Temporal Coverage
by: Liu, Chang, et al.
Published: (2026)
by: Liu, Chang, et al.
Published: (2026)
AutoSynth: Automated Workflow Optimization for High-Quality Synthetic Dataset Generation via Monte Carlo Tree Search
by: Bi, Shuzhen, et al.
Published: (2025)
by: Bi, Shuzhen, et al.
Published: (2025)
Score Neural Operator: A Generative Model for Learning and Generalizing Across Multiple Probability Distributions
by: Liao, Xinyu, et al.
Published: (2024)
by: Liao, Xinyu, et al.
Published: (2024)
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
by: Wang, Xinghao, et al.
Published: (2024)
by: Wang, Xinghao, et al.
Published: (2024)
Federated Continual Learning via Knowledge Fusion: A Survey
by: Yang, Xin, et al.
Published: (2023)
by: Yang, Xin, et al.
Published: (2023)
Unveiling Statistical Significance of Online Regression over Multiple Datasets
by: Abu-Shaira, Mohammad, et al.
Published: (2025)
by: Abu-Shaira, Mohammad, et al.
Published: (2025)
Joint Training Across Multiple Activation Sparsity Regimes
by: Wang, Haotian
Published: (2026)
by: Wang, Haotian
Published: (2026)
Towards Size-invariant Salient Object Detection: A Generic Evaluation and Optimization Approach
by: Bao, Shilong, et al.
Published: (2025)
by: Bao, Shilong, et al.
Published: (2025)
BTS: Building Timeseries Dataset: Empowering Large-Scale Building Analytics
by: Prabowo, Arian, et al.
Published: (2024)
by: Prabowo, Arian, et al.
Published: (2024)
Direction Finding with Sparse Arrays Based on Variable Window Size Spatial Smoothing
by: Leite, Wesley S., et al.
Published: (2025)
by: Leite, Wesley S., et al.
Published: (2025)
Multiple Instance Verification
by: Xu, Xin, et al.
Published: (2024)
by: Xu, Xin, et al.
Published: (2024)
A Sensitivity-Driven Expert Allocation Method in LoRA-MoE for Efficient Fine-Tuning
by: Xu, Junzhou, et al.
Published: (2025)
by: Xu, Junzhou, et al.
Published: (2025)
Scaling Law Phenomena Across Regression Paradigms: Multiple and Kernel Approaches
by: Chen, Yifang, et al.
Published: (2025)
by: Chen, Yifang, et al.
Published: (2025)
When to Commit? Towards Variable-Size Self-Contained Blocks for Discrete Diffusion Language Models
by: Wang, Danny, et al.
Published: (2026)
by: Wang, Danny, et al.
Published: (2026)
Similar Items
-
Exploring Learngene via Stage-wise Weight Sharing for Initializing Variable-sized Models
by: Xia, Shi-Yu, et al.
Published: (2024) -
Transferring Core Knowledge via Learngenes
by: Feng, Fu, et al.
Published: (2024) -
Chain-based Distillation for Effective Initialization of Variable-Sized Small Language Models
by: Shi, Boyu, et al.
Published: (2026) -
GENE-FL: Gene-Driven Parameter-Efficient Dynamic Federated Learning
by: Guo, Shunxin, et al.
Published: (2025) -
NASH: Neural Architecture and Accelerator Search for Multiplication-Reduced Hybrid Models
by: Xu, Yang, et al.
Published: (2024)