| Main Authors: | Li, Meng, Wang, Peisong, Shao, Yuantian, Hu, Qinghao, Fang, Hongjian, Zhang, Yifan, Wei, Zhihui, Cheng, Jian |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01975 |
Similar Items
Ban&Pick: Ehancing Performance and Efficiency of MoE-LLMs via Smarter Routing
by: Chen, Yuanteng, et al.
Published: (2025)
Block Rotation is All You Need for MXFP4 Quantization
by: Shao, Yuantian, et al.
Published: (2025)
EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models
by: Chen, Yuanteng, et al.
Published: (2025)
Towards Efficient and Accurate Spiking Neural Networks via Adaptive Bit Allocation
by: Yao, Xingting, et al.
Published: (2025)
Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
by: Shen, Bowen, et al.
Published: (2024)
Two-Stage Regularization-Based Structured Pruning for LLMs
by: Feng, Mingkuan, et al.
Published: (2025)
DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
by: Shao, Yuantian, et al.
Published: (2025)
Intra-DP: A High Performance Collaborative Inference System for Mobile Edge Computing
by: Sun, Zekai, et al.
Published: (2025)
Intra-Trajectory Consistency for Reward Modeling
by: Zhou, Chaoyang, et al.
Published: (2025)
IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors
by: Zheng, Shenghe, et al.
Published: (2024)
$\rm SP^3$: Enhancing Structured Pruning via PCA Projection
by: Hu, Yuxuan, et al.
Published: (2023)
MoE-I$^2$: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition
by: Yang, Cheng, et al.
Published: (2024)
MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
by: Wu, Qinzhuo, et al.
Published: (2024)
FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training
by: Cai, Fuhan, et al.
Published: (2025)
Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE
by: Chen, Yuanteng, et al.
Published: (2026)
Intra-Layer Recurrence in Transformers for Language Modeling
by: Nguyen, Anthony, et al.
Published: (2025)
Predefined Prototypes for Intra-Class Separation and Disentanglement
by: Almudévar, Antonio, et al.
Published: (2024)
DeepResearch-Slice: Bridging the Retrieval-Utilization Gap via Explicit Text Slicing
by: Lu, Shuo, et al.
Published: (2025)
HiViS: Hiding Visual Tokens from the Drafter for Speculative Decoding in Vision-Language Models
by: Xie, Zhinan, et al.
Published: (2025)
FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN Training at Large Scale
by: Zhu, Zeyu, et al.
Published: (2024)
Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models
by: Wei, Xiwen, et al.
Published: (2025)
Leveraging Intra-modal and Inter-modal Interaction for Multi-Modal Entity Alignment
by: Hu, Zhiwei, et al.
Published: (2024)
Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores
by: Badash, Zvi N., et al.
Published: (2026)
Investigating Intra-Abstraction Policies For Non-exact Abstraction Algorithms
by: Schmöcker, Robin, et al.
Published: (2025)
Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining
by: Li, Jianwei, et al.
Published: (2024)
Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models
by: Li, Jianwei, et al.
Published: (2023)
A Unified Conditional Flow for Motion Generation, Editing, and Intra-Structural Retargeting
by: Li, Junlin, et al.
Published: (2026)
AI-driven View Guidance System in Intra-cardiac Echocardiography Imaging
by: Huh, Jaeyoung, et al.
Published: (2024)
PrunePath: Towards Highly Structured Sparse Language Models
by: Gu, Zhexuan, et al.
Published: (2026)
Inter- and Intra-Subject Variability in EEG: A Systematic Survey
by: Tran, Xuan-The, et al.
Published: (2026)
Intra-Fairness Dynamics: The Bias Spillover Effect in Targeted LLM Alignment
by: Paraschou, Eva, et al.
Published: (2026)
Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability
by: Ma, Zhaoyang, et al.
Published: (2025)
IMPA-HGAE:Intra-Meta-Path Augmented Heterogeneous Graph Autoencoder
by: Lin, Di, et al.
Published: (2025)
Modulating Cross-Modal Convergence with Single-Stimulus, Intra-Modal Dispersion
by: Hosseini, Eghbal A., et al.
Published: (2026)
MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
by: Guo, Zikang, et al.
Published: (2025)
Distance-Forward Learning: Enhancing the Forward-Forward Algorithm Towards High-Performance On-Chip Learning
by: Wu, Yujie, et al.
Published: (2024)
Robust Multivariate Time Series Forecasting against Intra- and Inter-Series Transitional Shift
by: He, Hui, et al.
Published: (2024)
Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings
by: Liu, Xuanqing, et al.
Published: (2025)
EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion Models
by: Heo, Jaehoon, et al.
Published: (2025)
Continuous Sign Language Recognition Using Intra-inter Gloss Attention
by: Ranjbar, Hossein, et al.
Published: (2024)