Bibliographic Details
Main Authors:	Li, Meng; Wang, Peisong; Shao, Yuantian; Hu, Qinghao; Fang, Hongjian; Zhang, Yifan; Wei, Zhihui; Cheng, Jian
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.01975

Similar Items

Ban&Pick: Enhancing Performance and Efficiency of MoE-LLMs via Smarter Routing
by: Chen, Yuanteng, et al.
Published: (2025)

Block Rotation is All You Need for MXFP4 Quantization
by: Shao, Yuantian, et al.
Published: (2025)

EAC-MoE: Expert-Selection Aware Compressor for Mixture-of-Experts Large Language Models
by: Chen, Yuanteng, et al.
Published: (2025)

Towards Efficient and Accurate Spiking Neural Networks via Adaptive Bit Allocation
by: Yao, Xingting, et al.
Published: (2025)

Pruning Large Language Models to Intra-module Low-rank Architecture with Transitional Activations
by: Shen, Bowen, et al.
Published: (2024)

Two-Stage Regularization-Based Structured Pruning for LLMs
by: Feng, Mingkuan, et al.
Published: (2025)

DartQuant: Efficient Rotational Distribution Calibration for LLM Quantization
by: Shao, Yuantian, et al.
Published: (2025)

Intra-DP: A High Performance Collaborative Inference System for Mobile Edge Computing
by: Sun, Zekai, et al.
Published: (2025)

Intra-Trajectory Consistency for Reward Modeling
by: Zhou, Chaoyang, et al.
Published: (2025)

IntraMix: Intra-Class Mixup Generation for Accurate Labels and Neighbors
by: Zheng, Shenghe, et al.
Published: (2024)

$\rm SP^3$: Enhancing Structured Pruning via PCA Projection
by: Hu, Yuxuan, et al.
Published: (2023)

MoE-I$^2$: Compressing Mixture of Experts Models through Inter-Expert Pruning and Intra-Expert Low-Rank Decomposition
by: Yang, Cheng, et al.
Published: (2024)

MobileVLM: A Vision-Language Model for Better Intra- and Inter-UI Understanding
by: Wu, Qinzhuo, et al.
Published: (2024)

FastFLUX: Pruning FLUX with Block-wise Replacement and Sandwich Training
by: Cai, Fuhan, et al.
Published: (2025)

Certain Head, Uncertain Tail: Expert-Sample for Test-Time Scaling in Fine-Grained MoE
by: Chen, Yuanteng, et al.
Published: (2026)

Intra-Layer Recurrence in Transformers for Language Modeling
by: Nguyen, Anthony, et al.
Published: (2025)

Predefined Prototypes for Intra-Class Separation and Disentanglement
by: Almudévar, Antonio, et al.
Published: (2024)

DeepResearch-Slice: Bridging the Retrieval-Utilization Gap via Explicit Text Slicing
by: Lu, Shuo, et al.
Published: (2025)

HiViS: Hiding Visual Tokens from the Drafter for Speculative Decoding in Vision-Language Models
by: Xie, Zhinan, et al.
Published: (2025)

FastGL: A GPU-Efficient Framework for Accelerating Sampling-Based GNN Training at Large Scale
by: Zhu, Zeyu, et al.
Published: (2024)

Mitigating Intra- and Inter-modal Forgetting in Continual Learning of Unified Multimodal Models
by: Wei, Xiwen, et al.
Published: (2025)

Leveraging Intra-modal and Inter-modal Interaction for Multi-Modal Entity Alignment
by: Hu, Zhiwei, et al.
Published: (2024)

Between the Layers Lies the Truth: Uncertainty Estimation in LLMs Using Intra-Layer Local Information Scores
by: Badash, Zvi N., et al.
Published: (2026)

Investigating Intra-Abstraction Policies For Non-exact Abstraction Algorithms
by: Schmöcker, Robin, et al.
Published: (2025)

Greedy Output Approximation: Towards Efficient Structured Pruning for LLMs Without Retraining
by: Li, Jianwei, et al.
Published: (2024)

Towards Robust Pruning: An Adaptive Knowledge-Retention Pruning Strategy for Language Models
by: Li, Jianwei, et al.
Published: (2023)

A Unified Conditional Flow for Motion Generation, Editing, and Intra-Structural Retargeting
by: Li, Junlin, et al.
Published: (2026)

AI-driven View Guidance System in Intra-cardiac Echocardiography Imaging
by: Huh, Jaeyoung, et al.
Published: (2024)

PrunePath: Towards Highly Structured Sparse Language Models
by: Gu, Zhexuan, et al.
Published: (2026)

Inter- and Intra-Subject Variability in EEG: A Systematic Survey
by: Tran, Xuan-The, et al.
Published: (2026)

Intra-Fairness Dynamics: The Bias Spillover Effect in Targeted LLM Alignment
by: Paraschou, Eva, et al.
Published: (2026)

Harmonizing Intra-coherence and Inter-divergence in Ensemble Attacks for Adversarial Transferability
by: Ma, Zhaoyang, et al.
Published: (2025)

IMPA-HGAE: Intra-Meta-Path Augmented Heterogeneous Graph Autoencoder
by: Lin, Di, et al.
Published: (2025)

Modulating Cross-Modal Convergence with Single-Stimulus, Intra-Modal Dispersion
by: Hosseini, Eghbal A., et al.
Published: (2026)

MIRROR: Multi-agent Intra- and Inter-Reflection for Optimized Reasoning in Tool Learning
by: Guo, Zikang, et al.
Published: (2025)

Distance-Forward Learning: Enhancing the Forward-Forward Algorithm Towards High-Performance On-Chip Learning
by: Wu, Yujie, et al.
Published: (2024)

Robust Multivariate Time Series Forecasting against Intra- and Inter-Series Transitional Shift
by: He, Hui, et al.
Published: (2024)

Learning LLM Preference over Intra-Dialogue Pairs: A Framework for Utterance-level Understandings
by: Liu, Xuanqing, et al.
Published: (2025)

EXION: Exploiting Inter- and Intra-Iteration Output Sparsity for Diffusion Models
by: Heo, Jaehoon, et al.
Published: (2025)

Continuous Sign Language Recognition Using Intra-inter Gloss Attention
by: Ranjbar, Hossein, et al.
Published: (2024)