:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Author:	Tang, Zhongpan
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2512.16963
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

From TLinFormer to TConstFormer: The Leap to Constant-Time Transformer Attention: Achieving O(1) Computation and O(1) KV Cache during Autoregressive Inference
by: Tang, Zhongpan
Published: (2025)

Rethinking Transformer Connectivity: TLinFormer, A Path to Exact, Full Context-Aware Linear Attention
by: Tang, Zhongpan
Published: (2025)

Modular Continual Learning via Zero-Leakage Reconstruction Routing and Autonomous Task Discovery
by: Kermiche, Noureddine
Published: (2026)

ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization
by: Zhao, Weibo, et al.
Published: (2024)

GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks
by: Tang, Wenwu, et al.
Published: (2026)

Compressive Mahalanobis Metric Learning Adapts to Intrinsic Dimension
by: Palias, Efstratios, et al.
Published: (2023)

Block-Operations: Using Modular Routing to Improve Compositional Generalization
by: Dietz, Florian, et al.
Published: (2024)

Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization
by: Shin, Sungbin, et al.
Published: (2024)

LLMComp: A Language Modeling Paradigm for Error-Bounded Scientific Data Compression (Technical Report)
by: Li, Guozhong, et al.
Published: (2025)

MoDeGPT: Modular Decomposition for Large Language Model Compression
by: Lin, Chi-Heng, et al.
Published: (2024)

Intrinsic Training Signals for Federated Learning Aggregation
by: Fiorini, Cosimo, et al.
Published: (2025)

SHRP: Specialized Head Routing and Pruning for Efficient Encoder Compression
by: Su, Zeli, et al.
Published: (2025)

Modular Distributed Nonconvex Learning with Error Feedback
by: Carnevale, Guido, et al.
Published: (2025)

Intrinsic Signal Models Defined by the High-Dimensional, Small-Sample Limit
by: Mototake, Yoh-ichi, et al.
Published: (2023)

Training-free Ultra Small Model for Universal Sparse Reconstruction in Compressed Sensing
by: Tang, Chaoqing, et al.
Published: (2025)

FoMo X: Modular Explainability Signals for Outlier Detection Foundation Models
by: Klüttermann, Simon, et al.
Published: (2026)

QERA: an Analytical Framework for Quantization Error Reconstruction
by: Zhang, Cheng, et al.
Published: (2024)

ECVL-ROUTER: Scenario-Aware Routing for Vision-Language Models
by: Tang, Xin, et al.
Published: (2025)

Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors
by: Yadav, Vikas, et al.
Published: (2024)

Neural Weight Compression for Language Models
by: Ryu, Jegwang, et al.
Published: (2025)

Accelerated Distributed Optimization with Compression and Error Feedback
by: Gao, Yuan, et al.
Published: (2025)

Route Sparse Autoencoder to Interpret Large Language Models
by: Shi, Wei, et al.
Published: (2025)

Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training
by: Bhaskara, Vin, et al.
Published: (2026)

Unlocking Emergent Modularity in Large Language Models
by: Qiu, Zihan, et al.
Published: (2023)

Spectral Manifold Regularization for Stable and Modular Routing in Deep MoE Architectures
by: Delibasoglu, Ibrahim
Published: (2026)

Leveraging Kernel Symmetry for Joint Compression and Error Mitigation in Edge Model Transfer
by: Hamadouche, Anis, et al.
Published: (2026)

PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
by: Tian, Ye, et al.
Published: (2025)

Compressing Search with Language Models
by: Mulc, Thomas, et al.
Published: (2024)

Proxy Compression for Language Modeling
by: Zheng, Lin, et al.
Published: (2026)

Adaptation to Intrinsic Dependence in Diffusion Language Models
by: Zhao, Yunxiao, et al.
Published: (2026)

Smoothie: Label Free Language Model Routing
by: Guha, Neel, et al.
Published: (2024)

MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model?
by: Ma, Songkai, et al.
Published: (2025)

Learning the Error Patterns of Language Models
by: Kim, Jinwoo, et al.
Published: (2026)

Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration
by: Sun, Yan, et al.
Published: (2025)

Planning Under Observation Mismatch for Traffic Signal Control via Adaptive Modular World Models
by: Huang, Zherui, et al.
Published: (2025)

Knowledge Fusion of Large Language Models Via Modular SkillPacks
by: Du, Guodong, et al.
Published: (2025)

Error Feedback Can Accurately Compress Preconditioners
by: Modoranu, Ionut-Vlad, et al.
Published: (2023)

Error Analysis in a Modular Meeting Transcription System
by: Vieting, Peter, et al.
Published: (2025)

Reconstruction Error-based Anomaly Detection with Few Outlying Examples
by: Angiulli, Fabrizio, et al.
Published: (2023)

Understanding Intrinsic Socioeconomic Biases in Large Language Models
by: Arzaghi, Mina, et al.
Published: (2024)