Saved in:
| Main Author: | Tang, Zhongpan |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.16963 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
From TLinFormer to TConstFormer: The Leap to Constant-Time Transformer Attention: Achieving O(1) Computation and O(1) KV Cache during Autoregressive Inference
by: Tang, Zhongpan
Published: (2025)
by: Tang, Zhongpan
Published: (2025)
Rethinking Transformer Connectivity: TLinFormer, A Path to Exact, Full Context-Aware Linear Attention
by: Tang, Zhongpan
Published: (2025)
by: Tang, Zhongpan
Published: (2025)
Modular Continual Learning via Zero-Leakage Reconstruction Routing and Autonomous Task Discovery
by: Kermiche, Noureddine
Published: (2026)
by: Kermiche, Noureddine
Published: (2026)
ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization
by: Zhao, Weibo, et al.
Published: (2024)
by: Zhao, Weibo, et al.
Published: (2024)
GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks
by: Tang, Wenwu, et al.
Published: (2026)
by: Tang, Wenwu, et al.
Published: (2026)
Compressive Mahalanobis Metric Learning Adapts to Intrinsic Dimension
by: Palias, Efstratios, et al.
Published: (2023)
by: Palias, Efstratios, et al.
Published: (2023)
Block-Operations: Using Modular Routing to Improve Compositional Generalization
by: Dietz, Florian, et al.
Published: (2024)
by: Dietz, Florian, et al.
Published: (2024)
Rethinking Pruning Large Language Models: Benefits and Pitfalls of Reconstruction Error Minimization
by: Shin, Sungbin, et al.
Published: (2024)
by: Shin, Sungbin, et al.
Published: (2024)
LLMComp: A Language Modeling Paradigm for Error-Bounded Scientific Data Compression (Technical Report)
by: Li, Guozhong, et al.
Published: (2025)
by: Li, Guozhong, et al.
Published: (2025)
MoDeGPT: Modular Decomposition for Large Language Model Compression
by: Lin, Chi-Heng, et al.
Published: (2024)
by: Lin, Chi-Heng, et al.
Published: (2024)
Intrinsic Training Signals for Federated Learning Aggregation
by: Fiorini, Cosimo, et al.
Published: (2025)
by: Fiorini, Cosimo, et al.
Published: (2025)
SHRP: Specialized Head Routing and Pruning for Efficient Encoder Compression
by: Su, Zeli, et al.
Published: (2025)
by: Su, Zeli, et al.
Published: (2025)
Modular Distributed Nonconvex Learning with Error Feedback
by: Carnevale, Guido, et al.
Published: (2025)
by: Carnevale, Guido, et al.
Published: (2025)
Intrinsic Signal Models Defined by the High-Dimensional, Small-Sample Limit
by: Mototake, Yoh-ichi, et al.
Published: (2023)
by: Mototake, Yoh-ichi, et al.
Published: (2023)
Training-free Ultra Small Model for Universal Sparse Reconstruction in Compressed Sensing
by: Tang, Chaoqing, et al.
Published: (2025)
by: Tang, Chaoqing, et al.
Published: (2025)
FoMo X: Modular Explainability Signals for Outlier Detection Foundation Models
by: Klüttermann, Simon, et al.
Published: (2026)
by: Klüttermann, Simon, et al.
Published: (2026)
QERA: an Analytical Framework for Quantization Error Reconstruction
by: Zhang, Cheng, et al.
Published: (2024)
by: Zhang, Cheng, et al.
Published: (2024)
ECVL-ROUTER: Scenario-Aware Routing for Vision-Language Models
by: Tang, Xin, et al.
Published: (2025)
by: Tang, Xin, et al.
Published: (2025)
Paraphrase and Aggregate with Large Language Models for Minimizing Intent Classification Errors
by: Yadav, Vikas, et al.
Published: (2024)
by: Yadav, Vikas, et al.
Published: (2024)
Neural Weight Compression for Language Models
by: Ryu, Jegwang, et al.
Published: (2025)
by: Ryu, Jegwang, et al.
Published: (2025)
Accelerated Distributed Optimization with Compression and Error Feedback
by: Gao, Yuan, et al.
Published: (2025)
by: Gao, Yuan, et al.
Published: (2025)
Route Sparse Autoencoder to Interpret Large Language Models
by: Shi, Wei, et al.
Published: (2025)
by: Shi, Wei, et al.
Published: (2025)
Curiosity-Critic: Cumulative Prediction Error Improvement as a Tractable Intrinsic Reward for World Model Training
by: Bhaskara, Vin, et al.
Published: (2026)
by: Bhaskara, Vin, et al.
Published: (2026)
Unlocking Emergent Modularity in Large Language Models
by: Qiu, Zihan, et al.
Published: (2023)
by: Qiu, Zihan, et al.
Published: (2023)
Spectral Manifold Regularization for Stable and Modular Routing in Deep MoE Architectures
by: Delibasoglu, Ibrahim
Published: (2026)
by: Delibasoglu, Ibrahim
Published: (2026)
Leveraging Kernel Symmetry for Joint Compression and Error Mitigation in Edge Model Transfer
by: Hamadouche, Anis, et al.
Published: (2026)
by: Hamadouche, Anis, et al.
Published: (2026)
PocketLLM: Ultimate Compression of Large Language Models via Meta Networks
by: Tian, Ye, et al.
Published: (2025)
by: Tian, Ye, et al.
Published: (2025)
Compressing Search with Language Models
by: Mulc, Thomas, et al.
Published: (2024)
by: Mulc, Thomas, et al.
Published: (2024)
Proxy Compression for Language Modeling
by: Zheng, Lin, et al.
Published: (2026)
by: Zheng, Lin, et al.
Published: (2026)
Adaptation to Intrinsic Dependence in Diffusion Language Models
by: Zhao, Yunxiao, et al.
Published: (2026)
by: Zhao, Yunxiao, et al.
Published: (2026)
Smoothie: Label Free Language Model Routing
by: Guha, Neel, et al.
Published: (2024)
by: Guha, Neel, et al.
Published: (2024)
MoE-Compression: How the Compression Error of Experts Affects the Inference Accuracy of MoE Model?
by: Ma, Songkai, et al.
Published: (2025)
by: Ma, Songkai, et al.
Published: (2025)
Learning the Error Patterns of Language Models
by: Kim, Jinwoo, et al.
Published: (2026)
by: Kim, Jinwoo, et al.
Published: (2026)
Efficient Reinforcement Learning for Large Language Models with Intrinsic Exploration
by: Sun, Yan, et al.
Published: (2025)
by: Sun, Yan, et al.
Published: (2025)
Planning Under Observation Mismatch for Traffic Signal Control via Adaptive Modular World Models
by: Huang, Zherui, et al.
Published: (2025)
by: Huang, Zherui, et al.
Published: (2025)
Knowledge Fusion of Large Language Models Via Modular SkillPacks
by: Du, Guodong, et al.
Published: (2025)
by: Du, Guodong, et al.
Published: (2025)
Error Feedback Can Accurately Compress Preconditioners
by: Modoranu, Ionut-Vlad, et al.
Published: (2023)
by: Modoranu, Ionut-Vlad, et al.
Published: (2023)
Error Analysis in a Modular Meeting Transcription System
by: Vieting, Peter, et al.
Published: (2025)
by: Vieting, Peter, et al.
Published: (2025)
Reconstruction Error-based Anomaly Detection with Few Outlying Examples
by: Angiulli, Fabrizio, et al.
Published: (2023)
by: Angiulli, Fabrizio, et al.
Published: (2023)
Understanding Intrinsic Socioeconomic Biases in Large Language Models
by: Arzaghi, Mina, et al.
Published: (2024)
by: Arzaghi, Mina, et al.
Published: (2024)
Similar Items
-
From TLinFormer to TConstFormer: The Leap to Constant-Time Transformer Attention: Achieving O(1) Computation and O(1) KV Cache during Autoregressive Inference
by: Tang, Zhongpan
Published: (2025) -
Rethinking Transformer Connectivity: TLinFormer, A Path to Exact, Full Context-Aware Linear Attention
by: Tang, Zhongpan
Published: (2025) -
Modular Continual Learning via Zero-Leakage Reconstruction Routing and Autonomous Task Discovery
by: Kermiche, Noureddine
Published: (2026) -
ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization
by: Zhao, Weibo, et al.
Published: (2024) -
GRAIL: Post-hoc Compensation by Linear Reconstruction for Compressed Networks
by: Tang, Wenwu, et al.
Published: (2026)