Saved in:
| Main Authors: | Xiao, Jinying, Ji, Bin, Li, Shasha, Liu, Xiaodong, Jun, Ma, Wang, Chao, Li, Wei, Zhong, Ye, Xie, Xuan, Tashi, Nyima, Yu, Jie |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2511.22316 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
EMP: Enhance Memory in Data Pruning
by: Xiao, Jinying, et al.
Published: (2024)
by: Xiao, Jinying, et al.
Published: (2024)
LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment
by: Zeng, Binrui, et al.
Published: (2024)
by: Zeng, Binrui, et al.
Published: (2024)
Context-Aware Dynamic Chunking for Streaming Tibetan Speech Recognition
by: Wang, Chao, et al.
Published: (2025)
by: Wang, Chao, et al.
Published: (2025)
RetrieveAll: A Multilingual Named Entity Recognition Framework with Large Language Models
by: Zhang, Jin, et al.
Published: (2025)
by: Zhang, Jin, et al.
Published: (2025)
TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Human-in-the-Loop Generation of Adversarial Texts: A Case Study on Tibetan Script
by: Cao, Xi, et al.
Published: (2024)
by: Cao, Xi, et al.
Published: (2024)
Model Editing for LLMs4Code: How Far are We?
by: Li, Xiaopeng, et al.
Published: (2024)
by: Li, Xiaopeng, et al.
Published: (2024)
SageAttention2: Efficient Attention with Thorough Outlier Smoothing and Per-thread INT4 Quantization
by: Zhang, Jintao, et al.
Published: (2024)
by: Zhang, Jintao, et al.
Published: (2024)
OSC: Hardware Efficient W4A4 Quantization via Outlier Separation in Channel Dimension
by: Zhang, Zhiyuan, et al.
Published: (2026)
by: Zhang, Zhiyuan, et al.
Published: (2026)
TFD: A Comprehensive Structured Tibetan Foundation Dataset for Low-Resource Language Processing and Large-Scale Modeling
by: Huang, Cheng, et al.
Published: (2025)
by: Huang, Cheng, et al.
Published: (2025)
BATQuant: Outlier-resilient MXFP4 Quantization via Learnable Block-wise Optimization
by: Li, Ji-Fu, et al.
Published: (2026)
by: Li, Ji-Fu, et al.
Published: (2026)
When Modalities Remember: Continual Learning for Multimodal Knowledge Graphs
by: Li, Linyu, et al.
Published: (2026)
by: Li, Linyu, et al.
Published: (2026)
Listening, Imagining & Refining: A Heuristic Optimized ASR Correction Framework with LLMs
by: Liu, Yutong, et al.
Published: (2025)
by: Liu, Yutong, et al.
Published: (2025)
TED: Accelerate Model Training by Internal Generalization
by: Xiao, Jinying, et al.
Published: (2024)
by: Xiao, Jinying, et al.
Published: (2024)
Rotate, Clip, and Partition: Towards W2A4KV4 Quantization by Integrating Rotation and Learnable Non-uniform Quantizer
by: Choi, Euntae, et al.
Published: (2025)
by: Choi, Euntae, et al.
Published: (2025)
Identifying Knowledge Editing Types in Large Language Models
by: Li, Xiaopeng, et al.
Published: (2024)
by: Li, Xiaopeng, et al.
Published: (2024)
SWEA: Updating Factual Knowledge in Large Language Models via Subject Word Embedding Altering
by: Li, Xiaopeng, et al.
Published: (2024)
by: Li, Xiaopeng, et al.
Published: (2024)
QuaRot: Outlier-Free 4-Bit Inference in Rotated LLMs
by: Ashkboos, Saleh, et al.
Published: (2024)
by: Ashkboos, Saleh, et al.
Published: (2024)
RotateKV: Accurate and Robust 2-Bit KV Cache Quantization for LLMs via Outlier-Aware Adaptive Rotations
by: Su, Zunhai, et al.
Published: (2025)
by: Su, Zunhai, et al.
Published: (2025)
Outlier-Safe Pre-Training for Robust 4-Bit Quantization of Large Language Models
by: Park, Jungwoo, et al.
Published: (2025)
by: Park, Jungwoo, et al.
Published: (2025)
TLUE: A Tibetan Language Understanding Evaluation Benchmark
by: Gao, Fan, et al.
Published: (2025)
by: Gao, Fan, et al.
Published: (2025)
Progressive Residual Extraction based Pre-training for Speech Representation Learning
by: Wang, Tianrui, et al.
Published: (2024)
by: Wang, Tianrui, et al.
Published: (2024)
PrefixQuant: Eliminating Outliers by Prefixed Tokens for Large Language Models Quantization
by: Chen, Mengzhao, et al.
Published: (2024)
by: Chen, Mengzhao, et al.
Published: (2024)
EMSEdit: Efficient Multi-Step Meta-Learning-based Model Editing
by: Li, Xiaopeng, et al.
Published: (2025)
by: Li, Xiaopeng, et al.
Published: (2025)
DIM: Dynamic Integration of Multimodal Entity Linking with Large Language Model
by: Song, Shezheng, et al.
Published: (2024)
by: Song, Shezheng, et al.
Published: (2024)
SmoothQuant: Accurate and Efficient Post-Training Quantization for Large Language Models
by: Xiao, Guangxuan, et al.
Published: (2022)
by: Xiao, Guangxuan, et al.
Published: (2022)
SEVEN: Pruning Transformer Model by Reserving Sentinels
by: Xiao, Jinying, et al.
Published: (2024)
by: Xiao, Jinying, et al.
Published: (2024)
LNPT: Label-free Network Pruning and Training
by: Xiao, Jinying, et al.
Published: (2024)
by: Xiao, Jinying, et al.
Published: (2024)
Glucocorticoid‐Induced Tumor Necrosis Factor Receptor Ligand: Correlation and Therapeutic Potential for Cognitive Impairment in Temporal Lobe Epilepsy
by: Man Li, et al.
Published: (2026)
by: Man Li, et al.
Published: (2026)
Rethinking Residual Distribution in Locate-then-Edit Model Editing
by: Li, Xiaopeng, et al.
Published: (2025)
by: Li, Xiaopeng, et al.
Published: (2025)
TIBSTC-CoT: A Multi-Domain Instruction Dataset for Chain-of-Thought Reasoning in Language Models
by: Gao, Fan, et al.
Published: (2025)
by: Gao, Fan, et al.
Published: (2025)
Astro: Activation-guided Structured Regularization for Outlier-Robust LLM Post-Training Quantization
by: Chen, Xi, et al.
Published: (2026)
by: Chen, Xi, et al.
Published: (2026)
MASQuant: Modality-Aware Smoothing Quantization for Multimodal Large Language Models
by: Hu, Lulu, et al.
Published: (2026)
by: Hu, Lulu, et al.
Published: (2026)
CodeQuant: Unified Clustering and Quantization for Enhanced Outlier Smoothing in Low-Precision Mixture-of-Experts
by: Yin, Xiangyang, et al.
Published: (2026)
by: Yin, Xiangyang, et al.
Published: (2026)
MILLION: Mastering Long-Context LLM Inference Via Outlier-Immunized KV Product Quantization
by: Wang, Zongwu, et al.
Published: (2025)
by: Wang, Zongwu, et al.
Published: (2025)
QuantTune: Optimizing Model Quantization with Adaptive Outlier-Driven Fine Tuning
by: Chen, Jiun-Man, et al.
Published: (2024)
by: Chen, Jiun-Man, et al.
Published: (2024)
SVDQuant: Absorbing Outliers by Low-Rank Components for 4-Bit Diffusion Models
by: Li, Muyang, et al.
Published: (2024)
by: Li, Muyang, et al.
Published: (2024)
OptRot: Mitigating Weight Outliers via Data-Free Rotations for Post-Training Quantization
by: Gadhikar, Advait, et al.
Published: (2025)
by: Gadhikar, Advait, et al.
Published: (2025)
Accurate KV Cache Quantization with Outlier Tokens Tracing
by: Su, Yi, et al.
Published: (2025)
by: Su, Yi, et al.
Published: (2025)
ASER: Activation Smoothing and Error Reconstruction for Large Language Model Quantization
by: Zhao, Weibo, et al.
Published: (2024)
by: Zhao, Weibo, et al.
Published: (2024)
Similar Items
-
EMP: Enhance Memory in Data Pruning
by: Xiao, Jinying, et al.
Published: (2024) -
LSAQ: Layer-Specific Adaptive Quantization for Large Language Model Deployment
by: Zeng, Binrui, et al.
Published: (2024) -
Context-Aware Dynamic Chunking for Streaming Tibetan Speech Recognition
by: Wang, Chao, et al.
Published: (2025) -
RetrieveAll: A Multilingual Named Entity Recognition Framework with Large Language Models
by: Zhang, Jin, et al.
Published: (2025) -
TSCheater: Generating High-Quality Tibetan Adversarial Texts via Visual Similarity
by: Cao, Xi, et al.
Published: (2024)