:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Yang, Zi, Choudhary, Samridhi, Kunzmann, Siegfried, Zhang, Zheng
Format:	Preprint
Published:	2023
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2306.01076
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CoMERA: Computing- and Memory-Efficient Training via Rank-Adaptive Tensor Optimization
by: Yang, Zi, et al.
Published: (2024)

Saten: Sparse Augmented Tensor Networks for Post-Training Compression of Large Language Models
by: Solgi, Ryan, et al.
Published: (2025)

ROSAQ: Rotation-based Saliency-Aware Weight Quantization for Efficiently Compressing Large Language Models
by: Yoon, Junho, et al.
Published: (2025)

SASQ: Static Activation Scaling for Quantization-Aware Training in Large Language Models
by: Mao, Shizhuo, et al.
Published: (2025)

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
by: Chen, Mengzhao, et al.
Published: (2024)

On the Compressibility of Quantized Large Language Models
by: Mao, Yu, et al.
Published: (2024)

SiLQ: Simple Large Language Model Quantization-Aware Training
by: Esser, Steven K., et al.
Published: (2025)

Quantized Large Language Models in Biomedical Natural Language Processing: Evaluation and Recommendation
by: Zhan, Zaifu, et al.
Published: (2025)

Advancements in Natural Language Processing: Exploring Transformer-Based Architectures for Text Understanding
by: Wu, Tianhao, et al.
Published: (2025)

Exploration of Marker-Based Approaches in Argument Mining through Augmented Natural Language
by: Das, Nilmadhab, et al.
Published: (2024)

Low-Rank Quantization-Aware Training for LLMs
by: Bondarenko, Yelysei, et al.
Published: (2024)

Exploring the Role of Reasoning Structures for Constructing Proofs in Multi-Step Natural Language Reasoning with Large Language Models
by: Zheng, Zi'ou, et al.
Published: (2024)

Aggressive Post-Training Compression on Extremely Large Language Models
by: Zhang, Zining, et al.
Published: (2024)

LoRETTA: Low-Rank Economic Tensor-Train Adaptation for Ultra-Low-Parameter Fine-Tuning of Large Language Models
by: Yang, Yifan, et al.
Published: (2024)

LLMC: Benchmarking Large Language Model Quantization with a Versatile Compression Toolkit
by: Gong, Ruihao, et al.
Published: (2024)

AdaZeta: Adaptive Zeroth-Order Tensor-Train Adaption for Memory-Efficient Large Language Models Fine-Tuning
by: Yang, Yifan, et al.
Published: (2024)

CommVQ: Commutative Vector Quantization for KV Cache Compression
by: Li, Junyan, et al.
Published: (2025)

Continuous Approximations for Improving Quantization Aware Training of LLMs
by: Li, He, et al.
Published: (2024)

On-the-fly Denoising for Data Augmentation in Natural Language Understanding
by: Fang, Tianqing, et al.
Published: (2022)

LatentLLM: Attention-Aware Joint Tensor Compression
by: Koike-Akino, Toshiaki, et al.
Published: (2025)

Natural Context Drift Undermines the Natural Language Understanding of Large Language Models
by: Wu, Yulong, et al.
Published: (2025)

Label-Confidence-Aware Uncertainty Estimation in Natural Language Generation
by: Lin, Qinhong, et al.
Published: (2024)

Combining Transformers with Natural Language Explanations
by: Ruggeri, Federico, et al.
Published: (2021)

Understanding Network Behaviors through Natural Language Question-Answering
by: Xing, Mingzhe, et al.
Published: (2025)

Understanding and Tackling Label Errors in Individual-Level Nature Language Understanding
by: Xiao, Yunpeng, et al.
Published: (2025)

RDR: the Recap, Deliberate, and Respond Method for Enhanced Language Understanding
by: Zi, Yuxin, et al.
Published: (2023)

Time and Memory Trade-off of KV-Cache Compression in Tensor Transformer Decoding
by: Chen, Yifang, et al.
Published: (2025)

Benchmarking Post-Training Quantization of Large Language Models under Microscaling Floating Point Formats
by: Zhang, Manyi, et al.
Published: (2026)

Learning to Compress Prompt in Natural Language Formats
by: Chuang, Yu-Neng, et al.
Published: (2024)

An Empirical Study on Prompt Compression for Large Language Models
by: Zhang, Zheng, et al.
Published: (2025)

AdaComp: Extractive Context Compression with Adaptive Predictor for Retrieval-Augmented Large Language Models
by: Zhang, Qianchi, et al.
Published: (2024)

GWQ: Gradient-Aware Weight Quantization for Large Language Models
by: Shao, Yihua, et al.
Published: (2024)

Improving Natural Language Understanding for LLMs via Large-Scale Instruction Synthesis
by: Yuan, Lin, et al.
Published: (2025)

DisCoCLIP: A Distributional Compositional Tensor Network Encoder for Vision-Language Understanding
by: Lo, Kin Ian, et al.
Published: (2025)

Semantic Mastery: Enhancing LLMs with Advanced Natural Language Understanding
by: Hariharan, Mohanakrishnan
Published: (2025)

Integrating Symbolic Natural Language Understanding and Language Models for Word Sense Disambiguation
by: Zhao, Kexin, et al.
Published: (2025)

DLLMQuant: Quantizing Diffusion-based Large Language Models
by: Xu, Chen, et al.
Published: (2025)

Quantization Meets Reasoning: Exploring LLM Low-Bit Quantization Degradation for Mathematical Reasoning
by: Li, Zhen, et al.
Published: (2025)

Evaluating Quantized Large Language Models
by: Li, Shiyao, et al.
Published: (2024)

Synthetic Feature Augmentation Improves Generalization Performance of Language Models
by: Choudhary, Ashok, et al.
Published: (2025)