:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Liao, Kuo-Yu, Chang, Cheng-Shang, Hong, Y. -W. Peter
Format:	Preprint
Published:	2024
Subjects:	Computation and Language Information Theory Machine Learning
Online Access:	https://arxiv.org/abs/2404.07009
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Latent Space Alignment for Semantic Channel Equalization
by: Hüttebräucker, Tomás, et al.
Published: (2024)

An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding
by: Hu, Dou, et al.
Published: (2025)

Geometric Signatures of Compositionality Across a Language Model's Lifetime
by: Lee, Jin Hwa, et al.
Published: (2024)

RateQuant: Optimal Mixed-Precision KV Cache Quantization via Rate-Distortion Theory
by: Zuo, Fei, et al.
Published: (2026)

Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models
by: Nagle, Alliot, et al.
Published: (2024)

Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs
by: Kiruluta, Andrew
Published: (2026)

An Enhanced Text Compression Approach Using Transformer-based Language Models
by: Rahman, Chowdhury Mofizur, et al.
Published: (2024)

Efficient Learned Data Compression via Dual-Stream Feature Decoupling
by: Ma, Huidong, et al.
Published: (2026)

Theoretical Limits of Language Model Alignment
by: Paes, Lucas Monteiro, et al.
Published: (2026)

Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory
by: Anwar, Usman, et al.
Published: (2026)

In-Context Learning with Representations: Contextual Generalization of Trained Transformers
by: Yang, Tong, et al.
Published: (2024)

Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds
by: Guo, Yang, et al.
Published: (2025)

SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts
by: Brinner, Marc, et al.
Published: (2025)

Visual Language Model based Cross-modal Semantic Communication Systems
by: Jiang, Feibo, et al.
Published: (2024)

Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning
by: Mitra, Purbesh, et al.
Published: (2025)

A Survey on Large Language Models from Concept to Implementation
by: Wang, Chen, et al.
Published: (2024)

A Little Confidence Goes a Long Way
by: Scoville, John, et al.
Published: (2024)

A Rate-Distortion Framework for Summarization
by: Arda, Enes, et al.
Published: (2025)

Language Modeling Is Compression
by: Delétang, Grégoire, et al.
Published: (2023)

Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains
by: Makkuva, Ashok Vardhan, et al.
Published: (2024)

Semantic Faithfulness and Entropy Production Measures to Tame Your LLM Demons and Manage Hallucinations
by: Halperin, Igor
Published: (2025)

Harmonizing Program Induction with Rate-Distortion Theory
by: Zhou, Hanqi, et al.
Published: (2024)

The Information of Large Language Model Geometry
by: Tan, Zhiquan, et al.
Published: (2024)

Reconstructing Biological Pathways by Applying Selective Incremental Learning to (Very) Small Language Models
by: Saha, Pranta, et al.
Published: (2025)

Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models
by: Wei, Lai, et al.
Published: (2024)

Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
by: Liang, Paul Pu, et al.
Published: (2023)

Memorization-Compression Cycles Improve Generalization
by: Yu, Fangyuan
Published: (2025)

Integrating Pre-Trained Language Model with Physical Layer Communications
by: Lee, Ju-Hyung, et al.
Published: (2024)

Language Models As Semantic Indexers
by: Jin, Bowen, et al.
Published: (2023)

Theoretical guarantees on the best-of-n alignment policy
by: Beirami, Ahmad, et al.
Published: (2024)

Understanding Factual Recall in Transformers via Associative Memories
by: Nichani, Eshaan, et al.
Published: (2024)

InfAlign: Inference-aware language model alignment
by: Balashankar, Ananth, et al.
Published: (2024)

Transformers on Markov Data: Constant Depth Suffices
by: Rajaraman, Nived, et al.
Published: (2024)

MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression
by: Elias, Noel, et al.
Published: (2024)

Proposal and study of statistical features for string similarity computation and classification
by: Rodrigues, E. O., et al.
Published: (2026)

Effective Context in Transformers: An Analysis of Fragmentation and Tokenization
by: Fesharaki, Amirmehdi Jafari, et al.
Published: (2026)

Multi-turn Training with Basic Human Feedback Helps Little on LLM Reasoning
by: Liu, Qiang, et al.
Published: (2025)

Speculative Decoding Scaling Laws (SDSL): Throughput Optimization Made Simple
by: Bozorgkhoo, Amirhossein, et al.
Published: (2026)

What Makes the Preferred Thinking Direction for LLMs in Multiple-choice Questions?
by: Zhang, Yizhe, et al.
Published: (2025)

Cost-aware LLM-based Online Dataset Annotation
by: Elumar, Eray Can, et al.
Published: (2025)