Saved in:
| Main Authors: | Liao, Kuo-Yu, Chang, Cheng-Shang, Hong, Y. -W. Peter |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2404.07009 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Latent Space Alignment for Semantic Channel Equalization
by: Hüttebräucker, Tomás, et al.
Published: (2024)
by: Hüttebräucker, Tomás, et al.
Published: (2024)
An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding
by: Hu, Dou, et al.
Published: (2025)
by: Hu, Dou, et al.
Published: (2025)
Geometric Signatures of Compositionality Across a Language Model's Lifetime
by: Lee, Jin Hwa, et al.
Published: (2024)
by: Lee, Jin Hwa, et al.
Published: (2024)
RateQuant: Optimal Mixed-Precision KV Cache Quantization via Rate-Distortion Theory
by: Zuo, Fei, et al.
Published: (2026)
by: Zuo, Fei, et al.
Published: (2026)
Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models
by: Nagle, Alliot, et al.
Published: (2024)
by: Nagle, Alliot, et al.
Published: (2024)
Filtering Beats Fine Tuning: A Bayesian Kalman View of In Context Learning in LLMs
by: Kiruluta, Andrew
Published: (2026)
by: Kiruluta, Andrew
Published: (2026)
An Enhanced Text Compression Approach Using Transformer-based Language Models
by: Rahman, Chowdhury Mofizur, et al.
Published: (2024)
by: Rahman, Chowdhury Mofizur, et al.
Published: (2024)
Efficient Learned Data Compression via Dual-Stream Feature Decoupling
by: Ma, Huidong, et al.
Published: (2026)
by: Ma, Huidong, et al.
Published: (2026)
Theoretical Limits of Language Model Alignment
by: Paes, Lucas Monteiro, et al.
Published: (2026)
by: Paes, Lucas Monteiro, et al.
Published: (2026)
Analyzing and Improving Chain-of-Thought Monitorability Through Information Theory
by: Anwar, Usman, et al.
Published: (2026)
by: Anwar, Usman, et al.
Published: (2026)
In-Context Learning with Representations: Contextual Generalization of Trained Transformers
by: Yang, Tong, et al.
Published: (2024)
by: Yang, Tong, et al.
Published: (2024)
Retrieval-Augmented Generation as Noisy In-Context Learning: A Unified Theory and Risk Bounds
by: Guo, Yang, et al.
Published: (2025)
by: Guo, Yang, et al.
Published: (2025)
SemCSE: Semantic Contrastive Sentence Embeddings Using LLM-Generated Summaries For Scientific Abstracts
by: Brinner, Marc, et al.
Published: (2025)
by: Brinner, Marc, et al.
Published: (2025)
Visual Language Model based Cross-modal Semantic Communication Systems
by: Jiang, Feibo, et al.
Published: (2024)
by: Jiang, Feibo, et al.
Published: (2024)
Semantic Soft Bootstrapping: Long Context Reasoning in LLMs without Reinforcement Learning
by: Mitra, Purbesh, et al.
Published: (2025)
by: Mitra, Purbesh, et al.
Published: (2025)
A Survey on Large Language Models from Concept to Implementation
by: Wang, Chen, et al.
Published: (2024)
by: Wang, Chen, et al.
Published: (2024)
A Little Confidence Goes a Long Way
by: Scoville, John, et al.
Published: (2024)
by: Scoville, John, et al.
Published: (2024)
A Rate-Distortion Framework for Summarization
by: Arda, Enes, et al.
Published: (2025)
by: Arda, Enes, et al.
Published: (2025)
Language Modeling Is Compression
by: Delétang, Grégoire, et al.
Published: (2023)
by: Delétang, Grégoire, et al.
Published: (2023)
Attention with Markov: A Framework for Principled Analysis of Transformers via Markov Chains
by: Makkuva, Ashok Vardhan, et al.
Published: (2024)
by: Makkuva, Ashok Vardhan, et al.
Published: (2024)
Semantic Faithfulness and Entropy Production Measures to Tame Your LLM Demons and Manage Hallucinations
by: Halperin, Igor
Published: (2025)
by: Halperin, Igor
Published: (2025)
Harmonizing Program Induction with Rate-Distortion Theory
by: Zhou, Hanqi, et al.
Published: (2024)
by: Zhou, Hanqi, et al.
Published: (2024)
The Information of Large Language Model Geometry
by: Tan, Zhiquan, et al.
Published: (2024)
by: Tan, Zhiquan, et al.
Published: (2024)
Reconstructing Biological Pathways by Applying Selective Incremental Learning to (Very) Small Language Models
by: Saha, Pranta, et al.
Published: (2025)
by: Saha, Pranta, et al.
Published: (2025)
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models
by: Wei, Lai, et al.
Published: (2024)
by: Wei, Lai, et al.
Published: (2024)
Multimodal Learning Without Labeled Multimodal Data: Guarantees and Applications
by: Liang, Paul Pu, et al.
Published: (2023)
by: Liang, Paul Pu, et al.
Published: (2023)
Memorization-Compression Cycles Improve Generalization
by: Yu, Fangyuan
Published: (2025)
by: Yu, Fangyuan
Published: (2025)
Integrating Pre-Trained Language Model with Physical Layer Communications
by: Lee, Ju-Hyung, et al.
Published: (2024)
by: Lee, Ju-Hyung, et al.
Published: (2024)
Language Models As Semantic Indexers
by: Jin, Bowen, et al.
Published: (2023)
by: Jin, Bowen, et al.
Published: (2023)
Theoretical guarantees on the best-of-n alignment policy
by: Beirami, Ahmad, et al.
Published: (2024)
by: Beirami, Ahmad, et al.
Published: (2024)
Understanding Factual Recall in Transformers via Associative Memories
by: Nichani, Eshaan, et al.
Published: (2024)
by: Nichani, Eshaan, et al.
Published: (2024)
InfAlign: Inference-aware language model alignment
by: Balashankar, Ananth, et al.
Published: (2024)
by: Balashankar, Ananth, et al.
Published: (2024)
Transformers on Markov Data: Constant Depth Suffices
by: Rajaraman, Nived, et al.
Published: (2024)
by: Rajaraman, Nived, et al.
Published: (2024)
MultiTok: Variable-Length Tokenization for Efficient LLMs Adapted from LZW Compression
by: Elias, Noel, et al.
Published: (2024)
by: Elias, Noel, et al.
Published: (2024)
Proposal and study of statistical features for string similarity computation and classification
by: Rodrigues, E. O., et al.
Published: (2026)
by: Rodrigues, E. O., et al.
Published: (2026)
Effective Context in Transformers: An Analysis of Fragmentation and Tokenization
by: Fesharaki, Amirmehdi Jafari, et al.
Published: (2026)
by: Fesharaki, Amirmehdi Jafari, et al.
Published: (2026)
Multi-turn Training with Basic Human Feedback Helps Little on LLM Reasoning
by: Liu, Qiang, et al.
Published: (2025)
by: Liu, Qiang, et al.
Published: (2025)
Speculative Decoding Scaling Laws (SDSL): Throughput Optimization Made Simple
by: Bozorgkhoo, Amirhossein, et al.
Published: (2026)
by: Bozorgkhoo, Amirhossein, et al.
Published: (2026)
What Makes the Preferred Thinking Direction for LLMs in Multiple-choice Questions?
by: Zhang, Yizhe, et al.
Published: (2025)
by: Zhang, Yizhe, et al.
Published: (2025)
Cost-aware LLM-based Online Dataset Annotation
by: Elumar, Eray Can, et al.
Published: (2025)
by: Elumar, Eray Can, et al.
Published: (2025)
Similar Items
-
Latent Space Alignment for Semantic Channel Equalization
by: Hüttebräucker, Tomás, et al.
Published: (2024) -
An Information-theoretic Multi-task Representation Learning Framework for Natural Language Understanding
by: Hu, Dou, et al.
Published: (2025) -
Geometric Signatures of Compositionality Across a Language Model's Lifetime
by: Lee, Jin Hwa, et al.
Published: (2024) -
RateQuant: Optimal Mixed-Precision KV Cache Quantization via Rate-Distortion Theory
by: Zuo, Fei, et al.
Published: (2026) -
Fundamental Limits of Prompt Compression: A Rate-Distortion Framework for Black-Box Language Models
by: Nagle, Alliot, et al.
Published: (2024)