| Main Authors: | Shao, Jintian, Cheng, Yiming |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2506.03038 |
Similar Items
Towards Analyzing and Understanding the Limitations of VAPO: A Theoretical Perspective
by: Shao, Jintian, et al.
Published: (2025)
CoT is Not True Reasoning, It Is Just a Tight Constraint to Imitate: A Theory Perspective
by: Shao, Jintian, et al.
Published: (2025)
Towards Analyzing and Understanding the Limitations of DPO: A Theoretical Perspective
by: Feng, Duanyu, et al.
Published: (2024)
Power-Law Decay Loss for Large Language Model Finetuning: A Theory Perspective
by: Shao, Jintian
Published: (2025)
Analyzing Consumer Reviews for Understanding Drivers of Hotels Ratings: An Indian Perspective
by: Dasgupta, Subhasis, et al.
Published: (2024)
Towards a Theoretical Understanding of Synthetic Data in LLM Post-Training: A Reverse-Bottleneck Perspective
by: Gan, Zeyu, et al.
Published: (2024)
Towards Distillation-Resistant Large Language Models: An Information-Theoretic Perspective
by: Fang, Hao, et al.
Published: (2026)
Why Code, Why Now: An Information-Theoretic Perspective on the Limits of Machine Learning
by: Zhao, Zhimin
Published: (2026)
VQ-Logits: Compressing the Output Bottleneck of Large Language Models via Vector Quantized Logits
by: Shao, Jintian, et al.
Published: (2025)
Understanding and Addressing the Under-Translation Problem from the Perspective of Decoding Objective
by: Shao, Chenze, et al.
Published: (2024)
Analyzing Wrap-Up Effects through an Information-Theoretic Lens
by: Meister, Clara, et al.
Published: (2022)
Toward Understanding Unlearning Difficulty: A Mechanistic Perspective and Circuit-Guided Difficulty Metric
by: Cheng, Jiali, et al.
Published: (2026)
Feature Resemblance: Towards a Theoretical Understanding of Analogical Reasoning in Transformers
by: Xu, Ruichen, et al.
Published: (2026)
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics
by: Zhu, Hanlin, et al.
Published: (2024)
Analyzing Diversity in Healthcare LLM Research: A Scientometric Perspective
by: Restrepo, David, et al.
Published: (2024)
Measuring and Analyzing Subjective Uncertainty in Scientific Communications
by: Sourati, Jamshid, et al.
Published: (2025)
Theoretical Understanding of In-Context Learning in Shallow Transformers with Unstructured Data
by: Xing, Yue, et al.
Published: (2024)
Analyzing and Mitigating Object Hallucination: A Training Bias Perspective
by: Li, Yifan, et al.
Published: (2025)
Towards Cross-lingual Values Judgment: A Consensus-Pluralism Perspective
by: Chen, Yukun, et al.
Published: (2026)
An Information-Theoretic Approach to Analyze NLP Classification Tasks
by: Wang, Luran, et al.
Published: (2024)
Med-HEAL: Analyzing and Mitigating Hallucinations in Medical LLMs with Hallucination-Aware In-Context Learning
by: Liao, Yiming, et al.
Published: (2026)
Benchmarking Abstract and Reasoning Abilities Through A Theoretical Perspective
by: Ma, Qingchuan, et al.
Published: (2025)
Theoretical Limits of Language Model Alignment
by: Paes, Lucas Monteiro, et al.
Published: (2026)
Understanding and Analyzing Inappropriately Targeting Language in Online Discourse: A Comparative Annotation Study
by: Barbarestani, Baran, et al.
Published: (2025)
ComplexFormer: Disruptively Advancing Transformer Inference Ability via Head-Specific Complex Vector Attention
by: Shao, Jintian, et al.
Published: (2025)
On the Theoretical Limitations of Embedding-Based Retrieval
by: Weller, Orion, et al.
Published: (2025)
Towards Understanding Safety Alignment: A Mechanistic Perspective from Safety Neurons
by: Chen, Jianhui, et al.
Published: (2024)
Unveiling Cultural Blind Spots: Analyzing the Limitations of mLLMs in Procedural Text Comprehension
by: Yari, Amir Hossein, et al.
Published: (2025)
LinguaGame: A Linguistically Grounded Game-Theoretic Paradigm for Multi-Agent Dialogue Generation
by: Ye, Yuxiao, et al.
Published: (2026)
Do LVLMs Understand Charts? Analyzing and Correcting Factual Errors in Chart Captioning
by: Huang, Kung-Hsiang, et al.
Published: (2023)
A Theoretical Perspective for Speculative Decoding Algorithm
by: Yin, Ming, et al.
Published: (2024)
Visual Thoughts: A Unified Perspective of Understanding Multimodal Chain-of-Thought
by: Cheng, Zihui, et al.
Published: (2025)
Reranking Laws for Language Generation: A Communication-Theoretic Perspective
by: Farinhas, António, et al.
Published: (2024)
A Theoretical Understanding of Self-Correction through In-context Alignment
by: Wang, Yifei, et al.
Published: (2024)
LIDAO: Towards Limited Interventions for Debiasing (Large) Language Models
by: Liu, Tianci, et al.
Published: (2024)
Do Large Language Models Truly Understand Geometric Structures?
by: Wang, Xiaofeng, et al.
Published: (2025)
Theoretical Benefit and Limitation of Diffusion Language Model
by: Feng, Guhao, et al.
Published: (2025)
OSUM: Advancing Open Speech Understanding Models with Limited Resources in Academia
by: Geng, Xuelong, et al.
Published: (2025)
A Measure-Theoretic Analysis of Reasoning: Structural Generalization and Approximation Limits
by: Zhang, Yuyang, et al.
Published: (2026)
Analyzing the Roles of Language and Vision in Learning from Limited Data
by: Chen, Allison, et al.
Published: (2024)