Saved in:
| Main Authors: | He, Shaokai, Wei, Kaiwen, Zeng, Xinyi, Chen, Xiang, Yang, Xue, Li, Zhenyang, Zhong, Jiang, Tian, Yu |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.07347 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
DiffER: Categorical Diffusion for Chemical Retrosynthesis
by: Current, Sean, et al.
Published: (2025)
by: Current, Sean, et al.
Published: (2025)
Evaluating the Reversal Curse in Model Editing
by: Xu, Hao-Xiang, et al.
Published: (2023)
by: Xu, Hao-Xiang, et al.
Published: (2023)
A Theoretical Analysis of Why Masked Diffusion Models Mitigate the Reversal Curse
by: Jeon, Moongyu, et al.
Published: (2026)
by: Jeon, Moongyu, et al.
Published: (2026)
Reversible Diffusion Decoding for Diffusion Language Models
by: Wang, Xinyun, et al.
Published: (2026)
by: Wang, Xinyun, et al.
Published: (2026)
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
by: Fan, Chenghao, et al.
Published: (2026)
by: Fan, Chenghao, et al.
Published: (2026)
Mitigating Reversal Curse in Large Language Models via Semantic-aware Permutation Training
by: Guo, Qingyan, et al.
Published: (2024)
by: Guo, Qingyan, et al.
Published: (2024)
Delving into the Reversal Curse: How Far Can Large Language Models Generalize?
by: Lin, Zhengkai, et al.
Published: (2024)
by: Lin, Zhengkai, et al.
Published: (2024)
DPN-LE: Dual Personality Neuron Localization and Editing for Large Language Models
by: Zheng, Lifan, et al.
Published: (2026)
by: Zheng, Lifan, et al.
Published: (2026)
Relative Score Policy Optimization for Diffusion Language Models
by: Yu, Zichao, et al.
Published: (2026)
by: Yu, Zichao, et al.
Published: (2026)
ES-Mem: Event Segmentation-Based Memory for Long-Term Dialogue Agents
by: Zou, Huhai, et al.
Published: (2026)
by: Zou, Huhai, et al.
Published: (2026)
Multilingual Large Language Models and Curse of Multilinguality
by: Gurgurov, Daniil, et al.
Published: (2024)
by: Gurgurov, Daniil, et al.
Published: (2024)
Learning to Shop Like Humans: A Review-driven Retrieval-Augmented Recommendation Framework with LLMs
by: Wei, Kaiwen, et al.
Published: (2025)
by: Wei, Kaiwen, et al.
Published: (2025)
Reverse Training to Nurse the Reversal Curse
by: Golovneva, Olga, et al.
Published: (2024)
by: Golovneva, Olga, et al.
Published: (2024)
GraphER: A Structure-aware Text-to-Graph Model for Entity and Relation Extraction
by: Zaratiana, Urchade, et al.
Published: (2024)
by: Zaratiana, Urchade, et al.
Published: (2024)
TreeDiff: AST-Guided Code Generation with Diffusion LLMs
by: Zeng, Yiming, et al.
Published: (2025)
by: Zeng, Yiming, et al.
Published: (2025)
Exploring the Reversal Curse and Other Deductive Logical Reasoning in BERT and GPT-Based Large Language Models
by: Wu, Da, et al.
Published: (2023)
by: Wu, Da, et al.
Published: (2023)
DiffLM: Controllable Synthetic Data Generation via Diffusion Language Models
by: Zhou, Ying, et al.
Published: (2024)
by: Zhou, Ying, et al.
Published: (2024)
CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation
by: Huang, Chihan, et al.
Published: (2025)
by: Huang, Chihan, et al.
Published: (2025)
LPO: Discovering Missed Peephole Optimizations with Large Language Models
by: Xu, Zhenyang, et al.
Published: (2025)
by: Xu, Zhenyang, et al.
Published: (2025)
DiffPO: Diffusion-styled Preference Optimization for Efficient Inference-Time Alignment of Large Language Models
by: Chen, Ruizhe, et al.
Published: (2025)
by: Chen, Ruizhe, et al.
Published: (2025)
The Curse of Popularity: Popular Entities have Catastrophic Side Effects when Deleting Knowledge from Language Models
by: Takahashi, Ryosuke, et al.
Published: (2024)
by: Takahashi, Ryosuke, et al.
Published: (2024)
An Analysis and Mitigation of the Reversal Curse
by: Lv, Ang, et al.
Published: (2023)
by: Lv, Ang, et al.
Published: (2023)
Entity-Aware Biaffine Attention Model for Improved Constituent Parsing with Reduced Entity Violations
by: Bai, Xinyi
Published: (2024)
by: Bai, Xinyi
Published: (2024)
MedHallBench: A New Benchmark for Assessing Hallucination in Medical Large Language Models
by: Zuo, Kaiwen, et al.
Published: (2024)
by: Zuo, Kaiwen, et al.
Published: (2024)
Unlocking the Power of Large Language Models for Entity Alignment
by: Jiang, Xuhui, et al.
Published: (2024)
by: Jiang, Xuhui, et al.
Published: (2024)
DiffETM: Diffusion Process Enhanced Embedded Topic Model
by: Shao, Wei, et al.
Published: (2025)
by: Shao, Wei, et al.
Published: (2025)
SafeSteer: A Decoding-level Defense Mechanism for Multimodal Large Language Models
by: Zeng, Xinyi, et al.
Published: (2026)
by: Zeng, Xinyi, et al.
Published: (2026)
DiLaDiff: Distilled Latent-Augmented Diffusion for Language Modeling
by: Lemercier, Jean-Marie, et al.
Published: (2026)
by: Lemercier, Jean-Marie, et al.
Published: (2026)
Reverse Modeling in Large Language Models
by: Yu, Sicheng, et al.
Published: (2024)
by: Yu, Sicheng, et al.
Published: (2024)
Large Language Models for Few-Shot Named Entity Recognition
by: Zhao, Yufei, et al.
Published: (2018)
by: Zhao, Yufei, et al.
Published: (2018)
LPR: Large Language Models-Aided Program Reduction
by: Zhang, Mengxiao, et al.
Published: (2023)
by: Zhang, Mengxiao, et al.
Published: (2023)
SciER: An Entity and Relation Extraction Dataset for Datasets, Methods, and Tasks in Scientific Documents
by: Zhang, Qi, et al.
Published: (2024)
by: Zhang, Qi, et al.
Published: (2024)
Revolutionizing Reinforcement Learning Framework for Diffusion Large Language Models
by: Wang, Yinjie, et al.
Published: (2025)
by: Wang, Yinjie, et al.
Published: (2025)
The Factorization Curse: Which Tokens You Predict Underlie the Reversal Curse and More
by: Kitouni, Ouail, et al.
Published: (2024)
by: Kitouni, Ouail, et al.
Published: (2024)
Large Language Diffusion Models
by: Nie, Shen, et al.
Published: (2025)
by: Nie, Shen, et al.
Published: (2025)
Additive Multi-Step Markov Chains and the Curse of Dimensionality in Large Language Models
by: Usatenko, O. V., et al.
Published: (2026)
by: Usatenko, O. V., et al.
Published: (2026)
Understanding the Repeat Curse in Large Language Models from a Feature Perspective
by: Yao, Junchi, et al.
Published: (2025)
by: Yao, Junchi, et al.
Published: (2025)
Towards a Theoretical Understanding of the 'Reversal Curse' via Training Dynamics
by: Zhu, Hanlin, et al.
Published: (2024)
by: Zhu, Hanlin, et al.
Published: (2024)
Efficient Post-Training Pruning of Large Language Models with Statistical Correction
by: Yu, Peiqi, et al.
Published: (2026)
by: Yu, Peiqi, et al.
Published: (2026)
DiffListener: Discrete Diffusion Model for Listener Generation
by: Jung, Siyeol, et al.
Published: (2025)
by: Jung, Siyeol, et al.
Published: (2025)
Similar Items
-
DiffER: Categorical Diffusion for Chemical Retrosynthesis
by: Current, Sean, et al.
Published: (2025) -
Evaluating the Reversal Curse in Model Editing
by: Xu, Hao-Xiang, et al.
Published: (2023) -
A Theoretical Analysis of Why Masked Diffusion Models Mitigate the Reversal Curse
by: Jeon, Moongyu, et al.
Published: (2026) -
Reversible Diffusion Decoding for Diffusion Language Models
by: Wang, Xinyun, et al.
Published: (2026) -
Stable-DiffCoder: Pushing the Frontier of Code Diffusion Large Language Model
by: Fan, Chenghao, et al.
Published: (2026)