Saved in:
| Main Authors: | Wang, Danny, Qiu, Ruihong, Huang, Zi |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.23994 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Text Meets Topology: Rethinking Out-of-distribution Detection in Text-Rich Networks
by: Wang, Danny, et al.
Published: (2025)
by: Wang, Danny, et al.
Published: (2025)
Block-R1: Rethinking the Role of Block Size in Multi-domain Reinforcement Learning for Diffusion Large Language Models
by: Jiang, Yan, et al.
Published: (2026)
by: Jiang, Yan, et al.
Published: (2026)
TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only
by: Liu, Yilun, et al.
Published: (2026)
by: Liu, Yilun, et al.
Published: (2026)
Break the Block: Dynamic-size Reasoning Blocks for Diffusion Large Language Models via Monotonic Entropy Descent with Reinforcement Learning
by: Jiang, Yan, et al.
Published: (2026)
by: Jiang, Yan, et al.
Published: (2026)
When Does a Language Model Commit? A Finite-Answer Theory of Pre-Verbalization Commitment
by: Zhang, Long, et al.
Published: (2026)
by: Zhang, Long, et al.
Published: (2026)
What Information Matters? Graph Out-of-Distribution Detection via Tri-Component Information Decomposition
by: Wang, Danny, et al.
Published: (2026)
by: Wang, Danny, et al.
Published: (2026)
GOLD: Graph Out-of-Distribution Detection via Implicit Adversarial Latent Generation
by: Wang, Danny, et al.
Published: (2025)
by: Wang, Danny, et al.
Published: (2025)
BitStack: Any-Size Compression of Large Language Models in Variable Memory Environments
by: Wang, Xinghao, et al.
Published: (2024)
by: Wang, Xinghao, et al.
Published: (2024)
GeoBlock: Inferring Block Granularity from Dependency Geometry in Diffusion Language Models
by: Wan, Lipeng, et al.
Published: (2026)
by: Wan, Lipeng, et al.
Published: (2026)
Drifting Objectives for Refining Discrete Diffusion Language Models
by: Oba, Daisuke, et al.
Published: (2026)
by: Oba, Daisuke, et al.
Published: (2026)
Discrete Diffusion Language Model for Efficient Text Summarization
by: Dat, Do Huu, et al.
Published: (2024)
by: Dat, Do Huu, et al.
Published: (2024)
LangFlow: Continuous Diffusion Rivals Discrete in Language Modeling
by: Chen, Yuxin, et al.
Published: (2026)
by: Chen, Yuxin, et al.
Published: (2026)
LaMsS: When Large Language Models Meet Self-Skepticism
by: Wu, Yetao, et al.
Published: (2024)
by: Wu, Yetao, et al.
Published: (2024)
MMR-GRPO: Accelerating GRPO-Style Training through Diversity-Aware Reward Reweighting
by: Wei, Kangda, et al.
Published: (2026)
by: Wei, Kangda, et al.
Published: (2026)
Block Circulant Adapter for Large Language Models
by: Ding, Xinyu, et al.
Published: (2025)
by: Ding, Xinyu, et al.
Published: (2025)
Non-Markovian Discrete Diffusion with Causal Language Models
by: Zhang, Yangtian, et al.
Published: (2025)
by: Zhang, Yangtian, et al.
Published: (2025)
LoSA: Locality Aware Sparse Attention for Block-Wise Diffusion Language Models
by: Xi, Haocheng, et al.
Published: (2026)
by: Xi, Haocheng, et al.
Published: (2026)
Improving Variable-Length Generation in Diffusion Language Models via Length Regularization
by: Cheng, Zicong, et al.
Published: (2026)
by: Cheng, Zicong, et al.
Published: (2026)
Continuous Diffusion Scales Competitively with Discrete Diffusion for Language
by: Yang, Zhihan, et al.
Published: (2026)
by: Yang, Zhihan, et al.
Published: (2026)
LEGO: Language Model Building Blocks
by: Bhansali, Shrenik, et al.
Published: (2024)
by: Bhansali, Shrenik, et al.
Published: (2024)
Few-Step Diffusion Language Models via Trajectory Self-Distillation
by: Zhang, Tunyu, et al.
Published: (2026)
by: Zhang, Tunyu, et al.
Published: (2026)
Self-MoE: Towards Compositional Large Language Models with Self-Specialized Experts
by: Kang, Junmo, et al.
Published: (2024)
by: Kang, Junmo, et al.
Published: (2024)
Constrained Discrete Diffusion
by: Cardei, Michael, et al.
Published: (2025)
by: Cardei, Michael, et al.
Published: (2025)
CBQ: Cross-Block Quantization for Large Language Models
by: Ding, Xin, et al.
Published: (2023)
by: Ding, Xin, et al.
Published: (2023)
Towards Lifelong Learning of Large Language Models: A Survey
by: Zheng, Junhao, et al.
Published: (2024)
by: Zheng, Junhao, et al.
Published: (2024)
A Reparameterized Discrete Diffusion Model for Text Generation
by: Zheng, Lin, et al.
Published: (2023)
by: Zheng, Lin, et al.
Published: (2023)
Discrete Diffusion Modeling by Estimating the Ratios of the Data Distribution
by: Lou, Aaron, et al.
Published: (2023)
by: Lou, Aaron, et al.
Published: (2023)
FinanceQA: A Benchmark for Evaluating Financial Analysis Capabilities of Large Language Models
by: Mateega, Spencer, et al.
Published: (2025)
by: Mateega, Spencer, et al.
Published: (2025)
Stability-Weighted Decoding for Diffusion Language Models
by: Wu, Yue, et al.
Published: (2026)
by: Wu, Yue, et al.
Published: (2026)
Steering Without Breaking: Mechanistically Informed Interventions for Discrete Diffusion Language Models
by: Zhou, Hanhan, et al.
Published: (2026)
by: Zhou, Hanhan, et al.
Published: (2026)
Scaling Law for Language Models Training Considering Batch Size
by: Shuai, Xian, et al.
Published: (2024)
by: Shuai, Xian, et al.
Published: (2024)
When Should a Language Model Trust Itself? Same-Model Self-Verification as a Conditional Confidence Signal
by: Phalod, Aditya Ajay
Published: (2026)
by: Phalod, Aditya Ajay
Published: (2026)
DiffListener: Discrete Diffusion Model for Listener Generation
by: Jung, Siyeol, et al.
Published: (2025)
by: Jung, Siyeol, et al.
Published: (2025)
MetaState: Persistent Working Memory Enhances Reasoning in Discrete Diffusion Language Models
by: Xia, Kejing, et al.
Published: (2026)
by: Xia, Kejing, et al.
Published: (2026)
A Closer Look into Mixture-of-Experts in Large Language Models
by: Lo, Ka Man, et al.
Published: (2024)
by: Lo, Ka Man, et al.
Published: (2024)
In-Situ Tweedie Discrete Diffusion Models
by: Li, Xiao, et al.
Published: (2025)
by: Li, Xiao, et al.
Published: (2025)
Prompt Optimization Via Diffusion Language Models
by: Wang, Shiyu, et al.
Published: (2026)
by: Wang, Shiyu, et al.
Published: (2026)
Sequential Diffusion Language Models
by: Liu, Yangzhou, et al.
Published: (2025)
by: Liu, Yangzhou, et al.
Published: (2025)
GCondenser: Benchmarking Graph Condensation
by: Liu, Yilun, et al.
Published: (2024)
by: Liu, Yilun, et al.
Published: (2024)
Self-Distilled Reasoner: On-Policy Self-Distillation for Large Language Models
by: Zhao, Siyan, et al.
Published: (2026)
by: Zhao, Siyan, et al.
Published: (2026)
Similar Items
-
Text Meets Topology: Rethinking Out-of-distribution Detection in Text-Rich Networks
by: Wang, Danny, et al.
Published: (2025) -
Block-R1: Rethinking the Role of Block Size in Multi-domain Reinforcement Learning for Diffusion Large Language Models
by: Jiang, Yan, et al.
Published: (2026) -
TRN-R1-Zero: Text-rich Network Reasoning via LLMs with Reinforcement Learning Only
by: Liu, Yilun, et al.
Published: (2026) -
Break the Block: Dynamic-size Reasoning Blocks for Diffusion Large Language Models via Monotonic Entropy Descent with Reinforcement Learning
by: Jiang, Yan, et al.
Published: (2026) -
When Does a Language Model Commit? A Finite-Answer Theory of Pre-Verbalization Commitment
by: Zhang, Long, et al.
Published: (2026)