| Main Authors: | Luo, Lizhuo; Li, Shenggui; Wen, Yonggang; Zhang, Tianwei |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2602.05992 |
Similar Items
DAWN: Dependency-Aware Fast Inference for Diffusion LLMs
by: Luo, Lizhuo, et al.
Published: (2026)
SPPO: Efficient Long-sequence LLM Training via Adaptive Sequence Pipeline Parallel Offloading
by: Chen, Qiaoling, et al.
Published: (2025)
AsyncDSB: Schedule-Asynchronous Diffusion Schrödinger Bridge for Image Inpainting
by: Han, Zihao, et al.
Published: (2024)
SpecForge: A Flexible and Efficient Open-Source Training Framework for Speculative Decoding
by: Li, Shenggui, et al.
Published: (2026)
Triplet-Block Diffusion RWKV
by: Lin, Ke, et al.
Published: (2026)
ReSpec: Towards Optimizing Speculative Decoding in Reinforcement Learning Systems
by: Chen, Qiaoling, et al.
Published: (2025)
Inference-time Alignment via Sparse Junction Steering
by: Hu, Runyi, et al.
Published: (2026)
CONCUR: High-Throughput Agentic Batch Inference of LLM via Congestion-Based Concurrency Control
by: Chen, Qiaoling, et al.
Published: (2026)
From Next-Token to Next-Block: A Principled Adaptation Path for Diffusion LLMs
by: Tian, Yuchuan, et al.
Published: (2025)
Understanding the Dark Side of LLMs' Intrinsic Self-Correction
by: Zhang, Qingjie, et al.
Published: (2024)
Fast-dLLM v2: Efficient Block-Diffusion LLM
by: Wu, Chengyue, et al.
Published: (2025)
Speculating LLMs' Chinese Training Data Pollution from Their Tokens
by: Zhang, Qingjie, et al.
Published: (2025)
Semantic-Aware Scheduling for GPU Clusters with Large Language Models
by: Wang, Zerui, et al.
Published: (2025)
Thinking Inside the Mask: In-Place Prompting in Diffusion LLMs
by: Jin, Xiangqi, et al.
Published: (2025)
DFlare: Scaling Up Draft Capacity for Block Diffusion Speculative Decoding
by: Zhang, Jiebin, et al.
Published: (2026)
Sparse-dLLM: Accelerating Diffusion LLMs with Dynamic Cache Eviction
by: Song, Yuerong, et al.
Published: (2025)
Swordsman: Entropy-Driven Adaptive Block Partition for Efficient Diffusion Language Models
by: Zhang, Yu, et al.
Published: (2026)
Efficient Many-Shot In-Context Learning with Dynamic Block-Sparse Attention
by: Xiao, Emily, et al.
Published: (2025)
SlidesGen-Bench: Evaluating Slides Generation via Computational and Quantitative Metrics
by: Yang, Yunqiao, et al.
Published: (2026)
Reinforcement Learning Enhanced LLMs: A Survey
by: Wang, Shuhe, et al.
Published: (2024)
Automatic Slide Updating with User-Defined Dynamic Templates and Natural Language Instructions
by: Zhou, Kun, et al.
Published: (2026)
SpecBlock: Block-Iterative Speculative Decoding with Dynamic Tree Drafting
by: Shi, Weijie, et al.
Published: (2026)
DFlash: Block Diffusion for Flash Speculative Decoding
by: Chen, Jian, et al.
Published: (2026)
Advancing Block Diffusion Language Models for Test-Time Scaling
by: Lu, Yi, et al.
Published: (2026)
Avoiding Overthinking and Underthinking: Curriculum-Aware Budget Scheduling for LLMs
by: Rahman, Amirul, et al.
Published: (2026)
DiffCoT: Diffusion-styled Chain-of-Thought Reasoning in LLMs
by: Cao, Shidong, et al.
Published: (2026)
Picky LLMs and Unreliable RMs: An Empirical Study on Safety Alignment after Instruction Tuning
by: Li, Guanlin, et al.
Published: (2025)
Fast-Decoding Diffusion Language Models via Progress-Aware Confidence Schedules
by: Mohamed, Amr, et al.
Published: (2025)
dMoE: dLLMs with Learnable Block Experts
by: Feng, Sicheng, et al.
Published: (2026)
The Devil behind the mask: An emergent safety vulnerability of Diffusion LLMs
by: Wen, Zichen, et al.
Published: (2025)
Accelerating Speculative Decoding with Block Diffusion Draft Trees
by: Ringel, Liran, et al.
Published: (2026)
Cost-Efficient RAG for Entity Matching with LLMs: A Blocking-based Exploration
by: Ma, Chuangtao, et al.
Published: (2026)
GliDe with a CaPE: A Low-Hassle Method to Accelerate Speculative Decoding
by: Du, Cunxiao, et al.
Published: (2024)
FlashBlock: Attention Caching for Efficient Long-Context Block Diffusion
by: Chen, Zhuokun, et al.
Published: (2026)
Draft-Thinking: Learning Efficient Reasoning in Long Chain-of-Thought LLMs
by: Cao, Jie, et al.
Published: (2026)
D$^3$: Dynamic Directional Graph-Constrained Data Scheduling for LLM Training
by: Xu, Yuanjian, et al.
Published: (2026)
CtrlDiff: Boosting Large Diffusion Language Models with Dynamic Block Prediction and Controllable Generation
by: Huang, Chihan, et al.
Published: (2025)
Internal Chain-of-Thought: Empirical Evidence for Layer-wise Subtask Scheduling in LLMs
by: Yang, Zhipeng, et al.
Published: (2025)
EBFT: Effective and Block-Wise Fine-Tuning for Sparse LLMs
by: Guo, Song, et al.
Published: (2024)
GeoBlock: Inferring Block Granularity from Dependency Geometry in Diffusion Language Models
by: Wan, Lipeng, et al.
Published: (2026)