:: Library Catalog

Bibliographic Details
Main Authors:	Bai, Runsheng; Zhang, Chengyu; Deng, Yangdong
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.25872

Similar Items

Refining Adaptive Zeroth-Order Optimization at Ease
by: Shu, Yao, et al.
Published: (2025)

A Multi-Level Framework for Accelerating Training Transformer Models
by: Zou, Longwei, et al.
Published: (2024)

Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
by: Wu, Shutong, et al.
Published: (2025)

DEER: Draft with Diffusion, Verify with Autoregressive Models
by: Cheng, Zicong, et al.
Published: (2025)

AlignedKV: Reducing Memory Access of KV-Cache with Precision-Aligned Quantization
by: Tan, Yifan, et al.
Published: (2024)

Quantifying the Ease of Reproducing Training Data in Unconditional Diffusion Models
by: Hasegawa, Masaya, et al.
Published: (2025)

SKIM: Any-bit Quantization Pushing The Limits of Post-Training Quantization
by: Bai, Runsheng, et al.
Published: (2024)

P-EAGLE: Parallel-Drafting EAGLE with Scalable Training
by: Hui, Mude, et al.
Published: (2026)

DREAM-S: Speculative Decoding with Searchable Drafting and Target-Aware Refinement for Multimodal Generation
by: Liu, Zining, et al.
Published: (2026)

Exploring and Improving Drafts in Blockwise Parallel Decoding
by: Kim, Taehyeon, et al.
Published: (2024)

Entropy-MCMC: Sampling from Flat Basins with Ease
by: Li, Bolian, et al.
Published: (2023)

When Latent Geometry Is Not Enough: Draft-Conditioned Latent Refinement for Non-Autoregressive Text Generation
by: Zhang, De Shuai
Published: (2026)

Refiner: Data Refining against Gradient Leakage Attacks in Federated Learning
by: Fan, Mingyuan, et al.
Published: (2022)

Self-Refining Diffusion Samplers: Enabling Parallelization via Parareal Iterations
by: Selvam, Nikil Roashan, et al.
Published: (2024)

PARD: Accelerating LLM Inference with Low-Cost PARallel Draft Model Adaptation
by: An, Zihao, et al.
Published: (2025)

Provably Learning Diffusion Models under the Manifold Hypothesis: Collapse and Refine
by: Huang, Wei, et al.
Published: (2026)

D-PACE: Dynamic Position-Aware Cross-Entropy for Parallel Speculative Drafting
by: Wu, Tianyu, et al.
Published: (2026)

Reversible Unfolding Network for Concealed Visual Perception with Generative Refinement
by: He, Chunming, et al.
Published: (2025)

Weak-to-Strong Elicitation via Mismatched Wrong Drafts
by: Deng, Wei
Published: (2026)

Learning to Refine: Self-Refinement of Parallel Reasoning in LLMs
by: Wang, Qibin, et al.
Published: (2025)

ProPD: Dynamic Token Tree Pruning and Generation for LLM Parallel Decoding
by: Zhong, Shuzhang, et al.
Published: (2024)

Easing Optimization Paths: a Circuit Perspective
by: Odonnat, Ambroise, et al.
Published: (2025)

MineDraft: A Framework for Batch Parallel Speculative Decoding
by: Tang, Zhenwei, et al.
Published: (2026)

Accelerating Parallel Sampling of Diffusion Models
by: Tang, Zhiwei, et al.
Published: (2024)

Parallel Sampling of Diffusion Models on $SO(3)$
by: Chen, Yan-Ting, et al.
Published: (2025)

Differentiable Information Bottleneck for Deterministic Multi-view Clustering
by: Yan, Xiaoqiang, et al.
Published: (2024)

EAGLE-2: Faster Inference of Language Models with Dynamic Draft Trees
by: Li, Yuhui, et al.
Published: (2024)

Towards Cold-Start Drafting and Continual Refining: A Value-Driven Memory Approach with Application to NPU Kernel Synthesis
by: Zheng, Yujie, et al.
Published: (2026)

Orchestrating Dual-Boundaries: An Arithmetic Intensity Inspired Acceleration Framework for Diffusion Language Models
by: Wei, Linye, et al.
Published: (2025)

ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs
by: Kang, Wonjun, et al.
Published: (2025)

Tailed Low-Rank Matrix Factorization for Similarity Matrix Completion
by: Ma, Changyi, et al.
Published: (2024)

AEGPO: Adaptive Entropy-Guided Policy Optimization for Diffusion Models
by: Li, Yuming, et al.
Published: (2026)

Refining Alignment Framework for Diffusion Models with Intermediate-Step Preference Ranking
by: Ren, Jie, et al.
Published: (2025)

ADiff4TPP: Asynchronous Diffusion Models for Temporal Point Processes
by: Mukherjee, Amartya, et al.
Published: (2025)

Particle Dynamics for Latent-Variable Energy-Based Models
by: Tang, Shiqin, et al.
Published: (2025)

Generation Order and Parallel Decoding in Masked Diffusion Models: An Information-Theoretic Perspective
by: Zhang, Shaorong, et al.
Published: (2026)

Backdooring Masked Diffusion Language Models
by: Cao, Daniel Yiming, et al.
Published: (2026)

Efficiently Aligning Draft Models via Parameter- and Data-Efficient Adaptation
by: Lin, Luxi, et al.
Published: (2026)

Diffusion Models: A Comprehensive Survey of Methods and Applications
by: Yang, Ling, et al.
Published: (2022)

Review, Remask, Refine (R3): Process-Guided Block Diffusion for Text Generation
by: Mounier, Nikita, et al.
Published: (2025)