| Main Authors: | Hsieh, He-Yen, Wang, Hong, Kung, H. T. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.00670 |
Similar Items
dVoting: Fast Voting for dLLMs
by: Feng, Sicheng, et al.
Published: (2026)
DMax: Aggressive Parallel Decoding for dLLMs
by: Chen, Zigeng, et al.
Published: (2026)
Quantization Meets dLLMs: A Systematic Study of Post-training Quantization for Diffusion LLMs
by: Lin, Haokun, et al.
Published: (2025)
Every Step Counts: Decoding Trajectories as Authorship Fingerprints of dLLMs
by: Li, Qi, et al.
Published: (2025)
Rainbow Padding: Mitigating Early Termination in Instruction-Tuned Diffusion LLMs
by: Kim, Bumjun, et al.
Published: (2025)
SYNFAC-EDIT: Synthetic Imitation Edit Feedback for Factual Alignment in Clinical Summarization
by: Mishra, Prakamya, et al.
Published: (2024)
EDIT: Enhancing Vision Transformers by Mitigating Attention Sink through an Encoder-Decoder Architecture
by: Feng, Wenfeng, et al.
Published: (2025)
MX-SAFE: Versatile Inference- and Training-Proof Microscaling Format with On-the-Fly Exponent and Mantissa Bit Allocation
by: Park, Dahoon, et al.
Published: (2026)
HYPE-EDIT-1: Benchmark for Measuring Reliability in Frontier Image Editing Models
by: Chan, Wing, et al.
Published: (2026)
ES-dLLM: Efficient Inference for Diffusion Large Language Models by Early-Skipping
by: Zhu, Zijian, et al.
Published: (2026)
dParallel: Learnable Parallel Decoding for dLLMs
by: Chen, Zigeng, et al.
Published: (2025)
Training-Free Diffusion-Driven Modeling of Pareto Set Evolution for Dynamic Multiobjective Optimization
by: Guan, Jian, et al.
Published: (2026)
LLMs Are Prone to Fallacies in Causal Inference
by: Joshi, Nitish, et al.
Published: (2024)
Gen4Gen: Generative Data Pipeline for Generative Multi-Concept Composition
by: Yeh, Chun-Hsiao, et al.
Published: (2024)
dMoE: dLLMs with Learnable Block Experts
by: Feng, Sicheng, et al.
Published: (2026)
End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
by: Tan, Qitao, et al.
Published: (2025)
PQS (Prune, Quantize, and Sort): Low-Bitwidth Accumulation of Dot Products in Neural Network Computations
by: Natesh, Vikas, et al.
Published: (2025)
Diffusion Model-Based Multiobjective Optimization for Gasoline Blending Scheduling
by: Fang, Wenxuan, et al.
Published: (2024)
dInfer: An Efficient Inference Framework for Diffusion Language Models
by: Ma, Yuxin, et al.
Published: (2025)
Imbalanced Gradients in RL Post-Training of Multi-Task LLMs
by: Wu, Runzhe, et al.
Published: (2025)
GradPruner: Gradient-Guided Layer Pruning Enabling Efficient Fine-Tuning and Inference for LLMs
by: Huang, Wei, et al.
Published: (2026)
LiDAR-EDIT: LiDAR Data Generation by Editing the Object Layouts in Real-World Scenes
by: Ho, Shing-Hei, et al.
Published: (2024)
Dynamic Gradient Sparse Update for Edge Training
by: Li, I-Hsuan, et al.
Published: (2025)
Dynamic Sparse No Training: Training-Free Fine-tuning for Sparse LLMs
by: Zhang, Yuxin, et al.
Published: (2023)
Diffusion LLMs Can Do Faster-Than-AR Inference via Discrete Diffusion Forcing
by: Wang, Xu, et al.
Published: (2025)
Predict-then-Diffuse: Adaptive Response Length for Compute-Budgeted Inference in Diffusion LLMs
by: Rottoli, Michael, et al.
Published: (2026)
Trained Persistent Memory for Frozen Decoder-Only LLMs
by: Jeong, Hong
Published: (2026)
Continuous Approximations for Improving Quantization Aware Training of LLMs
by: Li, He, et al.
Published: (2024)
TerminalWorld: Benchmarking Agents on Real-World Terminal Tasks
by: Chu, Zhaoyang, et al.
Published: (2026)
MANATEE: Inference-Time Lightweight Diffusion Based Safety Defense for LLMs
by: Kan, Chun Yan Ryan, et al.
Published: (2026)
G-Drift MIA: Membership Inference via Gradient-Induced Feature Drift in LLMs
by: Ranjan, Ravi, et al.
Published: (2026)
GAC: Stabilizing Asynchronous RL Training for LLMs via Gradient Alignment Control
by: Xu, Haofeng, et al.
Published: (2026)
Encoder-Decoder Diffusion Language Models for Efficient Training and Inference
by: Arriola, Marianne, et al.
Published: (2025)
Sorted Weight Sectioning for Energy-Efficient Unstructured Sparse DNNs on Compute-in-Memory Crossbars
by: Farias, Matheus, et al.
Published: (2024)
Efficient Reprogramming of Memristive Crossbars for DNNs: Weight Sorting and Bit Stucking
by: Farias, Matheus, et al.
Published: (2024)
Dynamic Vocabulary Pruning in Early-Exit LLMs
by: Vincenti, Jort, et al.
Published: (2024)
One-Spike SNN: Single-Spike Phase Coding with Base Manipulation for ANN-to-SNN Conversion Loss Minimization
by: Hwang, Sangwoo, et al.
Published: (2024)
How to Train Data-Efficient LLMs
by: Sachdeva, Noveen, et al.
Published: (2024)
Is My Text in Your AI Model? Gradient-based Membership Inference Test applied to LLMs
by: Mancera, Gonzalo, et al.
Published: (2025)
Certification for Differentially Private Prediction in Gradient-Based Training
by: Wicker, Matthew, et al.
Published: (2024)