Saved in:
| Main Authors: | Zhang, Shuibai, Peng, Fred Zhangzhi, Zhang, Yiheng, Pan, Jin, Chrysos, Grigorios G. |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.15596 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Planner Aware Path Learning in Diffusion Language Models Training
by: Peng, Fred Zhangzhi, et al.
Published: (2025)
by: Peng, Fred Zhangzhi, et al.
Published: (2025)
Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models
by: Zhang, Shuibai, et al.
Published: (2026)
by: Zhang, Shuibai, et al.
Published: (2026)
Don't Retrain, Align: Adapting Autoregressive LMs to Diffusion LMs via Representation Alignment
by: Peng, Fred Zhangzhi, et al.
Published: (2026)
by: Peng, Fred Zhangzhi, et al.
Published: (2026)
Activation-Free Backbones for Image Recognition: Polynomial Alternatives within MetaFormer-Style Vision Models
by: Wang, Jeffrey, et al.
Published: (2026)
by: Wang, Jeffrey, et al.
Published: (2026)
Mitigating Diffusion Model Hallucinations with Dynamic Guidance
by: Triaridis, Kostas, et al.
Published: (2025)
by: Triaridis, Kostas, et al.
Published: (2025)
Coupling Models for One-Step Discrete Generation
by: Peng, Fred Zhangzhi, et al.
Published: (2026)
by: Peng, Fred Zhangzhi, et al.
Published: (2026)
Revisiting Character-level Adversarial Attacks for Language Models
by: Rocamora, Elias Abad, et al.
Published: (2024)
by: Rocamora, Elias Abad, et al.
Published: (2024)
Certified Robustness Under Bounded Levenshtein Distance
by: Rocamora, Elias Abad, et al.
Published: (2025)
by: Rocamora, Elias Abad, et al.
Published: (2025)
Single-pass Detection of Jailbreaking Input in Large Language Models
by: Candogan, Leyla Naz, et al.
Published: (2025)
by: Candogan, Leyla Naz, et al.
Published: (2025)
LJ-Bench: Ontology-Based Benchmark for U.S. Crime
by: Tseng, Hung Yun, et al.
Published: (2026)
by: Tseng, Hung Yun, et al.
Published: (2026)
Leveraging the Context through Multi-Round Interactions for Jailbreaking Attacks
by: Cheng, Yixin, et al.
Published: (2024)
by: Cheng, Yixin, et al.
Published: (2024)
Multilinear Operator Networks
by: Cheng, Yixin, et al.
Published: (2024)
by: Cheng, Yixin, et al.
Published: (2024)
REST: Efficient and Accelerated EEG Seizure Analysis through Residual State Updates
by: Afzal, Arshia, et al.
Published: (2024)
by: Afzal, Arshia, et al.
Published: (2024)
Hadamard product in deep learning: Introduction, Advances and Challenges
by: Chrysos, Grigorios G, et al.
Published: (2025)
by: Chrysos, Grigorios G, et al.
Published: (2025)
Generalization of Scaled Deep ResNets in the Mean-Field Regime
by: Chen, Yihang, et al.
Published: (2024)
by: Chen, Yihang, et al.
Published: (2024)
Learning to Remove Cuts in Integer Linear Programming
by: Puigdemont, Pol, et al.
Published: (2024)
by: Puigdemont, Pol, et al.
Published: (2024)
The Last Mile to Supervised Performance: Semi-Supervised Domain Adaptation for Semantic Segmentation
by: Morales-Brotons, Daniel, et al.
Published: (2024)
by: Morales-Brotons, Daniel, et al.
Published: (2024)
Path Planning for Masked Diffusion Model Sampling
by: Peng, Fred Zhangzhi, et al.
Published: (2025)
by: Peng, Fred Zhangzhi, et al.
Published: (2025)
Robust NAS under adversarial training: benchmark, theory, and beyond
by: Wu, Yongtao, et al.
Published: (2024)
by: Wu, Yongtao, et al.
Published: (2024)
Towards Interpretability Without Sacrifice: Faithful Dense Layer Decomposition with Mixture of Decoders
by: Oldfield, James, et al.
Published: (2025)
by: Oldfield, James, et al.
Published: (2025)
Efficient local linearity regularization to overcome catastrophic overfitting
by: Rocamora, Elias Abad, et al.
Published: (2024)
by: Rocamora, Elias Abad, et al.
Published: (2024)
Continuous Diffusion Scales Competitively with Discrete Diffusion for Language
by: Yang, Zhihan, et al.
Published: (2026)
by: Yang, Zhihan, et al.
Published: (2026)
Why DDIM Hallucinates More Than DDPM: A Theoretical Analysis of Reverse Dynamics
by: Ashiq, Muhammad H., et al.
Published: (2026)
by: Ashiq, Muhammad H., et al.
Published: (2026)
ConTSG-Bench: A Unified Benchmark for Conditional Time Series Generation
by: Lan, Shaocheng, et al.
Published: (2026)
by: Lan, Shaocheng, et al.
Published: (2026)
Multilinear Mixture of Experts: Scalable Expert Specialization through Factorization
by: Oldfield, James, et al.
Published: (2024)
by: Oldfield, James, et al.
Published: (2024)
Diffusion Models Preferentially Memorize Prototypical Examples or: Why Does My Diffusion Model Love Slop?
by: Rodriguez, Marta Aparicio, et al.
Published: (2026)
by: Rodriguez, Marta Aparicio, et al.
Published: (2026)
Exploring the Potential of Probabilistic Transformer for Time Series Modeling: A Report on the ST-PT Framework
by: Xiong, Zhangzhi, et al.
Published: (2026)
by: Xiong, Zhangzhi, et al.
Published: (2026)
Steering Masked Discrete Diffusion Models via Discrete Denoising Posterior Prediction
by: Rector-Brooks, Jarrid, et al.
Published: (2024)
by: Rector-Brooks, Jarrid, et al.
Published: (2024)
D2ACE: Multi-Label Batch Selection Guided by Dual Dynamics and Adaptive Correlation Enhancement
by: Liu, Bin, et al.
Published: (2026)
by: Liu, Bin, et al.
Published: (2026)
ParallelBench: Understanding the Trade-offs of Parallel Decoding in Diffusion LLMs
by: Kang, Wonjun, et al.
Published: (2025)
by: Kang, Wonjun, et al.
Published: (2025)
Is Your Diffusion Sampler Actually Correct? A Sampler-Centric Evaluation of Discrete Diffusion Language Models
by: Tang, Luhan, et al.
Published: (2026)
by: Tang, Luhan, et al.
Published: (2026)
Inducing Uncertainty on Open-Weight Models for Test-Time Privacy in Image Recognition
by: Ashiq, Muhammad H., et al.
Published: (2025)
by: Ashiq, Muhammad H., et al.
Published: (2025)
Unlocking the Power of Diffusion Models in Sequential Recommendation: A Simple and Effective Approach
by: Chen, Jialei, et al.
Published: (2025)
by: Chen, Jialei, et al.
Published: (2025)
BackPlay: Head-Only Look-Back Self-Correction for Diffusion Language Models
by: Liu, Liming, et al.
Published: (2026)
by: Liu, Liming, et al.
Published: (2026)
Towards Best Practices of Activation Patching in Language Models: Metrics and Methods
by: Zhang, Fred, et al.
Published: (2023)
by: Zhang, Fred, et al.
Published: (2023)
Multi-Label Adaptive Batch Selection by Highlighting Hard and Imbalanced Samples
by: Zhou, Ao, et al.
Published: (2024)
by: Zhou, Ao, et al.
Published: (2024)
Batch Selection for Multi-Label Classification Guided by Uncertainty and Dynamic Label Correlations
by: Zhou, Ao, et al.
Published: (2024)
by: Zhou, Ao, et al.
Published: (2024)
Cross-Domain Pre-training with Language Models for Transferable Time Series Representations
by: Cheng, Mingyue, et al.
Published: (2024)
by: Cheng, Mingyue, et al.
Published: (2024)
Evo: Autoregressive-Diffusion Large Language Models with Evolving Balance
by: Wu, Junde, et al.
Published: (2026)
by: Wu, Junde, et al.
Published: (2026)
Advancing Time Series Classification with Multimodal Language Modeling
by: Cheng, Mingyue, et al.
Published: (2024)
by: Cheng, Mingyue, et al.
Published: (2024)
Similar Items
-
Planner Aware Path Learning in Diffusion Language Models Training
by: Peng, Fred Zhangzhi, et al.
Published: (2025) -
Expert-Choice Routing Enables Adaptive Computation in Diffusion Language Models
by: Zhang, Shuibai, et al.
Published: (2026) -
Don't Retrain, Align: Adapting Autoregressive LMs to Diffusion LMs via Representation Alignment
by: Peng, Fred Zhangzhi, et al.
Published: (2026) -
Activation-Free Backbones for Image Recognition: Polynomial Alternatives within MetaFormer-Style Vision Models
by: Wang, Jeffrey, et al.
Published: (2026) -
Mitigating Diffusion Model Hallucinations with Dynamic Guidance
by: Triaridis, Kostas, et al.
Published: (2025)