Saved in:
| Main Authors: | Wan, Ben, Zheng, Tianyi, Chen, Zhaoyu, Wang, Yuxiao, Wang, Jia |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2501.09464 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Sink-Aware Pruning for Diffusion Language Models
by: Myrzakhan, Aidar, et al.
Published: (2026)
by: Myrzakhan, Aidar, et al.
Published: (2026)
Analysis of Variational Sparse Autoencoders
by: Baker, Zachary, et al.
Published: (2025)
by: Baker, Zachary, et al.
Published: (2025)
Rethinking the Diffusion Models for Numerical Tabular Data Imputation from the Perspective of Wasserstein Gradient Flow
by: Chen, Zhichao, et al.
Published: (2024)
by: Chen, Zhichao, et al.
Published: (2024)
On Penalty-based Bilevel Gradient Descent Method
by: Shen, Han, et al.
Published: (2023)
by: Shen, Han, et al.
Published: (2023)
A Gradient Flow Approach to Solving Inverse Problems with Latent Diffusion Models
by: Wang, Tim Y. J., et al.
Published: (2025)
by: Wang, Tim Y. J., et al.
Published: (2025)
SparseDM: Toward Sparse Efficient Diffusion Models
by: Wang, Kafeng, et al.
Published: (2024)
by: Wang, Kafeng, et al.
Published: (2024)
Data Pruning in Generative Diffusion Models
by: Briq, Rania, et al.
Published: (2024)
by: Briq, Rania, et al.
Published: (2024)
Wasserstein Proximal Policy Gradient
by: Zhu, Zhaoyu, et al.
Published: (2026)
by: Zhu, Zhaoyu, et al.
Published: (2026)
FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies
by: Gao, Chenxiao, et al.
Published: (2026)
by: Gao, Chenxiao, et al.
Published: (2026)
SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot
by: Tuo, Kaiwen, et al.
Published: (2025)
by: Tuo, Kaiwen, et al.
Published: (2025)
Compact SO(3) Equivariant Atomistic Foundation Models via Structural Pruning
by: Wang, Chen, et al.
Published: (2026)
by: Wang, Chen, et al.
Published: (2026)
ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
by: Meng, Xiang, et al.
Published: (2024)
by: Meng, Xiang, et al.
Published: (2024)
Global Convergence of Wasserstein Policy Gradient for Entropy-Regularized Reinforcement Learning
by: Zhu, Zhaoyu, et al.
Published: (2026)
by: Zhu, Zhaoyu, et al.
Published: (2026)
Gradient Guidance for Diffusion Models: An Optimization Perspective
by: Guo, Yingqing, et al.
Published: (2024)
by: Guo, Yingqing, et al.
Published: (2024)
A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
by: Chowdhury, Mohammed Nowaz Rabbani, et al.
Published: (2024)
by: Chowdhury, Mohammed Nowaz Rabbani, et al.
Published: (2024)
Optimal Defenses Against Gradient Reconstruction Attacks
by: Chen, Yuxiao, et al.
Published: (2024)
by: Chen, Yuxiao, et al.
Published: (2024)
Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models
by: Yang, Ruofeng, et al.
Published: (2025)
by: Yang, Ruofeng, et al.
Published: (2025)
Window-Diffusion: Accelerating Diffusion Language Model Inference with Windowed Token Pruning and Caching
by: Zuo, Fengrui, et al.
Published: (2026)
by: Zuo, Fengrui, et al.
Published: (2026)
InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context
by: Teng, Xin, et al.
Published: (2026)
by: Teng, Xin, et al.
Published: (2026)
ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning
by: Su, Mingluo, et al.
Published: (2026)
by: Su, Mingluo, et al.
Published: (2026)
Gradient Flow Sampler-based Distributionally Robust Optimization
by: Xu, Zusen, et al.
Published: (2025)
by: Xu, Zusen, et al.
Published: (2025)
Reconstruct the Pruned Model without Any Retraining
by: Wang, Pingjie, et al.
Published: (2024)
by: Wang, Pingjie, et al.
Published: (2024)
Action Dubber: Timing Audible Actions via Inflectional Flow
by: Wan, Wenlong, et al.
Published: (2025)
by: Wan, Wenlong, et al.
Published: (2025)
AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent
by: Liu, Jing, et al.
Published: (2025)
by: Liu, Jing, et al.
Published: (2025)
GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection
by: Chen, Jinggang, et al.
Published: (2023)
by: Chen, Jinggang, et al.
Published: (2023)
Sparse Additive Model Pruning for Order-Based Causal Structure Learning
by: Kanamori, Kentaro, et al.
Published: (2026)
by: Kanamori, Kentaro, et al.
Published: (2026)
Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
by: Jolicoeur-Martineau, Alexia, et al.
Published: (2023)
by: Jolicoeur-Martineau, Alexia, et al.
Published: (2023)
DLM-Scope: Mechanistic Interpretability of Diffusion Language Models via Sparse Autoencoders
by: Wang, Xu, et al.
Published: (2026)
by: Wang, Xu, et al.
Published: (2026)
Wanda++: Pruning Large Language Models via Regional Gradients
by: Yang, Yifan, et al.
Published: (2025)
by: Yang, Yifan, et al.
Published: (2025)
Sparse Gradient Compression for Fine-Tuning Large Language Models
by: Yang, David H., et al.
Published: (2025)
by: Yang, David H., et al.
Published: (2025)
The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models
by: Xiao, Yang, et al.
Published: (2025)
by: Xiao, Yang, et al.
Published: (2025)
One-Step Flow Policy Mirror Descent
by: Chen, Tianyi, et al.
Published: (2025)
by: Chen, Tianyi, et al.
Published: (2025)
Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
by: Liu, Enshu, et al.
Published: (2024)
by: Liu, Enshu, et al.
Published: (2024)
Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks
by: Zhang, Junhe, et al.
Published: (2024)
by: Zhang, Junhe, et al.
Published: (2024)
Neural Sinkhorn Gradient Flow
by: Zhu, Huminhao, et al.
Published: (2024)
by: Zhu, Huminhao, et al.
Published: (2024)
GARDO: Reinforcing Diffusion Models without Reward Hacking
by: He, Haoran, et al.
Published: (2025)
by: He, Haoran, et al.
Published: (2025)
S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
by: Hu, Yuezhou, et al.
Published: (2024)
by: Hu, Yuezhou, et al.
Published: (2024)
Pruning is Optimal for Learning Sparse Features in High-Dimensions
by: Vural, Nuri Mert, et al.
Published: (2024)
by: Vural, Nuri Mert, et al.
Published: (2024)
Unlearning Backdoor Attacks through Gradient-Based Model Pruning
by: Dunnett, Kealan, et al.
Published: (2024)
by: Dunnett, Kealan, et al.
Published: (2024)
ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models
by: Li, Xu, et al.
Published: (2026)
by: Li, Xu, et al.
Published: (2026)
Similar Items
-
Sink-Aware Pruning for Diffusion Language Models
by: Myrzakhan, Aidar, et al.
Published: (2026) -
Analysis of Variational Sparse Autoencoders
by: Baker, Zachary, et al.
Published: (2025) -
Rethinking the Diffusion Models for Numerical Tabular Data Imputation from the Perspective of Wasserstein Gradient Flow
by: Chen, Zhichao, et al.
Published: (2024) -
On Penalty-based Bilevel Gradient Descent Method
by: Shen, Han, et al.
Published: (2023) -
A Gradient Flow Approach to Solving Inverse Problems with Latent Diffusion Models
by: Wang, Tim Y. J., et al.
Published: (2025)