:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wan, Ben, Zheng, Tianyi, Chen, Zhaoyu, Wang, Yuxiao, Wang, Jia
Format:	Preprint
Published:	2025
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2501.09464
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Sink-Aware Pruning for Diffusion Language Models
by: Myrzakhan, Aidar, et al.
Published: (2026)

Analysis of Variational Sparse Autoencoders
by: Baker, Zachary, et al.
Published: (2025)

Rethinking the Diffusion Models for Numerical Tabular Data Imputation from the Perspective of Wasserstein Gradient Flow
by: Chen, Zhichao, et al.
Published: (2024)

On Penalty-based Bilevel Gradient Descent Method
by: Shen, Han, et al.
Published: (2023)

A Gradient Flow Approach to Solving Inverse Problems with Latent Diffusion Models
by: Wang, Tim Y. J., et al.
Published: (2025)

SparseDM: Toward Sparse Efficient Diffusion Models
by: Wang, Kafeng, et al.
Published: (2024)

Data Pruning in Generative Diffusion Models
by: Briq, Rania, et al.
Published: (2024)

Wasserstein Proximal Policy Gradient
by: Zhu, Zhaoyu, et al.
Published: (2026)

FlowRL: A Taxonomy and Modular Framework for Reinforcement Learning with Diffusion Policies
by: Gao, Chenxiao, et al.
Published: (2026)

SparseSSM: Efficient Selective Structured State Space Models Can Be Pruned in One-Shot
by: Tuo, Kaiwen, et al.
Published: (2025)

Compact SO(3) Equivariant Atomistic Foundation Models via Structural Pruning
by: Wang, Chen, et al.
Published: (2026)

ALPS: Improved Optimization for Highly Sparse One-Shot Pruning for Large Language Models
by: Meng, Xiang, et al.
Published: (2024)

Global Convergence of Wasserstein Policy Gradient for Entropy-Regularized Reinforcement Learning
by: Zhu, Zhaoyu, et al.
Published: (2026)

Gradient Guidance for Diffusion Models: An Optimization Perspective
by: Guo, Yingqing, et al.
Published: (2024)

A Provably Effective Method for Pruning Experts in Fine-tuned Sparse Mixture-of-Experts
by: Chowdhury, Mohammed Nowaz Rabbani, et al.
Published: (2024)

Optimal Defenses Against Gradient Reconstruction Attacks
by: Chen, Yuxiao, et al.
Published: (2024)

Elucidating Rectified Flow with Deterministic Sampler: Polynomial Discretization Complexity for Multi and One-step Models
by: Yang, Ruofeng, et al.
Published: (2025)

Window-Diffusion: Accelerating Diffusion Language Model Inference with Windowed Token Pruning and Caching
by: Zuo, Fengrui, et al.
Published: (2026)

InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context
by: Teng, Xin, et al.
Published: (2026)

ROSE: Reordered SparseGPT for More Accurate One-Shot Large Language Models Pruning
by: Su, Mingluo, et al.
Published: (2026)

Gradient Flow Sampler-based Distributionally Robust Optimization
by: Xu, Zusen, et al.
Published: (2025)

Reconstruct the Pruned Model without Any Retraining
by: Wang, Pingjie, et al.
Published: (2024)

Action Dubber: Timing Audible Actions via Inflectional Flow
by: Wan, Wenlong, et al.
Published: (2025)

AWP: Activation-Aware Weight Pruning and Quantization with Projected Gradient Descent
by: Liu, Jing, et al.
Published: (2025)

GAIA: Delving into Gradient-based Attribution Abnormality for Out-of-distribution Detection
by: Chen, Jinggang, et al.
Published: (2023)

Sparse Additive Model Pruning for Order-Based Causal Structure Learning
by: Kanamori, Kentaro, et al.
Published: (2026)

Generating and Imputing Tabular Data via Diffusion and Flow-based Gradient-Boosted Trees
by: Jolicoeur-Martineau, Alexia, et al.
Published: (2023)

DLM-Scope: Mechanistic Interpretability of Diffusion Language Models via Sparse Autoencoders
by: Wang, Xu, et al.
Published: (2026)

Wanda++: Pruning Large Language Models via Regional Gradients
by: Yang, Yifan, et al.
Published: (2025)

Sparse Gradient Compression for Fine-Tuning Large Language Models
by: Yang, David H., et al.
Published: (2025)

The Right to be Forgotten in Pruning: Unveil Machine Unlearning on Sparse Models
by: Xiao, Yang, et al.
Published: (2025)

One-Step Flow Policy Mirror Descent
by: Chen, Tianyi, et al.
Published: (2025)

Efficient Expert Pruning for Sparse Mixture-of-Experts Language Models: Enhancing Performance and Reducing Inference Costs
by: Liu, Enshu, et al.
Published: (2024)

Federated Split Learning with Model Pruning and Gradient Quantization in Wireless Networks
by: Zhang, Junhe, et al.
Published: (2024)

Neural Sinkhorn Gradient Flow
by: Zhu, Huminhao, et al.
Published: (2024)

GARDO: Reinforcing Diffusion Models without Reward Hacking
by: He, Haoran, et al.
Published: (2025)

S-STE: Continuous Pruning Function for Efficient 2:4 Sparse Pre-training
by: Hu, Yuezhou, et al.
Published: (2024)

Pruning is Optimal for Learning Sparse Features in High-Dimensions
by: Vural, Nuri Mert, et al.
Published: (2024)

Unlearning Backdoor Attacks through Gradient-Based Model Pruning
by: Dunnett, Kealan, et al.
Published: (2024)

ResPrune: Text-Conditioned Subspace Reconstruction for Visual Token Pruning in Large Vision-Language Models
by: Li, Xu, et al.
Published: (2026)