Saved in:
| Main Authors: | Wang, Yixiao, Jiang, Ting, Shao, Zishan, Ye, Hancheng, Sun, Jingwei, Ma, Mingyuan, Zhang, Jianyi, Chen, Yiran, Li, Hai |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.01552 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
SADA: Stability-guided Adaptive Diffusion Acceleration
by: Jiang, Ting, et al.
Published: (2025)
by: Jiang, Ting, et al.
Published: (2025)
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
by: Shao, Zishan, et al.
Published: (2025)
by: Shao, Zishan, et al.
Published: (2025)
CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation
by: Wang, Qinsi, et al.
Published: (2024)
by: Wang, Qinsi, et al.
Published: (2024)
CoreMatching: A Co-adaptive Sparse Inference Framework with Token and Neuron Pruning for Comprehensive Acceleration of Vision-Language Models
by: Wang, Qinsi, et al.
Published: (2025)
by: Wang, Qinsi, et al.
Published: (2025)
FlashSVD v1.5: Making Low-Rank Transformers Inference Actually Fast
by: Wu, Wenhao, et al.
Published: (2026)
by: Wu, Wenhao, et al.
Published: (2026)
Enhanced Cyclic Coordinate Descent Methods for Elastic Net Penalized Linear Models
by: Wang, Yixiao, et al.
Published: (2025)
by: Wang, Yixiao, et al.
Published: (2025)
KVCOMM: Online Cross-context KV-cache Communication for Efficient LLM-based Multi-agent Systems
by: Ye, Hancheng, et al.
Published: (2025)
by: Ye, Hancheng, et al.
Published: (2025)
PrivAct: Internalizing Contextual Privacy Preservation via Multi-Agent Preference Training
by: Cheng, Yuhan, et al.
Published: (2026)
by: Cheng, Yuhan, et al.
Published: (2026)
Keyframe-oriented Vision Token Pruning: Enhancing Efficiency of Large Vision Language Models on Long-Form Video Processing
by: Liu, Yudong, et al.
Published: (2025)
by: Liu, Yudong, et al.
Published: (2025)
Angles Don't Lie: Unlocking Training-Efficient RL Through the Model's Own Signals
by: Wang, Qinsi, et al.
Published: (2025)
by: Wang, Qinsi, et al.
Published: (2025)
DPad: Efficient Diffusion Language Models with Suffix Dropout
by: Chen, Xinhua, et al.
Published: (2025)
by: Chen, Xinhua, et al.
Published: (2025)
LoBAM: LoRA-Based Backdoor Attack on Model Merging
by: Yin, Ming, et al.
Published: (2024)
by: Yin, Ming, et al.
Published: (2024)
Min-K%++: Improved Baseline for Detecting Pre-Training Data from Large Language Models
by: Zhang, Jingyang, et al.
Published: (2024)
by: Zhang, Jingyang, et al.
Published: (2024)
ZEUS: Zero-shot Embeddings for Unsupervised Separation of Tabular Data
by: Marszałek, Patryk, et al.
Published: (2025)
by: Marszałek, Patryk, et al.
Published: (2025)
FlashFPS: Efficient Farthest Point Sampling for Large-Scale Point Clouds via Pruning and Caching
by: Fu, Yuzhe, et al.
Published: (2026)
by: Fu, Yuzhe, et al.
Published: (2026)
Distributionally Robust Optimization via Diffusion Ambiguity Modeling
by: Wen, Jiaqi, et al.
Published: (2025)
by: Wen, Jiaqi, et al.
Published: (2025)
SLED: Self Logits Evolution Decoding for Improving Factuality in Large Language Models
by: Zhang, Jianyi, et al.
Published: (2024)
by: Zhang, Jianyi, et al.
Published: (2024)
MLLM-LLaVA-FL: Multimodal Large Language Model Assisted Federated Learning
by: Zhang, Jianyi, et al.
Published: (2024)
by: Zhang, Jianyi, et al.
Published: (2024)
ZEUS EQUILIBRADO
by: Frederico Sabino
Published: (2019)
by: Frederico Sabino
Published: (2019)
IoT-MCP: Bridging LLMs and IoT Systems Through Model Context Protocol
by: Yang, Ningyuan, et al.
Published: (2025)
by: Yang, Ningyuan, et al.
Published: (2025)
SCoNE: Spherical Consistent Neighborhoods Ensemble for Effective and Efficient Multi-View Anomaly Detection
by: Xu, Yang, et al.
Published: (2025)
by: Xu, Yang, et al.
Published: (2025)
DLM-One: Diffusion Language Models for One-Step Sequence Generation
by: Chen, Tianqi, et al.
Published: (2025)
by: Chen, Tianqi, et al.
Published: (2025)
Scalable Dual Coordinate Descent for Kernel Methods
by: Shao, Zishan, et al.
Published: (2024)
by: Shao, Zishan, et al.
Published: (2024)
Calibration and Transformation-Free Weight-Only LLMs Quantization via Dynamic Grouping
by: Zheng, Xinzhe, et al.
Published: (2025)
by: Zheng, Xinzhe, et al.
Published: (2025)
Mitigating Non-IID Drift in Zeroth-Order Federated LLM Fine-Tuning with Transferable Sparsity
by: Ran, Yide, et al.
Published: (2025)
by: Ran, Yide, et al.
Published: (2025)
SpeechPrune: Context-aware Token Pruning for Speech Information Retrieval
by: Lin, Yueqian, et al.
Published: (2024)
by: Lin, Yueqian, et al.
Published: (2024)
SiDA-MoE: Sparsity-Inspired Data-Aware Serving for Efficient and Scalable Large Mixture-of-Experts Models
by: Du, Zhixu, et al.
Published: (2023)
by: Du, Zhixu, et al.
Published: (2023)
Improving Routability Prediction via NAS Using a Smooth One-shot Augmented Predictor
by: Sridhar, Arjun, et al.
Published: (2024)
by: Sridhar, Arjun, et al.
Published: (2024)
SeisFusion: Constrained Diffusion Model with Input Guidance for 3D Seismic Data Interpolation and Reconstruction
by: Wang, Shuang, et al.
Published: (2024)
by: Wang, Shuang, et al.
Published: (2024)
Score Forgetting Distillation: A Swift, Data-Free Method for Machine Unlearning in Diffusion Models
by: Chen, Tianqi, et al.
Published: (2024)
by: Chen, Tianqi, et al.
Published: (2024)
You Only Look One Step: Accelerating Backpropagation in Diffusion Sampling with Gradient Shortcuts
by: Dou, Hongkun, et al.
Published: (2025)
by: Dou, Hongkun, et al.
Published: (2025)
Catch-Up Distillation: You Only Need to Train Once for Accelerating Sampling
by: Shao, Shitong, et al.
Published: (2023)
by: Shao, Shitong, et al.
Published: (2023)
Advancing Deep Learning through Probability Engineering: A Pragmatic Paradigm for Modern AI
by: Zhang, Jianyi
Published: (2025)
by: Zhang, Jianyi
Published: (2025)
Policy Gradient with Second Order Momentum
by: Sun, Tianyu
Published: (2025)
by: Sun, Tianyu
Published: (2025)
Score identity Distillation: Exponentially Fast Distillation of Pretrained Diffusion Models for One-Step Generation
by: Zhou, Mingyuan, et al.
Published: (2024)
by: Zhou, Mingyuan, et al.
Published: (2024)
Score Distillation Beyond Acceleration: Generative Modeling from Corrupted Data
by: Zhang, Yasi, et al.
Published: (2025)
by: Zhang, Yasi, et al.
Published: (2025)
TAP: A Token-Adaptive Predictor Framework for Training-Free Diffusion Acceleration
by: Zhu, Haowei, et al.
Published: (2026)
by: Zhu, Haowei, et al.
Published: (2026)
PIDS: Joint Point Interaction-Dimension Search for 3D Point Cloud
by: Zhang, Tunhou, et al.
Published: (2022)
by: Zhang, Tunhou, et al.
Published: (2022)
Towards Intrinsically Calibrated Uncertainty Quantification in Industrial Data-Driven Models via Diffusion Sampler
by: Ma, Yiran, et al.
Published: (2026)
by: Ma, Yiran, et al.
Published: (2026)
MARS: Efficient, Adaptive Co-Scheduling for Heterogeneous Agentic Systems
by: Wang, Yifei, et al.
Published: (2026)
by: Wang, Yifei, et al.
Published: (2026)
Similar Items
-
SADA: Stability-guided Adaptive Diffusion Acceleration
by: Jiang, Ting, et al.
Published: (2025) -
FlashSVD: Memory-Efficient Inference with Streaming for Low-Rank Models
by: Shao, Zishan, et al.
Published: (2025) -
CoreInfer: Accelerating Large Language Model Inference with Semantics-Inspired Adaptive Sparse Activation
by: Wang, Qinsi, et al.
Published: (2024) -
CoreMatching: A Co-adaptive Sparse Inference Framework with Token and Neuron Pruning for Comprehensive Acceleration of Vision-Language Models
by: Wang, Qinsi, et al.
Published: (2025) -
FlashSVD v1.5: Making Low-Rank Transformers Inference Actually Fast
by: Wu, Wenhao, et al.
Published: (2026)