:: Library Catalog

Bibliographic Details
Main Authors:	Jin, Meiling; Wang, Fei; Yuan, Xiaoyun; Qian, Chen; Cheng, Yuan
Format:	Preprint
Published:	2026
Subjects:	Machine Learning; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2602.04166

Similar Items

WiSparse: Boosting LLM Inference Efficiency with Weight-Aware Mixed Activation Sparsity
by: Chen, Lei, et al.
Published: (2026)

Topology-Aware and Highly Generalizable Deep Reinforcement Learning for Efficient Retrieval in Multi-Deep Storage Systems
by: Li, Funing, et al.
Published: (2025)

BudgetDraft: Acceptance-Aware Multi-View Training for Sparse-KV Speculative Decoding
by: He, Liang, et al.
Published: (2026)

Towards Heterogeneity-Aware and Energy-Efficient Topology Optimization for Decentralized Federated Learning in Edge Environment
by: Liu, Yuze, et al.
Published: (2025)

TED: Training-Free Experience Distillation for Multimodal Reasoning
by: Yuan, Shuozhi, et al.
Published: (2026)

On Effectiveness and Efficiency of Agentic Tool-calling and RL Training
by: Liu, Tong, et al.
Published: (2026)

SparseDM: Toward Sparse Efficient Diffusion Models
by: Wang, Kafeng, et al.
Published: (2024)

HASTE: Hardware-Aware Dynamic Sparse Training for Large Output Spaces
by: Ullah, Nasib, et al.
Published: (2026)

DistrAttention: An Efficient and Flexible Self-Attention Mechanism on Modern GPUs
by: Jin, Haolin, et al.
Published: (2025)

EcoSpa: Efficient Transformer Training with Coupled Sparsity
by: Xiao, Jinqi, et al.
Published: (2025)

End-to-End On-Device Quantization-Aware Training for LLMs at Inference Cost
by: Tan, Qitao, et al.
Published: (2025)

Efficient Network Automatic Relevance Determination
by: Zhang, Hongwei, et al.
Published: (2025)

Data Warmup: Complexity-Aware Curricula for Efficient Diffusion Training
by: Lin, Jinhong, et al.
Published: (2026)

HATA: Trainable and Hardware-Efficient Hash-Aware Top-k Attention for Scalable Large Model Inference
by: Gong, Ping, et al.
Published: (2025)

EfficientQAT: Efficient Quantization-Aware Training for Large Language Models
by: Chen, Mengzhao, et al.
Published: (2024)

Magnitude-Modulated Equivariant Adapter for Parameter-Efficient Fine-Tuning of Equivariant Graph Neural Networks
by: Jin, Dian, et al.
Published: (2025)

MOSS: Efficient and Accurate FP8 LLM Training with Microscaling and Automatic Scaling
by: Zhang, Yu, et al.
Published: (2025)

Efficient Equivariant High-Order Crystal Tensor Prediction via Cartesian Local-Environment Many-Body Coupling
by: Jin, Dian, et al.
Published: (2026)

SlimPipe: Memory-Thrifty and Efficient Pipeline Parallelism for Long-Context LLM Training
by: Li, Zhouyang, et al.
Published: (2025)

Efficient Edge LLMs Deployment via HessianAware Quantization and CPU GPU Collaborative
by: Zhang, Tuo, et al.
Published: (2025)

Towards Interpretable Adversarial Examples via Sparse Adversarial Attack
by: Lin, Fudong, et al.
Published: (2025)

Online Prompt Pricing based on Combinatorial Multi-Armed Bandit and Hierarchical Stackelberg Game
by: Li, Meiling, et al.
Published: (2024)

Gradient-Congruity Guided Federated Sparse Training
by: Tian, Chris Xing, et al.
Published: (2024)

SparseBalance: Load-Balanced Long Context Training with Dynamic Sparse Attention
by: Xu, Hongtao, et al.
Published: (2026)

HPO: Hysteretic Policy Optimization for Stable and Efficient Training under Sparse-Reward Regime
by: Sana, Mohamed, et al.
Published: (2026)

Reviving ConvNeXt for Efficient Convolutional Diffusion Models
by: Kwon, Taesung, et al.
Published: (2026)

ULTHO: Ultra-Lightweight yet Efficient Hyperparameter Optimization in Deep Reinforcement Learning
by: Yuan, Mingqi, et al.
Published: (2025)

ProTrain: Efficient LLM Training via Memory-Aware Techniques
by: Yang, Hanmei, et al.
Published: (2024)

Sparse-VQ Transformer: An FFN-Free Framework with Vector Quantization for Enhanced Time Series Forecasting
by: Zhao, Yanjun, et al.
Published: (2024)

Data-Efficient Training by Evolved Sampling
by: Cheng, Ziheng, et al.
Published: (2025)

Optimal Corpus Aware Training for Neural Machine Translation
by: Liao, Yi-Hsiu, et al.
Published: (2025)

Enhancing the Resilience of Graph Neural Networks to Topological Perturbations in Sparse Graphs
by: He, Shuqi, et al.
Published: (2024)

An Efficient Hybrid Sparse Attention with CPU-GPU Parallelism for Long-Context Inference
by: Yao, Feiyu, et al.
Published: (2026)

GPrune-LLM: Generalization-Aware Structured Pruning for Large Language Models
by: Liu, Xiaoyun, et al.
Published: (2026)

BigMac: A Communication-Efficient Mixture-of-Experts Model Structure for Fast Training and Inference
by: Jin, Zewen, et al.
Published: (2025)

FAST: Topology-Aware Frequency-Domain Distribution Matching for Coreset Selection
by: Cui, Jin, et al.
Published: (2025)

Reasoner for Real-World Event Detection: Scaling Reinforcement Learning via Adaptive Perplexity-Aware Sampling Strategy
by: Zhang, Xiaoyun, et al.
Published: (2025)

Advancing On-Device Neural Network Training with TinyPropv2: Dynamic, Sparse, and Efficient Backpropagation
by: Rüb, Marcus, et al.
Published: (2024)

ssProp: Energy-Efficient Training for Convolutional Neural Networks with Scheduled Sparse Back Propagation
by: Zhong, Lujia, et al.
Published: (2024)

OGLS-SD: On-Policy Self-Distillation with Outcome-Guided Logit Steering for LLM Reasoning
by: Yang, Yuxiao, et al.
Published: (2026)