Saved in:
| Main Authors: | Janusz, Mikołaj, Wojnar, Tomasz, Li, Yawei, Benini, Luca, Adamczewski, Kamil |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2508.13836 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Shapley Pruning for Neural Network Compression
by: Adamczewski, Kamil, et al.
Published: (2024)
by: Adamczewski, Kamil, et al.
Published: (2024)
LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis
by: Döner, Berkay, et al.
Published: (2025)
by: Döner, Berkay, et al.
Published: (2025)
RDKV: Rate-Distortion Bit Allocation for Joint Eviction and Quantization of the KV Cache
by: Zhang, Junkai, et al.
Published: (2026)
by: Zhang, Junkai, et al.
Published: (2026)
FEMBA: Efficient and Scalable EEG Analysis with a Bidirectional Mamba Foundation Model
by: Tegon, Anna, et al.
Published: (2025)
by: Tegon, Anna, et al.
Published: (2025)
OMENN: One Matrix to Explain Neural Networks
by: Wróbel, Adam, et al.
Published: (2024)
by: Wróbel, Adam, et al.
Published: (2024)
Projected Compression: Trainable Projection for Efficient Transformer Compression
by: Stefaniak, Maciej, et al.
Published: (2025)
by: Stefaniak, Maciej, et al.
Published: (2025)
PhysioWave: A Multi-Scale Wavelet-Transformer for Physiological Signal Representation
by: Chen, Yanlong, et al.
Published: (2025)
by: Chen, Yanlong, et al.
Published: (2025)
Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models
by: Binkowski, Jakub, et al.
Published: (2026)
by: Binkowski, Jakub, et al.
Published: (2026)
Finetuning and Quantization of EEG-Based Foundational BioSignal Models on ECG and PPG Data for Blood Pressure Estimation
by: Tóth, Bálint, et al.
Published: (2025)
by: Tóth, Bálint, et al.
Published: (2025)
Scaling Laws for Fine-Grained Mixture of Experts
by: Krajewski, Jakub, et al.
Published: (2024)
by: Krajewski, Jakub, et al.
Published: (2024)
TinyMyo: a Tiny Foundation Model for Flexible EMG Signal Processing at the Edge
by: Fasulo, Matteo, et al.
Published: (2025)
by: Fasulo, Matteo, et al.
Published: (2025)
Chasing COMET: Leveraging Minimum Bayes Risk Decoding for Self-Improving Machine Translation
by: Guttmann, Kamil, et al.
Published: (2024)
by: Guttmann, Kamil, et al.
Published: (2024)
PrunePEFT: Iterative Hybrid Pruning for Parameter-Efficient Fine-tuning of LLMs
by: Yu, Tongzhou, et al.
Published: (2025)
by: Yu, Tongzhou, et al.
Published: (2025)
Rethinking Pruning for Backdoor Mitigation: An Optimization Perspective
by: Li, Nan, et al.
Published: (2024)
by: Li, Nan, et al.
Published: (2024)
Structured vs. Unstructured Pruning: An Exponential Gap
by: Ferre', Davide, et al.
Published: (2026)
by: Ferre', Davide, et al.
Published: (2026)
PERP: Rethinking the Prune-Retrain Paradigm in the Era of LLMs
by: Zimmer, Max, et al.
Published: (2023)
by: Zimmer, Max, et al.
Published: (2023)
One Self-Configurable Model to Solve Many Abstract Visual Reasoning Problems
by: Małkiński, Mikołaj, et al.
Published: (2023)
by: Małkiński, Mikołaj, et al.
Published: (2023)
REAP the Experts: Why Pruning Prevails for One-Shot MoE compression
by: Lasby, Mike, et al.
Published: (2025)
by: Lasby, Mike, et al.
Published: (2025)
Sparse Weight Averaging with Multiple Particles for Iterative Magnitude Pruning
by: Choi, Moonseok, et al.
Published: (2023)
by: Choi, Moonseok, et al.
Published: (2023)
Signal Collapse in One-Shot Pruning: When Sparse Models Fail to Distinguish Neural Representations
by: Saikumar, Dhananjay, et al.
Published: (2025)
by: Saikumar, Dhananjay, et al.
Published: (2025)
Locality-Aware Redundancy Pruning for LLM Depth Compression
by: Yun, Vincent-Daniel, et al.
Published: (2026)
by: Yun, Vincent-Daniel, et al.
Published: (2026)
Beyond One-Way Pruning: Bidirectional Pruning-Regrowth for Extreme Accuracy-Sparsity Tradeoff
by: Liu, Junchen, et al.
Published: (2025)
by: Liu, Junchen, et al.
Published: (2025)
Vanishing Contributions: A Unified Framework for Smooth and Iterative Model Compression
by: Nikiforos, Lorenzo, et al.
Published: (2025)
by: Nikiforos, Lorenzo, et al.
Published: (2025)
FedMap: Iterative Magnitude-Based Pruning for Communication-Efficient Federated Learning
by: Herzog, Alexander, et al.
Published: (2024)
by: Herzog, Alexander, et al.
Published: (2024)
Compressing Many-Shots in In-Context Learning
by: Khatri, Devvrit, et al.
Published: (2025)
by: Khatri, Devvrit, et al.
Published: (2025)
How to Train Your Multi-Exit Model? Analyzing the Impact of Training Strategies
by: Kubaty, Piotr, et al.
Published: (2024)
by: Kubaty, Piotr, et al.
Published: (2024)
Rethinking Key-Value Cache Compression Techniques for Large Language Model Serving
by: Gao, Wei, et al.
Published: (2025)
by: Gao, Wei, et al.
Published: (2025)
Rethinking the Harmonic Loss via Non-Euclidean Distance Layers
by: Miller-Golub, Maxwell, et al.
Published: (2026)
by: Miller-Golub, Maxwell, et al.
Published: (2026)
SymDrift: One-Shot Generative Modeling under Symmetries
by: Darouich, Samir, et al.
Published: (2026)
by: Darouich, Samir, et al.
Published: (2026)
A Free Lunch in LLM Compression: Revisiting Retraining after Pruning
by: Wagner, Moritz, et al.
Published: (2025)
by: Wagner, Moritz, et al.
Published: (2025)
RAP: KV-Cache Compression via RoPE-Aligned Pruning
by: Xin, Jihao, et al.
Published: (2026)
by: Xin, Jihao, et al.
Published: (2026)
Is Meta-Learning Out? Rethinking Unsupervised Few-Shot Classification with Limited Entropy
by: Guan, Yunchuan, et al.
Published: (2025)
by: Guan, Yunchuan, et al.
Published: (2025)
One-Shot Clustering for Federated Learning
by: Zuziak, Maciej Krzysztof, et al.
Published: (2025)
by: Zuziak, Maciej Krzysztof, et al.
Published: (2025)
BISeizuRe: BERT-Inspired Seizure Data Representation to Improve Epilepsy Monitoring
by: Benfenati, Luca, et al.
Published: (2024)
by: Benfenati, Luca, et al.
Published: (2024)
Is One Score Enough? Rethinking the Evaluation of Sequentially Evolving LLM Memory
by: Dong, Songwei, et al.
Published: (2026)
by: Dong, Songwei, et al.
Published: (2026)
SepLLM: Accelerate Large Language Models by Compressing One Segment into One Separator
by: Chen, Guoxuan, et al.
Published: (2024)
by: Chen, Guoxuan, et al.
Published: (2024)
Joint MoE Scaling Laws: Mixture of Experts Can Be Memory Efficient
by: Ludziejewski, Jan, et al.
Published: (2025)
by: Ludziejewski, Jan, et al.
Published: (2025)
Real vs. Semi-Simulated: Rethinking Evaluation for Treatment Effect Estimation
by: Panagopoulos, George
Published: (2026)
by: Panagopoulos, George
Published: (2026)
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders
by: Cywiński, Bartosz, et al.
Published: (2025)
by: Cywiński, Bartosz, et al.
Published: (2025)
CAOS: Conformal Aggregation of One-Shot Predictors
by: Waldron, Maja
Published: (2026)
by: Waldron, Maja
Published: (2026)
Similar Items
-
Shapley Pruning for Neural Network Compression
by: Adamczewski, Kamil, et al.
Published: (2024) -
LUNA: Efficient and Topology-Agnostic Foundation Model for EEG Signal Analysis
by: Döner, Berkay, et al.
Published: (2025) -
RDKV: Rate-Distortion Bit Allocation for Joint Eviction and Quantization of the KV Cache
by: Zhang, Junkai, et al.
Published: (2026) -
FEMBA: Efficient and Scalable EEG Analysis with a Bidirectional Mamba Foundation Model
by: Tegon, Anna, et al.
Published: (2025) -
OMENN: One Matrix to Explain Neural Networks
by: Wróbel, Adam, et al.
Published: (2024)