Saved in:
| Main Authors: | Kubaty, Piotr, Wójcik, Bartosz, Krzepkowski, Bartłomiej, Michaluk, Monika, Trzciński, Tomasz, Pomponi, Jary, Adamczewski, Kamil |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2407.14320 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Rethinking Calibration for Early-Exit Neural Networks
by: Kubaty, Piotr, et al.
Published: (2025)
by: Kubaty, Piotr, et al.
Published: (2025)
GUIDE: Guidance-based Incremental Learning with Diffusion Models
by: Cywiński, Bartosz, et al.
Published: (2024)
by: Cywiński, Bartosz, et al.
Published: (2024)
NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural Networks
by: Gambella, Matteo, et al.
Published: (2024)
by: Gambella, Matteo, et al.
Published: (2024)
Goal-oriented Communications based on Recursive Early Exit Neural Networks
by: Pomponi, Jary, et al.
Published: (2024)
by: Pomponi, Jary, et al.
Published: (2024)
Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models
by: Binkowski, Jakub, et al.
Published: (2026)
by: Binkowski, Jakub, et al.
Published: (2026)
Efficient LLM Moderation with Multi-Layer Latent Prototypes
by: Chrabąszcz, Maciej, et al.
Published: (2025)
by: Chrabąszcz, Maciej, et al.
Published: (2025)
One Shot vs. Iterative: Rethinking Pruning Strategies for Model Compression
by: Janusz, Mikołaj, et al.
Published: (2025)
by: Janusz, Mikołaj, et al.
Published: (2025)
Class incremental learning with probability dampening and cascaded gated classifier
by: Pomponi, Jary, et al.
Published: (2024)
by: Pomponi, Jary, et al.
Published: (2024)
ELROND: Exploring and decomposing intrinsic capabilities of diffusion models
by: Skierś, Paweł, et al.
Published: (2026)
by: Skierś, Paweł, et al.
Published: (2026)
Divide and not forget: Ensemble of selectively trained experts in Continual Learning
by: Rypeść, Grzegorz, et al.
Published: (2024)
by: Rypeść, Grzegorz, et al.
Published: (2024)
MagMax: Leveraging Model Merging for Seamless Continual Learning
by: Marczak, Daniel, et al.
Published: (2024)
by: Marczak, Daniel, et al.
Published: (2024)
Revisiting Supervision for Continual Representation Learning
by: Marczak, Daniel, et al.
Published: (2023)
by: Marczak, Daniel, et al.
Published: (2023)
Task-recency bias strikes back: Adapting covariances in Exemplar-Free Class Incremental Learning
by: Rypeść, Grzegorz, et al.
Published: (2024)
by: Rypeść, Grzegorz, et al.
Published: (2024)
Realistic Evaluation of Test-Time Adaptation Algorithms: Unsupervised Hyperparameter Selection
by: Cygert, Sebastian, et al.
Published: (2024)
by: Cygert, Sebastian, et al.
Published: (2024)
AdaGlimpse: Active Visual Exploration with Arbitrary Glimpse Position and Scale
by: Pardyl, Adam, et al.
Published: (2024)
by: Pardyl, Adam, et al.
Published: (2024)
AR-TTA: A Simple Method for Real-World Continual Test-Time Adaptation
by: Sójka, Damian, et al.
Published: (2023)
by: Sójka, Damian, et al.
Published: (2023)
Communication Efficient Split Learning of ViTs with Attention-based Double Compression
by: Alvetreti, Federico, et al.
Published: (2025)
by: Alvetreti, Federico, et al.
Published: (2025)
Adaptive Semantic Token Communication for Transformer-based Edge Inference
by: Devoto, Alessio, et al.
Published: (2025)
by: Devoto, Alessio, et al.
Published: (2025)
Efficient Multi-Source Knowledge Transfer by Model Merging
by: Osial, Marcin, et al.
Published: (2025)
by: Osial, Marcin, et al.
Published: (2025)
Category Adaptation Meets Projected Distillation in Generalized Continual Category Discovery
by: Rypeść, Grzegorz, et al.
Published: (2023)
by: Rypeść, Grzegorz, et al.
Published: (2023)
LAPLEX: The FFT of Learnable Laplace Kernels
by: Struski, Łukasz, et al.
Published: (2026)
by: Struski, Łukasz, et al.
Published: (2026)
Conditional computation in neural networks: principles and research trends
by: Scardapane, Simone, et al.
Published: (2024)
by: Scardapane, Simone, et al.
Published: (2024)
Adaptive Semantic Token Selection for AI-native Goal-oriented Communications
by: Devoto, Alessio, et al.
Published: (2024)
by: Devoto, Alessio, et al.
Published: (2024)
Shapley Pruning for Neural Network Compression
by: Adamczewski, Kamil, et al.
Published: (2024)
by: Adamczewski, Kamil, et al.
Published: (2024)
Overestimation, Overfitting, and Plasticity in Actor-Critic: the Bitter Lesson of Reinforcement Learning
by: Nauman, Michal, et al.
Published: (2024)
by: Nauman, Michal, et al.
Published: (2024)
Improving Continual Learning Performance and Efficiency with Auxiliary Classifiers
by: Szatkowski, Filip, et al.
Published: (2024)
by: Szatkowski, Filip, et al.
Published: (2024)
SAeUron: Interpretable Concept Unlearning in Diffusion Models with Sparse Autoencoders
by: Cywiński, Bartosz, et al.
Published: (2025)
by: Cywiński, Bartosz, et al.
Published: (2025)
Conceptualizing Embeddings: Sparse Disentanglement for Vision-Language Models
by: Kubaty, Piotr, et al.
Published: (2026)
by: Kubaty, Piotr, et al.
Published: (2026)
How to Train Your Robots? The Impact of Demonstration Modality on Imitation Learning
by: Li, Haozhuo, et al.
Published: (2025)
by: Li, Haozhuo, et al.
Published: (2025)
MISS: Multiclass Interpretable Scoring Systems
by: Grzeszczyk, Michal K., et al.
Published: (2024)
by: Grzeszczyk, Michal K., et al.
Published: (2024)
Scaling Laws for Fine-Grained Mixture of Experts
by: Krajewski, Jakub, et al.
Published: (2024)
by: Krajewski, Jakub, et al.
Published: (2024)
TORE: Token Recycling in Vision Transformers for Efficient Active Visual Exploration
by: Olszewski, Jan, et al.
Published: (2023)
by: Olszewski, Jan, et al.
Published: (2023)
Unpacking Softmax: How Temperature Drives Representation Collapse, Compression, and Generalization
by: Masarczyk, Wojciech, et al.
Published: (2025)
by: Masarczyk, Wojciech, et al.
Published: (2025)
Decoupled Relative Learning Rate Schedules
by: Ludziejewski, Jan, et al.
Published: (2025)
by: Ludziejewski, Jan, et al.
Published: (2025)
EXACT: How to Train Your Accuracy
by: Karpukhin, Ivan, et al.
Published: (2022)
by: Karpukhin, Ivan, et al.
Published: (2022)
Workspace Optimization: How to Train Your Agent
by: Sarafian, Elad, et al.
Published: (2026)
by: Sarafian, Elad, et al.
Published: (2026)
Training normalizing flows with computationally intensive target probability distributions
by: Bialas, Piotr, et al.
Published: (2023)
by: Bialas, Piotr, et al.
Published: (2023)
Exploiting Activation Sparsity with Dense to Dynamic-k Mixture-of-Experts Conversion
by: Szatkowski, Filip, et al.
Published: (2023)
by: Szatkowski, Filip, et al.
Published: (2023)
Differentially Private Neural Tangent Kernels for Privacy-Preserving Data Generation
by: Yang, Yilin, et al.
Published: (2023)
by: Yang, Yilin, et al.
Published: (2023)
An Accurate and Low-Parameter Machine Learning Architecture for Next Location Prediction
by: Jary, Calvin, et al.
Published: (2024)
by: Jary, Calvin, et al.
Published: (2024)
Similar Items
-
Rethinking Calibration for Early-Exit Neural Networks
by: Kubaty, Piotr, et al.
Published: (2025) -
GUIDE: Guidance-based Incremental Learning with Diffusion Models
by: Cywiński, Bartosz, et al.
Published: (2024) -
NACHOS: Neural Architecture Search for Hardware Constrained Early Exit Neural Networks
by: Gambella, Matteo, et al.
Published: (2024) -
Goal-oriented Communications based on Recursive Early Exit Neural Networks
by: Pomponi, Jary, et al.
Published: (2024) -
Attention Sinks as Internal Signals for Hallucination Detection in Large Language Models
by: Binkowski, Jakub, et al.
Published: (2026)