:: Library Catalog

Buchumschlag

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Adnan, Mohammed, Jain, Rohan, Jacobs, Tom, Sharma, Ekansh, Krishnan, Rahul G., Burkholz, Rebekka, Ioannou, Yani
Format:	Preprint
Veröffentlicht:	2026
Schlagworte:	Machine Learning
Online-Zugang:	https://arxiv.org/abs/2605.27541
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

Ähnliche Einträge

Sparse Training from Random Initialization: Aligning Lottery Ticket Masks using Weight Symmetry
von: Adnan, Mohammed, et al.
Veröffentlicht: (2025)

HORST: Composing Optimizer Geometries for Sparse Transformer Training
von: Jacobs, Tom, et al.
Veröffentlicht: (2026)

Sign-In to the Lottery: Reparameterizing Sparse Training From Scratch
von: Gadhikar, Advait, et al.
Veröffentlicht: (2025)

Mask in the Mirror: Implicit Sparsification
von: Jacobs, Tom, et al.
Veröffentlicht: (2024)

Cyclic Sparse Training: Is it Enough?
von: Gadhikar, Advait, et al.
Veröffentlicht: (2024)

Mirror, Mirror of the Flow: How Does Regularization Shape Implicit Bias?
von: Jacobs, Tom, et al.
Veröffentlicht: (2025)

Never Saddle for Reparameterized Steepest Descent as Mirror Flow
von: Jacobs, Tom, et al.
Veröffentlicht: (2026)

Dynamic Sparse Training with Structured Sparsity
von: Lasby, Mike, et al.
Veröffentlicht: (2023)

Pay Attention to Small Weights
von: Zhou, Chao, et al.
Veröffentlicht: (2025)

Hyperbolic Aware Minimization: Implicit Bias for Sparsity
von: Jacobs, Tom, et al.
Veröffentlicht: (2025)

Robustness of Mixtures of Experts to Feature Noise
von: Sun, Dong, et al.
Veröffentlicht: (2026)

The Graphon Limit Hypothesis: Understanding Neural Network Pruning via Infinite Width Analysis
von: Pham, Hoang, et al.
Veröffentlicht: (2025)

Learning Fine-grained Parameter Sharing via Sparse Tensor Decomposition
von: Üyük, Cem, et al.
Veröffentlicht: (2024)

GATE: How to Keep Out Intrusive Neighbors
von: Mustafa, Nimrah, et al.
Veröffentlicht: (2024)

Masks, Signs, And Learning Rate Rewinding
von: Gadhikar, Advait, et al.
Veröffentlicht: (2024)

Fixed Aggregation Features Can Rival GNNs
von: Rubio-Madrigal, Celia, et al.
Veröffentlicht: (2026)

What is Left After Distillation? How Knowledge Transfer Impacts Fairness and Bias
von: Mohammadshahi, Aida, et al.
Veröffentlicht: (2024)

Spectral Graph Pruning Against Over-Squashing and Over-Smoothing
von: Jamadandi, Adarsh, et al.
Veröffentlicht: (2024)

GNNs Getting ComFy: Community and Feature Similarity Guided Rewiring
von: Rubio-Madrigal, Celia, et al.
Veröffentlicht: (2025)

Implicit Bias of Mirror Flow in Homogeneous Neural Networks: Sparse and Dense Feature Learning
von: Jacobs, Tom, et al.
Veröffentlicht: (2026)

Multi-Agent Systems are Mixtures of Experts: Who Becomes an Influencer?
von: Bause, Franka, et al.
Veröffentlicht: (2026)

Pruning neural network models for gene regulatory dynamics using data and domain knowledge
von: Hossain, Intekhab, et al.
Veröffentlicht: (2024)

When Shift Happens - Confounding Is to Blame
von: Reddy, Abbavaram Gowtham, et al.
Veröffentlicht: (2025)

Frequency-Based Hyperparameter Selection in Games
von: Sanyal, Aniket, et al.
Veröffentlicht: (2026)

SD$^2$: Self-Distilled Sparse Drafters
von: Lasby, Mike, et al.
Veröffentlicht: (2025)

Gradient-Congruity Guided Federated Sparse Training
von: Tian, Chris Xing, et al.
Veröffentlicht: (2024)

Meta-GCN: A Dynamically Weighted Loss Minimization Method for Dealing with the Data Imbalance in Graph Neural Networks
von: Mohammadizadeh, Mahdi, et al.
Veröffentlicht: (2024)

Bridging Domains through Subspace-Aware Model Merging
von: Chaves, Levy, et al.
Veröffentlicht: (2026)

Towards Credit-Fraud Detection via Sparsely Varying Gaussian Approximations
von: Sharma, Harshit, et al.
Veröffentlicht: (2020)

Order-based Structure Learning with Normalizing Flows
von: Kamkari, Hamidreza, et al.
Veröffentlicht: (2023)

Dynamic Sparse Training of Diagonally Sparse Networks
von: Tyagi, Abhishek, et al.
Veröffentlicht: (2025)

Sparse-to-Sparse Training of Diffusion Models
von: Oliveira, Inês Cardoso, et al.
Veröffentlicht: (2025)

Sparse-ProxSkip: Accelerated Sparse-to-Sparse Training in Federated Learning
von: Meinhardt, Georg, et al.
Veröffentlicht: (2024)

Navigating Extremes: Dynamic Sparsity in Large Output Spaces
von: Ullah, Nasib, et al.
Veröffentlicht: (2024)

The Non-Local Model Merging Problem: Permutation Symmetries and Variance Collapse
von: Sharma, Ekansh, et al.
Veröffentlicht: (2024)

Dense Backpropagation Improves Training for Sparse Mixture-of-Experts
von: Panda, Ashwinee, et al.
Veröffentlicht: (2025)

Grass: Compute Efficient Low-Memory LLM Training with Structured Sparse Gradients
von: Muhamed, Aashiq, et al.
Veröffentlicht: (2024)

Lost in Translation: How Language Re-Aligns Vision for Cross-Species Pathology
von: Arora, Ekansh
Veröffentlicht: (2026)

Transformers with Sparse Attention for Granger Causality
von: Mahesh, Riya, et al.
Veröffentlicht: (2024)

Simultaneous linear connectivity of neural networks modulo permutation
von: Sharma, Ekansh, et al.
Veröffentlicht: (2024)