:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Slutzky, Yonatan, Alexander, Yotam, Slor, Tomer, Nagel, Yoav, Cohen, Nadav
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2605.06992
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
by: Slutzky, Yonatan, et al.
Published: (2024)

Do Neural Networks Need Gradient Descent to Generalize? A Theoretical Study
by: Alexander, Yotam, et al.
Published: (2025)

Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data
by: Ran-Milo, Yuval, et al.
Published: (2026)

Deep Learning for Optical Misalignment Diagnostics in Multi-Lens Imaging Systems
by: Slor, Tomer, et al.
Published: (2025)

What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement
by: Alexander, Yotam, et al.
Published: (2023)

Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
by: Razin, Noam, et al.
Published: (2024)

On the Expressive Power of Sparse Geometric MPNNs
by: Sverdlov, Yonatan, et al.
Published: (2024)

NnD: Diffusion-based Generation of Physically-Nonnegative Objects
by: Torem, Nadav, et al.
Published: (2025)

When and How to Canonize: A Generalization Perspective
by: Sverdlov, Yonatan, et al.
Published: (2026)

Cluster Frequency Conformal Prediction for Local Coverage
by: Lavi, Tomer, et al.
Published: (2026)

Trajectory-Based Difficulty Scoring for Reliable Learning on Tabular Data
by: Lavi, Tomer, et al.
Published: (2026)

Clustered Calibration: Representation-Aware Probability Calibration via Learned Subpopulations
by: Lavi, Tomer, et al.
Published: (2025)

Mamba Knockout for Unraveling Factual Information Flow
by: Endy, Nir, et al.
Published: (2025)

Lecture Notes on Linear Neural Networks: A Tale of Optimization and Generalization in Deep Learning
by: Cohen, Nadav, et al.
Published: (2024)

Revisiting Multi-Permutation Equivariance through the Lens of Irreducible Representations
by: Sverdlov, Yonatan, et al.
Published: (2024)

Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
by: Öncel, Fırat, et al.
Published: (2024)

Why Domain Generalization Fail? A View of Necessity and Sufficiency
by: Vuong, Long-Tung, et al.
Published: (2025)

Monotone and Separable Set Functions: Characterizations and Neural Models
by: Sarangi, Soutrik, et al.
Published: (2025)

FSW-GNN: A Bi-Lipschitz WL-Equivalent Graph Neural Network
by: Sverdlov, Yonatan, et al.
Published: (2024)

Short-Range Oversquashing
by: Mishayev, Yaaqov, et al.
Published: (2025)

BXRL: Behavior-Explainable Reinforcement Learning
by: Rachum, Ram, et al.
Published: (2026)

Generative Myopia: Why Diffusion Models Fail at Structure
by: Siami, Milad
Published: (2025)

Inertial Navigation Meets Deep Learning: A Survey of Current Trends and Future Directions
by: Cohen, Nadav, et al.
Published: (2023)

In-context Learning and Gradient Descent Revisited
by: Deutch, Gilad, et al.
Published: (2023)

Tools Fail: Detecting Silent Errors in Faulty Tools
by: Sun, Jimin, et al.
Published: (2024)

Why Model Selection Fails in Time Series Forecasting: An Empirical Study of Instability Across Data Regimes
by: Akinci, Tahir Cetin, et al.
Published: (2026)

Why Do Safety Guardrails Degrade Across Languages?
by: Zhang, Max, et al.
Published: (2026)

Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation
by: Cohen, Nadav Z., et al.
Published: (2024)

SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
by: Cohen, Yaniv, et al.
Published: (2024)

Failing to Explore: Language Models on Interactive Tasks
by: JafariRaviz, Mahdi, et al.
Published: (2026)

Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild
by: Orzech, Nadav, et al.
Published: (2024)

Improved Distribution Estimation in $\ell_\infty$
by: Cohen, Doron, et al.
Published: (2026)

Why Attention Fails: The Degeneration of Transformers into MLPs in Time Series Forecasting
by: Liang, Zida, et al.
Published: (2025)

When Explanations Lie: Why Many Modified BP Attributions Fail
by: Sixt, Leon, et al.
Published: (2019)

Why Smooth Stability Assumptions Fail for ReLU Learning
by: Katende, Ronald
Published: (2025)

Why Do Transformers Fail to Forecast Time Series In-Context?
by: Zhou, Yufa, et al.
Published: (2025)

Gradient Starvation in Binary-Reward GRPO: Why Group-Mean Centering Fails and Why the Simplest Fix Works
by: Nie, Wenhua, et al.
Published: (2026)

Why Do More Experts Fail? A Theoretical Analysis of Model Merging
by: Wang, Zijing, et al.
Published: (2025)

Rate-Optimal Policy Optimization for Linear Markov Decision Processes
by: Sherman, Uri, et al.
Published: (2023)

Consensus is Not Verification: Why Crowd Wisdom Strategies Fail for LLM Truthfulness
by: Denisov-Blanch, Yegor, et al.
Published: (2026)