Saved in:
| Main Authors: | Slutzky, Yonatan, Alexander, Yotam, Slor, Tomer, Nagel, Yoav, Cohen, Nadav |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.06992 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
by: Slutzky, Yonatan, et al.
Published: (2024)
by: Slutzky, Yonatan, et al.
Published: (2024)
Do Neural Networks Need Gradient Descent to Generalize? A Theoretical Study
by: Alexander, Yotam, et al.
Published: (2025)
by: Alexander, Yotam, et al.
Published: (2025)
Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data
by: Ran-Milo, Yuval, et al.
Published: (2026)
by: Ran-Milo, Yuval, et al.
Published: (2026)
Deep Learning for Optical Misalignment Diagnostics in Multi-Lens Imaging Systems
by: Slor, Tomer, et al.
Published: (2025)
by: Slor, Tomer, et al.
Published: (2025)
What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement
by: Alexander, Yotam, et al.
Published: (2023)
by: Alexander, Yotam, et al.
Published: (2023)
Implicit Bias of Policy Gradient in Linear Quadratic Control: Extrapolation to Unseen Initial States
by: Razin, Noam, et al.
Published: (2024)
by: Razin, Noam, et al.
Published: (2024)
On the Expressive Power of Sparse Geometric MPNNs
by: Sverdlov, Yonatan, et al.
Published: (2024)
by: Sverdlov, Yonatan, et al.
Published: (2024)
NnD: Diffusion-based Generation of Physically-Nonnegative Objects
by: Torem, Nadav, et al.
Published: (2025)
by: Torem, Nadav, et al.
Published: (2025)
When and How to Canonize: A Generalization Perspective
by: Sverdlov, Yonatan, et al.
Published: (2026)
by: Sverdlov, Yonatan, et al.
Published: (2026)
Cluster Frequency Conformal Prediction for Local Coverage
by: Lavi, Tomer, et al.
Published: (2026)
by: Lavi, Tomer, et al.
Published: (2026)
Trajectory-Based Difficulty Scoring for Reliable Learning on Tabular Data
by: Lavi, Tomer, et al.
Published: (2026)
by: Lavi, Tomer, et al.
Published: (2026)
Clustered Calibration: Representation-Aware Probability Calibration via Learned Subpopulations
by: Lavi, Tomer, et al.
Published: (2025)
by: Lavi, Tomer, et al.
Published: (2025)
Mamba Knockout for Unraveling Factual Information Flow
by: Endy, Nir, et al.
Published: (2025)
by: Endy, Nir, et al.
Published: (2025)
Lecture Notes on Linear Neural Networks: A Tale of Optimization and Generalization in Deep Learning
by: Cohen, Nadav, et al.
Published: (2024)
by: Cohen, Nadav, et al.
Published: (2024)
Revisiting Multi-Permutation Equivariance through the Lens of Irreducible Representations
by: Sverdlov, Yonatan, et al.
Published: (2024)
by: Sverdlov, Yonatan, et al.
Published: (2024)
Adaptation Odyssey in LLMs: Why Does Additional Pretraining Sometimes Fail to Improve?
by: Öncel, Fırat, et al.
Published: (2024)
by: Öncel, Fırat, et al.
Published: (2024)
Why Domain Generalization Fail? A View of Necessity and Sufficiency
by: Vuong, Long-Tung, et al.
Published: (2025)
by: Vuong, Long-Tung, et al.
Published: (2025)
Monotone and Separable Set Functions: Characterizations and Neural Models
by: Sarangi, Soutrik, et al.
Published: (2025)
by: Sarangi, Soutrik, et al.
Published: (2025)
FSW-GNN: A Bi-Lipschitz WL-Equivalent Graph Neural Network
by: Sverdlov, Yonatan, et al.
Published: (2024)
by: Sverdlov, Yonatan, et al.
Published: (2024)
Short-Range Oversquashing
by: Mishayev, Yaaqov, et al.
Published: (2025)
by: Mishayev, Yaaqov, et al.
Published: (2025)
BXRL: Behavior-Explainable Reinforcement Learning
by: Rachum, Ram, et al.
Published: (2026)
by: Rachum, Ram, et al.
Published: (2026)
Generative Myopia: Why Diffusion Models Fail at Structure
by: Siami, Milad
Published: (2025)
by: Siami, Milad
Published: (2025)
Inertial Navigation Meets Deep Learning: A Survey of Current Trends and Future Directions
by: Cohen, Nadav, et al.
Published: (2023)
by: Cohen, Nadav, et al.
Published: (2023)
In-context Learning and Gradient Descent Revisited
by: Deutch, Gilad, et al.
Published: (2023)
by: Deutch, Gilad, et al.
Published: (2023)
Tools Fail: Detecting Silent Errors in Faulty Tools
by: Sun, Jimin, et al.
Published: (2024)
by: Sun, Jimin, et al.
Published: (2024)
Why Model Selection Fails in Time Series Forecasting: An Empirical Study of Instability Across Data Regimes
by: Akinci, Tahir Cetin, et al.
Published: (2026)
by: Akinci, Tahir Cetin, et al.
Published: (2026)
Why Do Safety Guardrails Degrade Across Languages?
by: Zhang, Max, et al.
Published: (2026)
by: Zhang, Max, et al.
Published: (2026)
Conditional Balance: Improving Multi-Conditioning Trade-Offs in Image Generation
by: Cohen, Nadav Z., et al.
Published: (2024)
by: Cohen, Nadav Z., et al.
Published: (2024)
SINR-Aware Deep Reinforcement Learning for Distributed Dynamic Channel Allocation in Cognitive Interference Networks
by: Cohen, Yaniv, et al.
Published: (2024)
by: Cohen, Yaniv, et al.
Published: (2024)
Failing to Explore: Language Models on Interactive Tasks
by: JafariRaviz, Mahdi, et al.
Published: (2026)
by: JafariRaviz, Mahdi, et al.
Published: (2026)
Masked Extended Attention for Zero-Shot Virtual Try-On In The Wild
by: Orzech, Nadav, et al.
Published: (2024)
by: Orzech, Nadav, et al.
Published: (2024)
Improved Distribution Estimation in $\ell_\infty$
by: Cohen, Doron, et al.
Published: (2026)
by: Cohen, Doron, et al.
Published: (2026)
Why Attention Fails: The Degeneration of Transformers into MLPs in Time Series Forecasting
by: Liang, Zida, et al.
Published: (2025)
by: Liang, Zida, et al.
Published: (2025)
When Explanations Lie: Why Many Modified BP Attributions Fail
by: Sixt, Leon, et al.
Published: (2019)
by: Sixt, Leon, et al.
Published: (2019)
Why Smooth Stability Assumptions Fail for ReLU Learning
by: Katende, Ronald
Published: (2025)
by: Katende, Ronald
Published: (2025)
Why Do Transformers Fail to Forecast Time Series In-Context?
by: Zhou, Yufa, et al.
Published: (2025)
by: Zhou, Yufa, et al.
Published: (2025)
Gradient Starvation in Binary-Reward GRPO: Why Group-Mean Centering Fails and Why the Simplest Fix Works
by: Nie, Wenhua, et al.
Published: (2026)
by: Nie, Wenhua, et al.
Published: (2026)
Why Do More Experts Fail? A Theoretical Analysis of Model Merging
by: Wang, Zijing, et al.
Published: (2025)
by: Wang, Zijing, et al.
Published: (2025)
Rate-Optimal Policy Optimization for Linear Markov Decision Processes
by: Sherman, Uri, et al.
Published: (2023)
by: Sherman, Uri, et al.
Published: (2023)
Consensus is Not Verification: Why Crowd Wisdom Strategies Fail for LLM Truthfulness
by: Denisov-Blanch, Yegor, et al.
Published: (2026)
by: Denisov-Blanch, Yegor, et al.
Published: (2026)
Similar Items
-
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
by: Slutzky, Yonatan, et al.
Published: (2024) -
Do Neural Networks Need Gradient Descent to Generalize? A Theoretical Study
by: Alexander, Yotam, et al.
Published: (2025) -
Outcome-Based RL Provably Leads Transformers to Reason, but Only With the Right Data
by: Ran-Milo, Yuval, et al.
Published: (2026) -
Deep Learning for Optical Misalignment Diagnostics in Multi-Lens Imaging Systems
by: Slor, Tomer, et al.
Published: (2025) -
What Makes Data Suitable for a Locally Connected Neural Network? A Necessary and Sufficient Condition Based on Quantum Entanglement
by: Alexander, Yotam, et al.
Published: (2023)