Saved in:
| Main Authors: | Gideoni, Yonatan, Risi, Sebastian, Gal, Yarin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.16805 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Do Multilingual LLMs Think In English?
by: Schut, Lisa, et al.
Published: (2025)
by: Schut, Lisa, et al.
Published: (2025)
Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning
by: Melo, Luckeciano C., et al.
Published: (2025)
by: Melo, Luckeciano C., et al.
Published: (2025)
Temporal-Difference Variational Continual Learning
by: Melo, Luckeciano C., et al.
Published: (2024)
by: Melo, Luckeciano C., et al.
Published: (2024)
In-Context Learning Learns Label Relationships but Is Not Conventional Learning
by: Kossen, Jannik, et al.
Published: (2023)
by: Kossen, Jannik, et al.
Published: (2023)
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
by: Nikitin, Alexander, et al.
Published: (2024)
by: Nikitin, Alexander, et al.
Published: (2024)
Evaluating & Reducing Deceptive Dialogue From Language Models with Multi-turn RL
by: Abdulhai, Marwa, et al.
Published: (2025)
by: Abdulhai, Marwa, et al.
Published: (2025)
Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs
by: Kossen, Jannik, et al.
Published: (2024)
by: Kossen, Jannik, et al.
Published: (2024)
Challenges and Considerations in the Evaluation of Bayesian Causal Discovery
by: Mamaghan, Amir Mohammad Karimi, et al.
Published: (2024)
by: Mamaghan, Amir Mohammad Karimi, et al.
Published: (2024)
TextCAVs: Debugging vision models using text
by: Nicolson, Angus, et al.
Published: (2024)
by: Nicolson, Angus, et al.
Published: (2024)
Leveraging Deep Learning for Physical Model Bias of Global Air Quality Estimates
by: Doerksen, Kelsey, et al.
Published: (2025)
by: Doerksen, Kelsey, et al.
Published: (2025)
Uncertainty Quantification for Surface Ozone Emulators using Deep Learning
by: Doerksen, Kelsey, et al.
Published: (2025)
by: Doerksen, Kelsey, et al.
Published: (2025)
Continuous Thought Machines
by: Darlow, Luke, et al.
Published: (2025)
by: Darlow, Luke, et al.
Published: (2025)
GPG: A Simple and Strong Reinforcement Learning Baseline for Model Reasoning
by: Chu, Xiangxiang, et al.
Published: (2025)
by: Chu, Xiangxiang, et al.
Published: (2025)
Deep Minds and Shallow Probes
by: Lee, Su Hyeong, et al.
Published: (2026)
by: Lee, Su Hyeong, et al.
Published: (2026)
Non-invasive Neural Decoding in Source Reconstructed Brain Space
by: Gideoni, Yonatan, et al.
Published: (2024)
by: Gideoni, Yonatan, et al.
Published: (2024)
No Mean Feat: Simple, Strong Baselines for Context Compression
by: Feldman, Yair, et al.
Published: (2025)
by: Feldman, Yair, et al.
Published: (2025)
Reasoning Introduces New Poisoning Attacks Yet Makes Them More Complicated
by: Foerster, Hanna, et al.
Published: (2025)
by: Foerster, Hanna, et al.
Published: (2025)
MolMix: A Simple Yet Effective Baseline for Multimodal Molecular Representation Learning
by: Manolache, Andrei, et al.
Published: (2024)
by: Manolache, Andrei, et al.
Published: (2024)
Iterative Deployment Improves Planning Skills in LLMs
by: Corrêa, Augusto B., et al.
Published: (2025)
by: Corrêa, Augusto B., et al.
Published: (2025)
SPA: A Simple but Tough-to-Beat Baseline for Knowledge Injection
by: Tang, Kexian, et al.
Published: (2026)
by: Tang, Kexian, et al.
Published: (2026)
Hedging Is Not All You Need: A Simple Baseline for Online Learning Under Haphazard Inputs
by: Buckchash, Himanshu, et al.
Published: (2024)
by: Buckchash, Himanshu, et al.
Published: (2024)
Probabilistic Modeling of Latent Agentic Substructures in Deep Neural Networks
by: Lee, Su Hyeong, et al.
Published: (2025)
by: Lee, Su Hyeong, et al.
Published: (2025)
When Does Structure Matter in Continual Learning? Dimensionality Controls When Modularity Shapes Representational Geometry
by: Korte, Kathrin, et al.
Published: (2026)
by: Korte, Kathrin, et al.
Published: (2026)
Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
by: O'Brien, Kyle, et al.
Published: (2025)
by: O'Brien, Kyle, et al.
Published: (2025)
A Simple Baseline for Stable and Plastic Neural Networks
by: Künzel, Étienne, et al.
Published: (2025)
by: Künzel, Étienne, et al.
Published: (2025)
An Embarrassingly Simple Baseline for Imbalanced Semi-Supervised Learning
by: Chen, Hao, et al.
Published: (2022)
by: Chen, Hao, et al.
Published: (2022)
Explaining Explainability: Recommendations for Effective Use of Concept Activation Vectors
by: Nicolson, Angus, et al.
Published: (2024)
by: Nicolson, Angus, et al.
Published: (2024)
Existing Large Language Model Unlearning Evaluations Are Inconclusive
by: Feng, Zhili, et al.
Published: (2025)
by: Feng, Zhili, et al.
Published: (2025)
LayerShuffle: Enhancing Robustness in Vision Transformers by Randomizing Layer Execution Order
by: Freiberger, Matthias, et al.
Published: (2024)
by: Freiberger, Matthias, et al.
Published: (2024)
A Simple, Solid, and Reproducible Baseline for Bridge Bidding AI
by: Kita, Haruka, et al.
Published: (2024)
by: Kita, Haruka, et al.
Published: (2024)
Is there Value in Reinforcement Learning?
by: Fox, Lior, et al.
Published: (2025)
by: Fox, Lior, et al.
Published: (2025)
On the Expressive Power of Sparse Geometric MPNNs
by: Sverdlov, Yonatan, et al.
Published: (2024)
by: Sverdlov, Yonatan, et al.
Published: (2024)
Structurally Flexible Neural Networks: Evolving the Building Blocks for General Agents
by: Pedersen, Joachim Winther, et al.
Published: (2024)
by: Pedersen, Joachim Winther, et al.
Published: (2024)
The Curse of Recursion: Training on Generated Data Makes Models Forget
by: Shumailov, Ilia, et al.
Published: (2023)
by: Shumailov, Ilia, et al.
Published: (2023)
More Test-Time Compute Can Hurt: Overestimation Bias in LLM Beam Search
by: Dalal, Gal, et al.
Published: (2026)
by: Dalal, Gal, et al.
Published: (2026)
Assessing Image Quality Using a Simple Generative Representation
by: Raviv, Simon, et al.
Published: (2024)
by: Raviv, Simon, et al.
Published: (2024)
Improved Distribution Estimation in $\ell_\infty$
by: Cohen, Doron, et al.
Published: (2026)
by: Cohen, Doron, et al.
Published: (2026)
Revisiting Multi-Permutation Equivariance through the Lens of Irreducible Representations
by: Sverdlov, Yonatan, et al.
Published: (2024)
by: Sverdlov, Yonatan, et al.
Published: (2024)
Competition-Aware CPC Forecasting with Near-Market Coverage
by: Frey, Sebastian, et al.
Published: (2026)
by: Frey, Sebastian, et al.
Published: (2026)
Policy Gradient with Tree Expansion
by: Dalal, Gal, et al.
Published: (2023)
by: Dalal, Gal, et al.
Published: (2023)
Similar Items
-
Do Multilingual LLMs Think In English?
by: Schut, Lisa, et al.
Published: (2025) -
Stabilizing Policy Gradients for Sample-Efficient Reinforcement Learning in LLM Reasoning
by: Melo, Luckeciano C., et al.
Published: (2025) -
Temporal-Difference Variational Continual Learning
by: Melo, Luckeciano C., et al.
Published: (2024) -
In-Context Learning Learns Label Relationships but Is Not Conventional Learning
by: Kossen, Jannik, et al.
Published: (2023) -
Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities
by: Nikitin, Alexander, et al.
Published: (2024)