Saved in:
| Main Authors: | Gao, Zhiqi, Ge, Albert, Berenbeim, Alexander, Bastian, Nathaniel D., Sala, Frederic |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2605.21751 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Fine-Tuning Small Reasoning Models for Quantum Field Theory
by: Woodward, Nathaniel S., et al.
Published: (2026)
by: Woodward, Nathaniel S., et al.
Published: (2026)
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint
by: Lee, Heekyung, et al.
Published: (2025)
by: Lee, Heekyung, et al.
Published: (2025)
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
by: Kambhampati, Subbarao, et al.
Published: (2024)
by: Kambhampati, Subbarao, et al.
Published: (2024)
Adaptive Experimentation When You Can't Experiment
by: Zhao, Yao, et al.
Published: (2024)
by: Zhao, Yao, et al.
Published: (2024)
You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning
by: Yang, Tianmeng, et al.
Published: (2024)
by: Yang, Tianmeng, et al.
Published: (2024)
Climbing the Ladder of Reasoning: What LLMs Can-and Still Can't-Solve after SFT?
by: Sun, Yiyou, et al.
Published: (2025)
by: Sun, Yiyou, et al.
Published: (2025)
Reference-Specific Unlearning Metrics Can Hide the Truth: A Reality Check
by: Cho, Sungjun, et al.
Published: (2025)
by: Cho, Sungjun, et al.
Published: (2025)
Can't Remember Details in Long Documents? You Need Some R&R
by: Agrawal, Devanshu, et al.
Published: (2024)
by: Agrawal, Devanshu, et al.
Published: (2024)
When Models Can't Follow: Testing Instruction Adherence Across 256 LLMs
by: Young, Richard J., et al.
Published: (2025)
by: Young, Richard J., et al.
Published: (2025)
Random Initialization Can't Catch Up: The Advantage of Language Model Transfer for Time Series Forecasting
by: Riachi, Roland, et al.
Published: (2025)
by: Riachi, Roland, et al.
Published: (2025)
Shadows Don't Lie and Lines Can't Bend! Generative Models don't know Projective Geometry...for now
by: Sarkar, Ayush, et al.
Published: (2023)
by: Sarkar, Ayush, et al.
Published: (2023)
Optimizing Prompt Sequences using Monte Carlo Tree Search for LLM-Based Optimization
by: Yu, Fei Xu, et al.
Published: (2025)
by: Yu, Fei Xu, et al.
Published: (2025)
The Implicit Bias of Structured State Space Models Can Be Poisoned With Clean Labels
by: Slutzky, Yonatan, et al.
Published: (2024)
by: Slutzky, Yonatan, et al.
Published: (2024)
I Can't Believe It's Not Robust: Catastrophic Collapse of Safety Classifiers under Embedding Drift
by: Sahoo, Subramanyam, et al.
Published: (2026)
by: Sahoo, Subramanyam, et al.
Published: (2026)
Learning What Reinforcement Learning Can't: Interleaved Online Fine-Tuning for Hardest Questions
by: Ma, Lu, et al.
Published: (2025)
by: Ma, Lu, et al.
Published: (2025)
Why Can't I See My Clusters? A Precision-Recall Approach to Dimensionality Reduction Validation
by: van der Hoorn, Diede P. M., et al.
Published: (2025)
by: van der Hoorn, Diede P. M., et al.
Published: (2025)
Why Can't Transformers Learn Multiplication? Reverse-Engineering Reveals Long-Range Dependency Pitfalls
by: Bai, Xiaoyan, et al.
Published: (2025)
by: Bai, Xiaoyan, et al.
Published: (2025)
Can AI Agents Agree?
by: Berdoz, Frédéric, et al.
Published: (2026)
by: Berdoz, Frédéric, et al.
Published: (2026)
Your Teaching Can't Help
by: Rebecca Weaver
Published: (2024)
by: Rebecca Weaver
Published: (2024)
Expressivity-Efficiency Tradeoffs for Hybrid Sequence Models
by: Cooper, John, et al.
Published: (2026)
by: Cooper, John, et al.
Published: (2026)
Test-Time Scaling Makes Overtraining Compute-Optimal
by: Roberts, Nicholas, et al.
Published: (2026)
by: Roberts, Nicholas, et al.
Published: (2026)
I Can't Believe It's Not Real: CV-MuSeNet: Complex-Valued Multi-Signal Segmentation
by: Shin, Sangwon, et al.
Published: (2025)
by: Shin, Sangwon, et al.
Published: (2025)
Language Model Embeddings Can Be Sufficient for Bayesian Optimization
by: Nguyen, Tung, et al.
Published: (2024)
by: Nguyen, Tung, et al.
Published: (2024)
Zero-Shot Robustification of Zero-Shot Models
by: Adila, Dyah, et al.
Published: (2023)
by: Adila, Dyah, et al.
Published: (2023)
Even GPT-5.2 Can't Count to Five: The Case for Zero-Error Horizons in Trustworthy LLMs
by: Sato, Ryoma
Published: (2026)
by: Sato, Ryoma
Published: (2026)
Johnny Still Can't Read
by: Melcher, Daniel
Published: (1973)
by: Melcher, Daniel
Published: (1973)
Can Large Language Models Reason and Optimize Under Constraints?
by: Bernier, Fabien, et al.
Published: (2026)
by: Bernier, Fabien, et al.
Published: (2026)
A Design-based Solution for Causal Inference with Text: Can a Language Model Be Too Large?
by: Tierney, Graham, et al.
Published: (2025)
by: Tierney, Graham, et al.
Published: (2025)
Interactive Critique-Revision Training for Reliable Structured LLM Generation
by: Yu, Fei Xu, et al.
Published: (2026)
by: Yu, Fei Xu, et al.
Published: (2026)
Quantifying Structure in CLIP Embeddings: A Statistical Framework for Concept Interpretation
by: Zhao, Jitian, et al.
Published: (2025)
by: Zhao, Jitian, et al.
Published: (2025)
Can Watermarking Large Language Models Prevent Copyrighted Text Generation and Hide Training Data?
by: Panaitescu-Liess, Michael-Andrei, et al.
Published: (2024)
by: Panaitescu-Liess, Michael-Andrei, et al.
Published: (2024)
XG-NID: Dual-Modality Network Intrusion Detection using a Heterogeneous Graph Neural Network and Large Language Model
by: Farrukh, Yasir Ali, et al.
Published: (2024)
by: Farrukh, Yasir Ali, et al.
Published: (2024)
Deceptive Sequential Decision-Making via Regularized Policy Optimization
by: Kim, Yerin, et al.
Published: (2025)
by: Kim, Yerin, et al.
Published: (2025)
They Can't Keep a Good Profession Down--or Can They?
by: Franckowiak, Bernard
Published: (1978)
by: Franckowiak, Bernard
Published: (1978)
Semantic Deception: When Reasoning Models Can't Compute an Addition
by: de Leeuw, Nathaniël, et al.
Published: (2025)
by: de Leeuw, Nathaniël, et al.
Published: (2025)
Can Graphs Improve Tabular Foundation Models?
by: Le, Franck, et al.
Published: (2025)
by: Le, Franck, et al.
Published: (2025)
Can Large Reasoning Models Self-Train?
by: Shafayat, Sheikh, et al.
Published: (2025)
by: Shafayat, Sheikh, et al.
Published: (2025)
Model Sparsity Can Simplify Machine Unlearning
by: Jia, Jinghan, et al.
Published: (2023)
by: Jia, Jinghan, et al.
Published: (2023)
OTTER: Effortless Label Distribution Adaptation of Zero-shot Models
by: Shin, Changho, et al.
Published: (2024)
by: Shin, Changho, et al.
Published: (2024)
Weight Updates as Activation Shifts: A Principled Framework for Steering
by: Adila, Dyah, et al.
Published: (2026)
by: Adila, Dyah, et al.
Published: (2026)
Similar Items
-
Fine-Tuning Small Reasoning Models for Quantum Field Theory
by: Woodward, Nathaniel S., et al.
Published: (2026) -
Puzzled by Puzzles: When Vision-Language Models Can't Take a Hint
by: Lee, Heekyung, et al.
Published: (2025) -
LLMs Can't Plan, But Can Help Planning in LLM-Modulo Frameworks
by: Kambhampati, Subbarao, et al.
Published: (2024) -
Adaptive Experimentation When You Can't Experiment
by: Zhao, Yao, et al.
Published: (2024) -
You Can't Ignore Either: Unifying Structure and Feature Denoising for Robust Graph Learning
by: Yang, Tianmeng, et al.
Published: (2024)