Saved in:
| Main Authors: | Lomasov, Semyon, Goldfeder, Judah, Erol, Mehmet Hamza, So, Matthew, Yan, Yao, Howard, Addison, Kutz, Nathan, Ziv, Ravid Shwartz |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2510.26025 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
AI Must Embrace Specialization via Superhuman Adaptable Intelligence
by: Goldfeder, Judah, et al.
Published: (2026)
by: Goldfeder, Judah, et al.
Published: (2026)
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models
by: Paech, Samuel, et al.
Published: (2025)
by: Paech, Samuel, et al.
Published: (2025)
A superpersuasive autonomous policy debating system
by: Roush, Allen, et al.
Published: (2025)
by: Roush, Allen, et al.
Published: (2025)
Generating Auxiliary Tasks with Reinforcement Learning
by: Goldfeder, Judah, et al.
Published: (2025)
by: Goldfeder, Judah, et al.
Published: (2025)
Soft Clustering Anchors for Self-Supervised Speech Representation Learning in Joint Embedding Prediction Architectures
by: Ioannides, Georgios, et al.
Published: (2026)
by: Ioannides, Georgios, et al.
Published: (2026)
Learning to Compress: Local Rank and Information Compression in Deep Neural Networks
by: Patel, Niket, et al.
Published: (2024)
by: Patel, Niket, et al.
Published: (2024)
Bi-Encoder Contrastive Learning for Fingerprint and Iris Biometrics
by: So, Matthew, et al.
Published: (2025)
by: So, Matthew, et al.
Published: (2025)
Do Multi-Agents Dream of Electric Screens? Achieving Perfect Accuracy on AndroidWorld Through Task Decomposition
by: Favreau, Pierre-Louis, et al.
Published: (2026)
by: Favreau, Pierre-Louis, et al.
Published: (2026)
NdLinear: Preserving Multi-Dimensional Structure for Parameter-Efficient Neural Networks
by: Reneau, Alex, et al.
Published: (2025)
by: Reneau, Alex, et al.
Published: (2025)
Video Representation Learning with Joint-Embedding Predictive Architectures
by: Drozdov, Katrina, et al.
Published: (2024)
by: Drozdov, Katrina, et al.
Published: (2024)
Does Representation Matter? Exploring Intermediate Layers in Large Language Models
by: Skean, Oscar, et al.
Published: (2024)
by: Skean, Oscar, et al.
Published: (2024)
From Tokens to Thoughts: How LLMs and Humans Trade Compression for Meaning
by: Shani, Chen, et al.
Published: (2025)
by: Shani, Chen, et al.
Published: (2025)
Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs
by: Chen, Angelica, et al.
Published: (2023)
by: Chen, Angelica, et al.
Published: (2023)
When Attention Collapses: How Degenerate Layers in LLMs Enable Smaller, Stronger Models
by: Sanyal, Sunny, et al.
Published: (2024)
by: Sanyal, Sunny, et al.
Published: (2024)
The Entropy Enigma: Success and Failure of Entropy Minimization
by: Press, Ori, et al.
Published: (2024)
by: Press, Ori, et al.
Published: (2024)
Latent Transfer Attack: Adversarial Examples via Generative Latent Spaces
by: Shaar, Eitan, et al.
Published: (2026)
by: Shaar, Eitan, et al.
Published: (2026)
Evidence of an Emergent "Self" in Continual Robot Learning
by: Jhunjhunwala, Adidev, et al.
Published: (2026)
by: Jhunjhunwala, Adidev, et al.
Published: (2026)
On Training in Imagination
by: Timor, Nadav, et al.
Published: (2026)
by: Timor, Nadav, et al.
Published: (2026)
Variance-Covariance Regularization Improves Representation Learning
by: Zhu, Jiachen, et al.
Published: (2023)
by: Zhu, Jiachen, et al.
Published: (2023)
The Illusion of Progress: Re-evaluating Hallucination Detection in LLMs
by: Janiak, Denis, et al.
Published: (2025)
by: Janiak, Denis, et al.
Published: (2025)
Creative synthesis of kinematic mechanisms
by: Lin, Jiong, et al.
Published: (2025)
by: Lin, Jiong, et al.
Published: (2025)
An Information-Theoretic Perspective on Variance-Invariance-Covariance Regularization
by: Shwartz-Ziv, Ravid, et al.
Published: (2023)
by: Shwartz-Ziv, Ravid, et al.
Published: (2023)
Turning Up the Heat: Min-p Sampling for Creative and Coherent LLM Outputs
by: Nguyen, Minh Nhat, et al.
Published: (2024)
by: Nguyen, Minh Nhat, et al.
Published: (2024)
You Had One Job: Per-Task Quantization Using LLMs' Hidden Representations
by: LeVi, Amit, et al.
Published: (2025)
by: LeVi, Amit, et al.
Published: (2025)
Rate-In: Information-Driven Adaptive Dropout Rates for Improved Inference-Time Uncertainty Estimation
by: Zeevi, Tal, et al.
Published: (2024)
by: Zeevi, Tal, et al.
Published: (2024)
The Illusion of AI Expertise Under Uncertainty: Navigating Elusive Ground Truth via a Probabilistic Paradigm
by: Elangovan, Aparna, et al.
Published: (2026)
by: Elangovan, Aparna, et al.
Published: (2026)
Measure what Matters: Psychometric Evaluation of AI with Situational Judgment Tests
by: Yost, Alexandra, et al.
Published: (2025)
by: Yost, Alexandra, et al.
Published: (2025)
Beyond the Loss Curve: Scaling Laws, Active Learning, and the Limits of Learning from Exact Posteriors
by: Khorasani, Arian, et al.
Published: (2026)
by: Khorasani, Arian, et al.
Published: (2026)
Layer Importance for Mathematical Reasoning is Forged in Pre-Training and Invariant after Post-Training
by: Nepal, Aadim, et al.
Published: (2025)
by: Nepal, Aadim, et al.
Published: (2025)
Maia-2: A Unified Model for Human-AI Alignment in Chess
by: Tang, Zhenwei, et al.
Published: (2024)
by: Tang, Zhenwei, et al.
Published: (2024)
Between Institutional Loneliness and Visibility: Low‐Income Families Navigating Housing Insecurity in Social Welfare Programs
by: Tamar Shwartz‐Ziv, et al.
Published: (2025)
by: Tamar Shwartz‐Ziv, et al.
Published: (2025)
JEPA as a Neural Tokenizer: Learning Robust Speech Representations with Density Adaptive Attention
by: Ioannides, Georgios, et al.
Published: (2025)
by: Ioannides, Georgios, et al.
Published: (2025)
Just How Flexible are Neural Networks in Practice?
by: Shwartz-Ziv, Ravid, et al.
Published: (2024)
by: Shwartz-Ziv, Ravid, et al.
Published: (2024)
UAT-LITE: Inference-Time Uncertainty-Aware Attention for Pretrained Transformers
by: Hossain, Elias, et al.
Published: (2026)
by: Hossain, Elias, et al.
Published: (2026)
Sequencing the Neurome: Towards Scalable Exact Parameter Reconstruction of Black-Box Neural Networks
by: Goldfeder, Judah, et al.
Published: (2024)
by: Goldfeder, Judah, et al.
Published: (2024)
Direct Robot Configuration Space Construction using Convolutional Encoder-Decoders
by: Benka, Christopher, et al.
Published: (2023)
by: Benka, Christopher, et al.
Published: (2023)
Numerical solution of BVP for the incompressible Navier-Stokes equations at large Reynolds numbers
by: Lomasov, D. V., et al.
Published: (2024)
by: Lomasov, D. V., et al.
Published: (2024)
Layer by Layer: Uncovering Hidden Representations in Language Models
by: Skean, Oscar, et al.
Published: (2025)
by: Skean, Oscar, et al.
Published: (2025)
Seq-VCR: Preventing Collapse in Intermediate Transformer Representations for Enhanced Reasoning
by: Arefin, Md Rifat, et al.
Published: (2024)
by: Arefin, Md Rifat, et al.
Published: (2024)
Exploring the Relationship Between Teachers' AI Attitudes, AI Self‐Efficacy, and AI Technological Pedagogical Content Knowledge
by: Mustafa Erol, et al.
Published: (2025)
by: Mustafa Erol, et al.
Published: (2025)
Similar Items
-
AI Must Embrace Specialization via Superhuman Adaptable Intelligence
by: Goldfeder, Judah, et al.
Published: (2026) -
Antislop: A Comprehensive Framework for Identifying and Eliminating Repetitive Patterns in Language Models
by: Paech, Samuel, et al.
Published: (2025) -
A superpersuasive autonomous policy debating system
by: Roush, Allen, et al.
Published: (2025) -
Generating Auxiliary Tasks with Reinforcement Learning
by: Goldfeder, Judah, et al.
Published: (2025) -
Soft Clustering Anchors for Self-Supervised Speech Representation Learning in Joint Embedding Prediction Architectures
by: Ioannides, Georgios, et al.
Published: (2026)