Saved in:
| Main Authors: | Liang, Qiyao, Song, Jinyeop, Liu, Yizhou, Gore, Jeff, Fiete, Ila, Miikkulainen, Risto, Qiu, Xin |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00170 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Attractor Geometry of Transformer Memory: From Conflict Arbitration to Confident Hallucination
by: Liang, Qiyao, et al.
Published: (2026)
by: Liang, Qiyao, et al.
Published: (2026)
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
by: Xu, Yinggan, et al.
Published: (2026)
by: Xu, Yinggan, et al.
Published: (2026)
Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?
by: Liang, Qiyao, et al.
Published: (2024)
by: Liang, Qiyao, et al.
Published: (2024)
Compositional Generalization via Forced Rendering of Disentangled Latents
by: Liang, Qiyao, et al.
Published: (2025)
by: Liang, Qiyao, et al.
Published: (2025)
How Diffusion Models Learn to Factorize and Compose
by: Liang, Qiyao, et al.
Published: (2024)
by: Liang, Qiyao, et al.
Published: (2024)
Emergence and Effectiveness of Task Vectors in In-Context Learning: An Encoder Decoder Perspective
by: Han, Seungwook, et al.
Published: (2024)
by: Han, Seungwook, et al.
Published: (2024)
Overcoming Forgetting in LLM Fine-Tuning with Evolution Strategies
by: Schweighofer, Kajetan, et al.
Published: (2026)
by: Schweighofer, Kajetan, et al.
Published: (2026)
Estimating the Empowerment of Language Model Agents
by: Song, Jinyeop, et al.
Published: (2025)
by: Song, Jinyeop, et al.
Published: (2025)
Evolution Strategies at Scale: LLM Fine-Tuning Beyond Reinforcement Learning
by: Qiu, Xin, et al.
Published: (2025)
by: Qiu, Xin, et al.
Published: (2025)
A Resource Model For Neural Scaling Law
by: Song, Jinyeop, et al.
Published: (2024)
by: Song, Jinyeop, et al.
Published: (2024)
Semantic Density: Uncertainty Quantification for Large Language Models through Confidence Measurement in Semantic Space
by: Qiu, Xin, et al.
Published: (2024)
by: Qiu, Xin, et al.
Published: (2024)
The Odyssey of the Fittest: Can Agents Survive and Still Be Good?
by: Waldner, Dylan, et al.
Published: (2025)
by: Waldner, Dylan, et al.
Published: (2025)
Superposition Yields Robust Neural Scaling
by: Liu, Yizhou, et al.
Published: (2025)
by: Liu, Yizhou, et al.
Published: (2025)
Fine-Tuning Language Models to Know What They Know
by: Park, Sangjun, et al.
Published: (2026)
by: Park, Sangjun, et al.
Published: (2026)
Permutation Invariant Learning with High-Dimensional Particle Filters
by: Boopathy, Akhilan, et al.
Published: (2024)
by: Boopathy, Akhilan, et al.
Published: (2024)
Modular connectivity in neural networks emerges from Poisson noise-motivated regularisation, and promotes robustness and compositional generalisation
by: Qian, Daoyuan, et al.
Published: (2025)
by: Qian, Daoyuan, et al.
Published: (2025)
Universal One-third Time Scaling in Learning Peaked Distributions
by: Liu, Yizhou, et al.
Published: (2026)
by: Liu, Yizhou, et al.
Published: (2026)
EVOTER: Evolution of Transparent Explainable Rule-sets
by: Shahrzad, Hormoz, et al.
Published: (2022)
by: Shahrzad, Hormoz, et al.
Published: (2022)
Key-value memory in the brain
by: Gershman, Samuel J., et al.
Published: (2025)
by: Gershman, Samuel J., et al.
Published: (2025)
Inverse Depth Scaling From Most Layers Being Similar
by: Liu, Yizhou, et al.
Published: (2026)
by: Liu, Yizhou, et al.
Published: (2026)
Optimizing the Design of an Artificial Pancreas to Improve Diabetes Management
by: Khanna, Ashok, et al.
Published: (2024)
by: Khanna, Ashok, et al.
Published: (2024)
Neural Thermodynamic Laws for Large Language Model Training
by: Liu, Ziming, et al.
Published: (2025)
by: Liu, Ziming, et al.
Published: (2025)
Ratio-Variance Regularized Policy Optimization for Efficient LLM Fine-tuning
by: Luo, Yu, et al.
Published: (2026)
by: Luo, Yu, et al.
Published: (2026)
Neural Cellular Automata for ARC-AGI
by: Xu, Kevin, et al.
Published: (2025)
by: Xu, Kevin, et al.
Published: (2025)
The Blessing and Curse of Dimensionality in Safety Alignment
by: Teo, Rachel S. Y., et al.
Published: (2025)
by: Teo, Rachel S. Y., et al.
Published: (2025)
Superposition unifies power-law training dynamics
by: Chen, Zixin Jessie, et al.
Published: (2026)
by: Chen, Zixin Jessie, et al.
Published: (2026)
Optimize Wider, Not Deeper: Consensus Aggregation for Policy Optimization
by: Su, Zelal, et al.
Published: (2026)
by: Su, Zelal, et al.
Published: (2026)
Rethinking Safety in LLM Fine-tuning: An Optimization Perspective
by: Kim, Minseon, et al.
Published: (2025)
by: Kim, Minseon, et al.
Published: (2025)
Efficient Pre-Training of LLMs through Truncated SVD Layers
by: Kamali, Kaivan, et al.
Published: (2026)
by: Kamali, Kaivan, et al.
Published: (2026)
Physics of Skill Learning
by: Liu, Ziming, et al.
Published: (2025)
by: Liu, Ziming, et al.
Published: (2025)
NeuroBack: Improving CDCL SAT Solving using Graph Neural Networks
by: Wang, Wenxi, et al.
Published: (2021)
by: Wang, Wenxi, et al.
Published: (2021)
Learning from the Past: How Previous Technological Transformations Can Guide AI Development
by: Miikkulainen, Risto, et al.
Published: (2019)
by: Miikkulainen, Risto, et al.
Published: (2019)
Parameter Efficient Fine-tuning via Explained Variance Adaptation
by: Paischer, Fabian, et al.
Published: (2024)
by: Paischer, Fabian, et al.
Published: (2024)
Effective Regularization Through Loss-Function Metalearning
by: Gonzalez, Santiago, et al.
Published: (2020)
by: Gonzalez, Santiago, et al.
Published: (2020)
The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training
by: Cao, Hengjie, et al.
Published: (2026)
by: Cao, Hengjie, et al.
Published: (2026)
Learn Globally, Speak Locally: Bridging the Gaps in Multilingual Reasoning
by: Hwang, Jaedong, et al.
Published: (2025)
by: Hwang, Jaedong, et al.
Published: (2025)
Transformer Copilot: Learning from The Mistake Log in LLM Fine-tuning
by: Zou, Jiaru, et al.
Published: (2025)
by: Zou, Jiaru, et al.
Published: (2025)
Outlier-weighed Layerwise Sampling for LLM Fine-tuning
by: Li, Pengxiang, et al.
Published: (2024)
by: Li, Pengxiang, et al.
Published: (2024)
Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective
by: Xie, Hong, et al.
Published: (2026)
by: Xie, Hong, et al.
Published: (2026)
Large Pre-Training Datasets Don't Always Guarantee Robustness after Fine-Tuning
by: Hwang, Jaedong, et al.
Published: (2024)
by: Hwang, Jaedong, et al.
Published: (2024)
Similar Items
-
Attractor Geometry of Transformer Memory: From Conflict Arbitration to Confident Hallucination
by: Liang, Qiyao, et al.
Published: (2026) -
Quantized Evolution Strategies: High-precision Fine-tuning of Quantized LLMs at Low-precision Cost
by: Xu, Yinggan, et al.
Published: (2026) -
Do Diffusion Models Learn Semantically Meaningful and Efficient Representations?
by: Liang, Qiyao, et al.
Published: (2024) -
Compositional Generalization via Forced Rendering of Disentangled Latents
by: Liang, Qiyao, et al.
Published: (2025) -
How Diffusion Models Learn to Factorize and Compose
by: Liang, Qiyao, et al.
Published: (2024)