Saved in:
| Main Authors: | Kirsanov, Artem, Chou, Chi-Ning, Cho, Kyunghyun, Chung, SueYeon |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2502.08009 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Diagnosing Generalization Failures from Representational Geometry Markers
by: Chou, Chi-Ning, et al.
Published: (2026)
by: Chou, Chi-Ning, et al.
Published: (2026)
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
by: Chou, Chi-Ning, et al.
Published: (2025)
by: Chou, Chi-Ning, et al.
Published: (2025)
Nonlinear classification of neural manifolds with contextual information
by: Mignacco, Francesca, et al.
Published: (2024)
by: Mignacco, Francesca, et al.
Published: (2024)
Linear Readout of Neural Manifolds with Continuous Variables
by: Slatton, Will, et al.
Published: (2026)
by: Slatton, Will, et al.
Published: (2026)
Statistical Mechanics of Support Vector Regression
by: Canatar, Abdulkadir, et al.
Published: (2024)
by: Canatar, Abdulkadir, et al.
Published: (2024)
Emergent Manifold Separability during Reasoning in Large Language Models
by: Chun, Chanwoo, et al.
Published: (2026)
by: Chun, Chanwoo, et al.
Published: (2026)
Neural population geometry and optimal coding of tasks with shared latent structure
by: Wakhloo, Albert J., et al.
Published: (2024)
by: Wakhloo, Albert J., et al.
Published: (2024)
Task-Induced Representational Invariances Depend on Learning Objective in Deep RL
by: Halvagal, Manu Srinath, et al.
Published: (2026)
by: Halvagal, Manu Srinath, et al.
Published: (2026)
Two Speeds of Learning: A Representation-Readout Decomposition of Grokking and Double Descent
by: Chou, Chi-Ning, et al.
Published: (2026)
by: Chou, Chi-Ning, et al.
Published: (2026)
Shared Heritage, Distinct Writing: Rethinking Resource Selection for East Asian Historical Documents
by: Song, Seyoung, et al.
Published: (2024)
by: Song, Seyoung, et al.
Published: (2024)
Scaling Laws Are Unreliable for Downstream Tasks: A Reality Check
by: Lourie, Nicholas, et al.
Published: (2025)
by: Lourie, Nicholas, et al.
Published: (2025)
Spectral Analysis of Representational Similarity with Limited Neurons
by: Kang, Hyunmo, et al.
Published: (2025)
by: Kang, Hyunmo, et al.
Published: (2025)
Language Models as Causal Effect Generators
by: Bynum, Lucius E. J., et al.
Published: (2024)
by: Bynum, Lucius E. J., et al.
Published: (2024)
First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models
by: Saphra, Naomi, et al.
Published: (2023)
by: Saphra, Naomi, et al.
Published: (2023)
System-Level Natural Language Feedback
by: Yuan, Weizhe, et al.
Published: (2023)
by: Yuan, Weizhe, et al.
Published: (2023)
Code-Switching In-Context Learning for Cross-Lingual Transfer of Large Language Models
by: Yoo, Haneul, et al.
Published: (2025)
by: Yoo, Haneul, et al.
Published: (2025)
Estimating the Spectral Moments of the Kernel Integral Operator from Finite Sample Matrices
by: Chun, Chanwoo, et al.
Published: (2024)
by: Chun, Chanwoo, et al.
Published: (2024)
Characterizing the Predictive Impact of Modalities with Supervised Latent-Variable Modeling
by: Madaan, Divyam, et al.
Published: (2026)
by: Madaan, Divyam, et al.
Published: (2026)
Generalization Measures for Zero-Shot Cross-Lingual Transfer
by: Bassi, Saksham, et al.
Published: (2024)
by: Bassi, Saksham, et al.
Published: (2024)
Transfer-Prompting: Enhancing Cross-Task Adaptation in Large Language Models via Dual-Stage Prompts Optimization
by: Chang, Yupeng, et al.
Published: (2025)
by: Chang, Yupeng, et al.
Published: (2025)
Efficient semantic uncertainty quantification in language models via diversity-steered sampling
by: Park, Ji Won, et al.
Published: (2025)
by: Park, Ji Won, et al.
Published: (2025)
Hyperparameter Loss Surfaces Are Simple Near their Optima
by: Lourie, Nicholas, et al.
Published: (2025)
by: Lourie, Nicholas, et al.
Published: (2025)
Show Your Work with Confidence: Confidence Bands for Tuning Curves
by: Lourie, Nicholas, et al.
Published: (2023)
by: Lourie, Nicholas, et al.
Published: (2023)
Cross-Lingual Unlearning of Selective Knowledge in Multilingual Language Models
by: Choi, Minseok, et al.
Published: (2024)
by: Choi, Minseok, et al.
Published: (2024)
Estimating Dimensionality of Neural Representations from Finite Samples
by: Chun, Chanwoo, et al.
Published: (2025)
by: Chun, Chanwoo, et al.
Published: (2025)
TAPO: Task-Referenced Adaptation for Prompt Optimization
by: Luo, Wenxin, et al.
Published: (2025)
by: Luo, Wenxin, et al.
Published: (2025)
No Two Devils Alike: Unveiling Distinct Mechanisms of Fine-tuning Attacks
by: Leong, Chak Tou, et al.
Published: (2024)
by: Leong, Chak Tou, et al.
Published: (2024)
Temporal Generalization: A Reality Check
by: Madaan, Divyam, et al.
Published: (2025)
by: Madaan, Divyam, et al.
Published: (2025)
Jointly Modeling Inter- & Intra-Modality Dependencies for Multi-modal Learning
by: Madaan, Divyam, et al.
Published: (2024)
by: Madaan, Divyam, et al.
Published: (2024)
Self-Rewarding Language Models
by: Yuan, Weizhe, et al.
Published: (2024)
by: Yuan, Weizhe, et al.
Published: (2024)
On the Relationship Between the Choice of Representation and In-Context Learning
by: Marinescu, Ioana, et al.
Published: (2025)
by: Marinescu, Ioana, et al.
Published: (2025)
Aioli: A Unified Optimization Framework for Language Model Data Mixing
by: Chen, Mayee F., et al.
Published: (2024)
by: Chen, Mayee F., et al.
Published: (2024)
Unveiling and Manipulating Prompt Influence in Large Language Models
by: Feng, Zijian, et al.
Published: (2024)
by: Feng, Zijian, et al.
Published: (2024)
Task as Context Prompting for Accurate Medical Symptom Coding Using Large Language Models
by: He, Chengyang, et al.
Published: (2025)
by: He, Chengyang, et al.
Published: (2025)
Training Language Models with Language Feedback at Scale
by: Scheurer, Jérémy, et al.
Published: (2023)
by: Scheurer, Jérémy, et al.
Published: (2023)
MentalAgora: A Gateway to Advanced Personalized Care in Mental Health through Multi-Agent Debating and Attribute Control
by: Lee, Yeonji, et al.
Published: (2024)
by: Lee, Yeonji, et al.
Published: (2024)
Estimating Neural Representation Alignment from Sparsely Sampled Inputs and Features
by: Chun, Chanwoo, et al.
Published: (2025)
by: Chun, Chanwoo, et al.
Published: (2025)
Unveiling Imitation Learning: Exploring the Impact of Data Falsity to Large Language Model
by: Cho, Hyunsoo
Published: (2024)
by: Cho, Hyunsoo
Published: (2024)
How Does Vision-Language Adaptation Impact the Safety of Vision Language Models?
by: Lee, Seongyun, et al.
Published: (2024)
by: Lee, Seongyun, et al.
Published: (2024)
Embedding Geometries of Contrastive Language-Image Pre-Training
by: Chou, Jason Chuan-Chih, et al.
Published: (2024)
by: Chou, Jason Chuan-Chih, et al.
Published: (2024)
Similar Items
-
Diagnosing Generalization Failures from Representational Geometry Markers
by: Chou, Chi-Ning, et al.
Published: (2026) -
Feature Learning beyond the Lazy-Rich Dichotomy: Insights from Representational Geometry
by: Chou, Chi-Ning, et al.
Published: (2025) -
Nonlinear classification of neural manifolds with contextual information
by: Mignacco, Francesca, et al.
Published: (2024) -
Linear Readout of Neural Manifolds with Continuous Variables
by: Slatton, Will, et al.
Published: (2026) -
Statistical Mechanics of Support Vector Regression
by: Canatar, Abdulkadir, et al.
Published: (2024)