| Main Authors: | Karkada, Dhruva, Simon, James B., Bahri, Yasaman, DeWeese, Michael R. |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2502.09863 |
Similar Items
On the Emergence of Linear Analogies in Word Embeddings
by: Korchinski, Daniel J., et al.
Published: (2025)
Symmetry in language statistics shapes the geometry of model representations
by: Karkada, Dhruva, et al.
Published: (2026)
Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks
by: Kunin, Daniel, et al.
Published: (2025)
Beyond Linear Response: Equivalence between Thermodynamic Geometry and Optimal Transport
by: Zhong, Adrianne, et al.
Published: (2024)
The lazy (NTK) and rich ($μ$P) regimes: a gentle tutorial
by: Karkada, Dhruva
Published: (2024)
Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
by: DeWeese, Alex, et al.
Published: (2024)
Status Concerns and Library Professionalism
by: DeWeese, L. Carroll
Published: (1972)
Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$-step Policy Gradients
by: DeWeese, Alex, et al.
Published: (2026)
Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs
by: DeWeese, Alex, et al.
Published: (2025)
A Paradigm of Commitment
by: DeWeese, Lemuel Carroll, III
Published: (1970)
A Paradigm of Commitment: Toward Professional Identity for Librarians.
by: DeWeese, Lemuel Carroll, III
Published: (1970)
A Theory of Saddle Escape in Deep Nonlinear Networks
by: Rawal, Divit, et al.
Published: (2026)
More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
by: Simon, James B., et al.
Published: (2023)
Temperature and flow data from a sediment tank experiment and numerical Advection-Dispersion Model code
by: Luce, Charles, et al.
Published: (2017)
Predicting kernel regression learning curves from only raw data statistics
by: Karkada, Dhruva, et al.
Published: (2025)
The Thermodynamic Costs of Simple Linear Regression
by: D'Ambrosia, Samuel H., et al.
Published: (2026)
Context Structure Reshapes the Representational Geometry of Language Models
by: Hosseini, Eghbal A., et al.
Published: (2026)
Higher-order response theory in optimal stochastic thermodynamics
by: D'Ambrosia, Samuel H., et al.
Published: (2025)
Time-Asymmetric Fluctuation Theorem and Efficient Free Energy Estimation
by: Zhong, Adrianne, et al.
Published: (2023)
Spoken Word2Vec: Learning Skipgram Embeddings from Speech
by: Sayeed, Mohammad Amaan, et al.
Published: (2023)
Language Models Implement Simple Word2Vec-style Vector Arithmetic
by: Merullo, Jack, et al.
Published: (2023)
The Entropy of Floating-Point Numbers
by: Daniels, Sultan, et al.
Published: (2026)
Meta-Learning for Better Learning: Using Meta-Learning Methods to Automatically Label Exam Questions with Detailed Learning Objectives
by: Zur, Amir, et al.
Published: (2023)
Evaluating the Representation of Vowels in Wav2Vec Feature Extractor: A Layer-Wise Analysis Using MFCCs
by: De Cristofaro, Domenico, et al.
Published: (2025)
Tokenization Strategies for Low-Resource Agglutinative Languages in Word2Vec: Case Study on Turkish and Finnish
by: Hu, Jinfan Frank
Published: (2025)
From Word2Vec to Transformers: Text-Derived Composition Embeddings for Filtering Combinatorial Electrocatalysts
by: Zhang, Lei, et al.
Published: (2026)
Disaggregation Reveals Hidden Training Dynamics: The Case of Agreement Attraction
by: Michaelov, James A., et al.
Published: (2025)
Constructions are Revealed in Word Distributions
by: Rozner, Joshua, et al.
Published: (2025)
Improving Detection of Watermarked Language Models
by: Bahri, Dara, et al.
Published: (2025)
Active Learning of Upward-Closed Sets of Words
by: Aristote, Quentin
Published: (2025)
A Watermark for Black-Box Language Models
by: Bahri, Dara, et al.
Published: (2024)
VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
by: Jiang, Ziyan, et al.
Published: (2024)
GT2Vec: Large Language Models as Multi-Modal Encoders for Text and Graph-Structured Data
by: Lin, Jiacheng, et al.
Published: (2024)
VecGlypher: Unified Vector Glyph Generation with Language Models
by: Huang, Xiaoke, et al.
Published: (2026)
EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning
by: Mekala, Rajasekhar Reddy, et al.
Published: (2023)
Constructing Vec-tionaries to Extract Message Features from Texts: A Case Study of Moral Appeals
by: Duan, Zening, et al.
Published: (2023)
Gram2Vec: An Interpretable Document Vectorizer
by: Zeng, Peter, et al.
Published: (2024)
Linear Dynamics in the RLVR Training of Large Language Models
by: Wang, Tianle, et al.
Published: (2026)
Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training
by: Zhang, Mozhi, et al.
Published: (2025)
New Encoders for German Trained from Scratch: Comparing ModernGBERT with Converted LLM2Vec Models
by: Wunderle, Julia, et al.
Published: (2025)