:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Karkada, Dhruva, Simon, James B., Bahri, Yasaman, DeWeese, Michael R.
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Computation and Language
Online Access:	https://arxiv.org/abs/2502.09863
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

On the Emergence of Linear Analogies in Word Embeddings
by: Korchinski, Daniel J., et al.
Published: (2025)

Symmetry in language statistics shapes the geometry of model representations
by: Karkada, Dhruva, et al.
Published: (2026)

Alternating Gradient Flows: A Theory of Feature Learning in Two-layer Neural Networks
by: Kunin, Daniel, et al.
Published: (2025)

Beyond Linear Response: Equivalence between Thermodynamic Geometry and Optimal Transport
by: Zhong, Adrianne, et al.
Published: (2024)

The lazy (NTK) and rich ($μ$P) regimes: a gentle tutorial
by: Karkada, Dhruva
Published: (2024)

Locally Interdependent Multi-Agent MDP: Theoretical Framework for Decentralized Agents with Dynamic Dependencies
by: DeWeese, Alex, et al.
Published: (2024)

Status Concerns and Library Professionalism
by: DeWeese, L. Carroll
Published: (1972)

Revisiting Policy Gradients for Restricted Policy Classes: Escaping Myopic Local Optima with $k$-step Policy Gradients
by: DeWeese, Alex, et al.
Published: (2026)

Thinking Beyond Visibility: A Near-Optimal Policy Framework for Locally Interdependent Multi-Agent MDPs
by: DeWeese, Alex, et al.
Published: (2025)

A Paradigm of Commitment
by: DeWeese, Lemuel Carroll, III
Published: (1970)

A Paradigm of Commitment: Toward Professional Identity for Librarians.
by: DeWeese, Lemuel Carroll, III
Published: (1970)

A Theory of Saddle Escape in Deep Nonlinear Networks
by: Rawal, Divit, et al.
Published: (2026)

More is Better in Modern Machine Learning: when Infinite Overparameterization is Optimal and Overfitting is Obligatory
by: Simon, James B., et al.
Published: (2023)

Temperature and flow data from a sediment tank experiment and numerical Advection-Dispersion Model code
by: Luce, Charles, et al.
Published: (2017)

Predicting kernel regression learning curves from only raw data statistics
by: Karkada, Dhruva, et al.
Published: (2025)

The Thermodynamic Costs of Simple Linear Regression
by: D'Ambrosia, Samuel H., et al.
Published: (2026)

Context Structure Reshapes the Representational Geometry of Language Models
by: Hosseini, Eghbal A., et al.
Published: (2026)

Higher-order response theory in optimal stochastic thermodynamics
by: DAmbrosia, Samuel. H., et al.
Published: (2025)

Time-Asymmetric Fluctuation Theorem and Efficient Free Energy Estimation
by: Zhong, Adrianne, et al.
Published: (2023)

Spoken Word2Vec: Learning Skipgram Embeddings from Speech
by: Sayeed, Mohammad Amaan, et al.
Published: (2023)

Language Models Implement Simple Word2Vec-style Vector Arithmetic
by: Merullo, Jack, et al.
Published: (2023)

The Entropy of Floating-Point Numbers
by: Daniels, Sultan, et al.
Published: (2026)

Meta-Learning for Better Learning: Using Meta-Learning Methods to Automatically Label Exam Questions with Detailed Learning Objectives
by: Zur, Amir, et al.
Published: (2023)

Evaluating the Representation of Vowels in Wav2Vec Feature Extractor: A Layer-Wise Analysis Using MFCCs
by: De Cristofaro, Domenico, et al.
Published: (2025)

Tokenization Strategies for Low-Resource Agglutinative Languages in Word2Vec: Case Study on Turkish and Finnish
by: Hu, Jinfan Frank
Published: (2025)

From Word2Vec to Transformers: Text-Derived Composition Embeddings for Filtering Combinatorial Electrocatalysts
by: Zhang, Lei, et al.
Published: (2026)

Disaggregation Reveals Hidden Training Dynamics: The Case of Agreement Attraction
by: Michaelov, James A., et al.
Published: (2025)

Constructions are Revealed in Word Distributions
by: Rozner, Joshua, et al.
Published: (2025)

Improving Detection of Watermarked Language Models
by: Bahri, Dara, et al.
Published: (2025)

Active Learning of Upward-Closed Sets of Words
by: Aristote, Quentin
Published: (2025)

A Watermark for Black-Box Language Models
by: Bahri, Dara, et al.
Published: (2024)

VLM2Vec: Training Vision-Language Models for Massive Multimodal Embedding Tasks
by: Jiang, Ziyan, et al.
Published: (2024)

GT2Vec: Large Language Models as Multi-Modal Encoders for Text and Graph-Structured Data
by: Lin, Jiacheng, et al.
Published: (2024)

VecGlypher: Unified Vector Glyph Generation with Language Models
by: Huang, Xiaoke, et al.
Published: (2026)

EchoPrompt: Instructing the Model to Rephrase Queries for Improved In-context Learning
by: Mekala, Rajasekhar Reddy, et al.
Published: (2023)

Constructing Vec-tionaries to Extract Message Features from Texts: A Case Study of Moral Appeals
by: Duan, Zening, et al.
Published: (2023)

Gram2Vec: An Interpretable Document Vectorizer
by: Zeng, Peter, et al.
Published: (2024)

Linear Dynamics in the RLVR Training of Large Language Models
by: Wang, Tianle, et al.
Published: (2026)

Domain2Vec: Vectorizing Datasets to Find the Optimal Data Mixture without Training
by: Zhang, Mozhi, et al.
Published: (2025)

New Encoders for German Trained from Scratch: Comparing ModernGBERT with Converted LLM2Vec Models
by: Wunderle, Julia, et al.
Published: (2025)