:: Library Catalog

Bibliographic Details
Main Authors:	Prashanth, USVSN Sai; Deng, Alvin; O'Brien, Kyle; S V, Jyothir; Khan, Mohammad Aflah; Borkar, Jaydeep; Choquette-Choo, Christopher A.; Fuehne, Jacob Ray; Biderman, Stella; Ke, Tracy; Lee, Katherine; Saphra, Naomi
Format:	Preprint
Published:	2024
Subjects:	Computation and Language; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2406.17746

Similar Items

Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
by: Borkar, Jaydeep, et al.
Published: (2025)

PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
by: van der Wal, Oskar, et al.
Published: (2025)

Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription
by: Borkar, Jaydeep, et al.
Published: (2024)

Mechanistic?
by: Saphra, Naomi, et al.
Published: (2024)

Memorization Dynamics in Knowledge Distillation for Language Models
by: Borkar, Jaydeep, et al.
Published: (2026)

Hidden Breakthroughs in Language Model Training
by: Kangaslahti, Sara, et al.
Published: (2025)

ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context
by: Li, Victoria R., et al.
Published: (2024)

Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization
by: Qin, Tian, et al.
Published: (2024)

Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
by: O'Brien, Kyle, et al.
Published: (2025)

Recital Review
Published: (2022)

A Taxonomy of Transcendence
by: Abreu, Natalie, et al.
Published: (2025)

First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models
by: Saphra, Naomi, et al.
Published: (2023)

TRAM: Bridging Trust Regions and Sharpness Aware Minimization
by: Sherborne, Tom, et al.
Published: (2023)

Fast Forwarding Low-Rank Training
by: Rahamim, Adir, et al.
Published: (2024)

Rethinking Memorization Measures and their Implications in Large Language Models
by: Ghosh, Bishwamittra, et al.
Published: (2025)

Optimal Rates for $O(1)$-Smooth DP-SCO with a Single Epoch and Large Batches
by: Choquette-Choo, Christopher A., et al.
Published: (2024)

MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization
by: O'Brien, Dayyán, et al.
Published: (2025)

Latent State Models of Training Dynamics
by: Hu, Michael Y., et al.
Published: (2023)

Recollecting Resonances
Published: (2020)

Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs
by: Weissweiler, Leonie, et al.
Published: (2025)

Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
by: Zhang, Yuwei, et al.
Published: (2025)

Rote Learning Considered Useful: Generalizing over Memorized Data in LLMs
by: Wu, Qinyuan, et al.
Published: (2025)

The Censorship Phenomenon in College and Research Libraries: An Investigation of the Canadian Prairie Provinces, 1980-1985.
by: Schrader, Alvin M., et al.
Published: (1989)

LLM Circuit Analyses Are Consistent Across Training and Scale
by: Tigges, Curt, et al.
Published: (2024)

Grokking Group Multiplication with Cosets
by: Stander, Dashiell, et al.
Published: (2023)

The Ghost in the Keys: A Disklavier Demo for Human-AI Musical Co-Creativity
by: Bradshaw, Louis, et al.
Published: (2025)

Causal Drawbridges: Characterizing Gradient Blocking of Syntactic Islands in Transformer LMs
by: Boguraev, Sasha, et al.
Published: (2026)

Privacy Amplification for Matrix Mechanisms
by: Choquette-Choo, Christopher A., et al.
Published: (2023)

Na ante-sala da discriminação: o preço dos atributos de sexo e cor no Brasil (1989-1999)
by: Biderman, Ciro
Published: (2004)

Benchmarks as Microscopes: A Call for Model Metrology
by: Saxon, Michael, et al.
Published: (2024)

Random Scaling of Emergent Capabilities
by: Zhao, Rosie, et al.
Published: (2025)

Dynamic Masking Rate Schedules for MLM Pretraining
by: Ankner, Zachary, et al.
Published: (2023)

Auditing Private Prediction
by: Chadha, Karan, et al.
Published: (2024)

Hubble: a Model Suite to Advance the Study of LLM Memorization
by: Wei, Johnny Tian-Zheng, et al.
Published: (2025)

Recite Your Ask Out Loud
Published: (2025)

Recollections of a Nuclear War
by: Morrison, Philip
Published: (1945)

Recollections of a Nuclear War
Published: (1995)

A suite of LMs comprehend puzzle statements as well as humans
by: Goldberg, Adele E, et al.
Published: (2025)

What Matters in Memorizing and Recalling Facts? Multifaceted Benchmarks for Knowledge Probing in Language Models
by: Zhao, Xin, et al.
Published: (2024)

Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs
by: Chen, Angelica, et al.
Published: (2023)