Saved in:
| Main Authors: | Prashanth, USVSN Sai, Deng, Alvin, O'Brien, Kyle, S V, Jyothir, Khan, Mohammad Aflah, Borkar, Jaydeep, Choquette-Choo, Christopher A., Fuehne, Jacob Ray, Biderman, Stella, Ke, Tracy, Lee, Katherine, Saphra, Naomi |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2406.17746 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
by: Borkar, Jaydeep, et al.
Published: (2025)
by: Borkar, Jaydeep, et al.
Published: (2025)
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
by: van der Wal, Oskar, et al.
Published: (2025)
by: van der Wal, Oskar, et al.
Published: (2025)
Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription
by: Borkar, Jaydeep, et al.
Published: (2024)
by: Borkar, Jaydeep, et al.
Published: (2024)
Mechanistic?
by: Saphra, Naomi, et al.
Published: (2024)
by: Saphra, Naomi, et al.
Published: (2024)
Memorization Dynamics in Knowledge Distillation for Language Models
by: Borkar, Jaydeep, et al.
Published: (2026)
by: Borkar, Jaydeep, et al.
Published: (2026)
Hidden Breakthroughs in Language Model Training
by: Kangaslahti, Sara, et al.
Published: (2025)
by: Kangaslahti, Sara, et al.
Published: (2025)
ChatGPT Doesn't Trust Chargers Fans: Guardrail Sensitivity in Context
by: Li, Victoria R., et al.
Published: (2024)
by: Li, Victoria R., et al.
Published: (2024)
Sometimes I am a Tree: Data Drives Unstable Hierarchical Generalization
by: Qin, Tian, et al.
Published: (2024)
by: Qin, Tian, et al.
Published: (2024)
Deep Ignorance: Filtering Pretraining Data Builds Tamper-Resistant Safeguards into Open-Weight LLMs
by: O'Brien, Kyle, et al.
Published: (2025)
by: O'Brien, Kyle, et al.
Published: (2025)
Recital Review
Published: (2022)
Published: (2022)
A Taxonomy of Transcendence
by: Abreu, Natalie, et al.
Published: (2025)
by: Abreu, Natalie, et al.
Published: (2025)
First Tragedy, then Parse: History Repeats Itself in the New Era of Large Language Models
by: Saphra, Naomi, et al.
Published: (2023)
by: Saphra, Naomi, et al.
Published: (2023)
TRAM: Bridging Trust Regions and Sharpness Aware Minimization
by: Sherborne, Tom, et al.
Published: (2023)
by: Sherborne, Tom, et al.
Published: (2023)
Fast Forwarding Low-Rank Training
by: Rahamim, Adir, et al.
Published: (2024)
by: Rahamim, Adir, et al.
Published: (2024)
Rethinking Memorization Measures and their Implications in Large Language Models
by: Ghosh, Bishwamittra, et al.
Published: (2025)
by: Ghosh, Bishwamittra, et al.
Published: (2025)
Optimal Rates for $O(1)$-Smooth DP-SCO with a Single Epoch and Large Batches
by: Choquette-Choo, Christopher A., et al.
Published: (2024)
by: Choquette-Choo, Christopher A., et al.
Published: (2024)
MatheMagic: Generating Dynamic Mathematics Benchmarks Robust to Memorization
by: O'Brien, Dayyán, et al.
Published: (2025)
by: O'Brien, Dayyán, et al.
Published: (2025)
Latent State Models of Training Dynamics
by: Hu, Michael Y., et al.
Published: (2023)
by: Hu, Michael Y., et al.
Published: (2023)
Recollecting Resonances
Published: (2020)
Published: (2020)
Linguistic Generalizations are not Rules: Impacts on Evaluation of LMs
by: Weissweiler, Leonie, et al.
Published: (2025)
by: Weissweiler, Leonie, et al.
Published: (2025)
Bidirectional LMs are Better Knowledge Memorizers? A Benchmark for Real-world Knowledge Injection
by: Zhang, Yuwei, et al.
Published: (2025)
by: Zhang, Yuwei, et al.
Published: (2025)
Rote Learning Considered Useful: Generalizing over Memorized Data in LLMs
by: Wu, Qinyuan, et al.
Published: (2025)
by: Wu, Qinyuan, et al.
Published: (2025)
The Censorship Phenomenon in College and Research Libraries: An Investigation of the Canadian Prairie Provinces, 1980-1985.
by: Schrader, Alvin M., et al.
Published: (1989)
by: Schrader, Alvin M., et al.
Published: (1989)
LLM Circuit Analyses Are Consistent Across Training and Scale
by: Tigges, Curt, et al.
Published: (2024)
by: Tigges, Curt, et al.
Published: (2024)
Grokking Group Multiplication with Cosets
by: Stander, Dashiell, et al.
Published: (2023)
by: Stander, Dashiell, et al.
Published: (2023)
The Ghost in the Keys: A Disklavier Demo for Human-AI Musical Co-Creativity
by: Bradshaw, Louis, et al.
Published: (2025)
by: Bradshaw, Louis, et al.
Published: (2025)
Causal Drawbridges: Characterizing Gradient Blocking of Syntactic Islands in Transformer LMs
by: Boguraev, Sasha, et al.
Published: (2026)
by: Boguraev, Sasha, et al.
Published: (2026)
Privacy Amplification for Matrix Mechanisms
by: Choquette-Choo, Christopher A., et al.
Published: (2023)
by: Choquette-Choo, Christopher A., et al.
Published: (2023)
Na ante-sala da discriminação: o preço dos atributos de sexo ecor no Brasil (19891999)
by: Ciro Biderman
Published: (2004)
by: Ciro Biderman
Published: (2004)
Benchmarks as Microscopes: A Call for Model Metrology
by: Saxon, Michael, et al.
Published: (2024)
by: Saxon, Michael, et al.
Published: (2024)
Random Scaling of Emergent Capabilities
by: Zhao, Rosie, et al.
Published: (2025)
by: Zhao, Rosie, et al.
Published: (2025)
Dynamic Masking Rate Schedules for MLM Pretraining
by: Ankner, Zachary, et al.
Published: (2023)
by: Ankner, Zachary, et al.
Published: (2023)
Auditing Private Prediction
by: Chadha, Karan, et al.
Published: (2024)
by: Chadha, Karan, et al.
Published: (2024)
Hubble: a Model Suite to Advance the Study of LLM Memorization
by: Wei, Johnny Tian-Zheng, et al.
Published: (2025)
by: Wei, Johnny Tian-Zheng, et al.
Published: (2025)
Recite Your Ask Out Loud
Published: (2025)
Published: (2025)
Recollections of a Nuclear War
by: Morrison, Philip
Published: (1945)
by: Morrison, Philip
Published: (1945)
Recollections of a Nuclear War
Published: (1995)
Published: (1995)
A suite of LMs comprehend puzzle statements as well as humans
by: Goldberg, Adele E, et al.
Published: (2025)
by: Goldberg, Adele E, et al.
Published: (2025)
What Matters in Memorizing and Recalling Facts? Multifaceted Benchmarks for Knowledge Probing in Language Models
by: Zhao, Xin, et al.
Published: (2024)
by: Zhao, Xin, et al.
Published: (2024)
Sudden Drops in the Loss: Syntax Acquisition, Phase Transitions, and Simplicity Bias in MLMs
by: Chen, Angelica, et al.
Published: (2023)
by: Chen, Angelica, et al.
Published: (2023)
Similar Items
-
Privacy Ripple Effects from Adding or Removing Personal Information in Language Model Training
by: Borkar, Jaydeep, et al.
Published: (2025) -
PolyPythias: Stability and Outliers across Fifty Language Model Pre-Training Runs
by: van der Wal, Oskar, et al.
Published: (2025) -
Mind the Gap: Analyzing Lacunae with Transformer-Based Transcription
by: Borkar, Jaydeep, et al.
Published: (2024) -
Mechanistic?
by: Saphra, Naomi, et al.
Published: (2024) -
Memorization Dynamics in Knowledge Distillation for Language Models
by: Borkar, Jaydeep, et al.
Published: (2026)