:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Michael Y., Fox, Emily B., Goodman, Noah D.
Format:	Preprint
Published:	2024
Subjects:	Machine Learning Computation and Language
Online Access:	https://arxiv.org/abs/2402.17879
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

CriticAL: Critic Automation with Language Models
by: Li, Michael Y., et al.
Published: (2024)

Is Child-Directed Speech Effective Training Data for Language Models?
by: Feng, Steven Y., et al.
Published: (2024)

Large Language Model Reasoning Failures
by: Song, Peiyang, et al.
Published: (2026)

Hypothesis Search: Inductive Reasoning with Language Models
by: Wang, Ruocheng, et al.
Published: (2023)

Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
by: Zelikman, Eric, et al.
Published: (2024)

Language and Experience: A Computational Model of Social Learning in Complex Tasks
by: Colas, Cédric, et al.
Published: (2025)

Endless Terminals: Scaling RL Environments for Terminal Agents
by: Gandhi, Kanishk, et al.
Published: (2026)

Neural Garbage Collection: Learning to Forget while Learning to Reason
by: Li, Michael Y., et al.
Published: (2026)

Automating Governing Knowledge Commons and Contextual Integrity (GKC-CI) Privacy Policy Annotations with Large Language Models
by: Chanenson, Jake, et al.
Published: (2023)

pyvene: A Library for Understanding and Improving PyTorch Models via Interventions
by: Wu, Zhengxuan, et al.
Published: (2024)

Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
by: Gandhi, Kanishk, et al.
Published: (2025)

Stream of Search (SoS): Learning to Search in Language
by: Gandhi, Kanishk, et al.
Published: (2024)

Sample, Don't Search: Rethinking Test-Time Alignment for Language Models
by: Faria, Gonçalo, et al.
Published: (2025)

BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery
by: Gandhi, Kanishk, et al.
Published: (2025)

Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents
by: Kim, Jaekyeom, et al.
Published: (2024)

The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling
by: Kerce, J. Clayton, et al.
Published: (2026)

BALAR : A Bayesian Agentic Loop for Active Reasoning
by: Echarghaoui, Aymen, et al.
Published: (2026)

Large Language Model-driven Meta-structure Discovery in Heterogeneous Information Network
by: Chen, Lin, et al.
Published: (2024)

Automated Capability Discovery via Foundation Model Self-Exploration
by: Lu, Cong, et al.
Published: (2025)

What Makes a Maze Look Like a Maze?
by: Hsu, Joy, et al.
Published: (2024)

Advancing Event Forecasting through Massive Training of Large Language Models: Challenges, Solutions, and Broader Impacts
by: Lee, Sang-Woo, et al.
Published: (2025)

Evaluating Copyright Takedown Methods for Language Models
by: Wei, Boyi, et al.
Published: (2024)

Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
by: Morrison, Jacob, et al.
Published: (2024)

Bayesian scaling laws for in-context learning
by: Arora, Aryaman, et al.
Published: (2024)

SuperBPE: Space Travel for Language Models
by: Liu, Alisa, et al.
Published: (2025)

Improve Mathematical Reasoning in Language Models by Automated Process Supervision
by: Luo, Liangchen, et al.
Published: (2024)

Sequences of Logits Reveal the Low Rank Structure of Language Models
by: Golowich, Noah, et al.
Published: (2025)

DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)

Automating Evaluation of Diffusion Model Unlearning with (Vision-) Language Model World Knowledge
by: Yeats, Eric, et al.
Published: (2025)

In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery
by: Merler, Matteo, et al.
Published: (2024)

Knowledge Boundary Discovery for Large Language Models
by: Wang, Ziquan, et al.
Published: (2026)

CLEVRER-Humans: Describing Physical and Causal Events the Human Way
by: Mao, Jiayuan, et al.
Published: (2023)

Efficient Post-Training Pruning of Large Language Models with Statistical Correction
by: Yu, Peiqi, et al.
Published: (2026)

ELMES: An Automated Framework for Evaluating Large Language Models in Educational Scenarios
by: Wei, Shou'ang, et al.
Published: (2025)

Sampling from Your Language Model One Byte at a Time
by: Hayase, Jonathan, et al.
Published: (2025)

Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
by: Nottingham, Kolby, et al.
Published: (2024)

Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition
by: Hsu, Aliyah R., et al.
Published: (2024)

Towards Statistical Factuality Guarantee for Large Vision-Language Models
by: Li, Zhuohang, et al.
Published: (2025)

'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants
by: Kapania, Shivani, et al.
Published: (2024)

Hallucination Detection and Mitigation with Diffusion in Multi-Variate Time-Series Foundation Models
by: Wichitwechkarn, Vijja, et al.
Published: (2025)