| Main Authors: | Li, Michael Y., Fox, Emily B., Goodman, Noah D. |
|---|---|
| Format: | Preprint |
| Published: | 2024 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.17879 |
Similar Items
CriticAL: Critic Automation with Language Models
by: Li, Michael Y., et al.
Published: (2024)
Is Child-Directed Speech Effective Training Data for Language Models?
by: Feng, Steven Y., et al.
Published: (2024)
Large Language Model Reasoning Failures
by: Song, Peiyang, et al.
Published: (2026)
Hypothesis Search: Inductive Reasoning with Language Models
by: Wang, Ruocheng, et al.
Published: (2023)
Quiet-STaR: Language Models Can Teach Themselves to Think Before Speaking
by: Zelikman, Eric, et al.
Published: (2024)
Language and Experience: A Computational Model of Social Learning in Complex Tasks
by: Colas, Cédric, et al.
Published: (2025)
Endless Terminals: Scaling RL Environments for Terminal Agents
by: Gandhi, Kanishk, et al.
Published: (2026)
Neural Garbage Collection: Learning to Forget while Learning to Reason
by: Li, Michael Y., et al.
Published: (2026)
Automating Governing Knowledge Commons and Contextual Integrity (GKC-CI) Privacy Policy Annotations with Large Language Models
by: Chanenson, Jake, et al.
Published: (2023)
pyvene: A Library for Understanding and Improving PyTorch Models via Interventions
by: Wu, Zhengxuan, et al.
Published: (2024)
Cognitive Behaviors that Enable Self-Improving Reasoners, or, Four Habits of Highly Effective STaRs
by: Gandhi, Kanishk, et al.
Published: (2025)
Stream of Search (SoS): Learning to Search in Language
by: Gandhi, Kanishk, et al.
Published: (2024)
Sample, Don't Search: Rethinking Test-Time Alignment for Language Models
by: Faria, Gonçalo, et al.
Published: (2025)
BoxingGym: Benchmarking Progress in Automated Experimental Design and Model Discovery
by: Gandhi, Kanishk, et al.
Published: (2025)
Auto-Intent: Automated Intent Discovery and Self-Exploration for Large Language Model Web Agents
by: Kim, Jaekyeom, et al.
Published: (2024)
The Dual-Stream Transformer: Channelized Architecture for Interpretable Language Modeling
by: Kerce, J. Clayton, et al.
Published: (2026)
BALAR: A Bayesian Agentic Loop for Active Reasoning
by: Echarghaoui, Aymen, et al.
Published: (2026)
Large Language Model-driven Meta-structure Discovery in Heterogeneous Information Network
by: Chen, Lin, et al.
Published: (2024)
Automated Capability Discovery via Foundation Model Self-Exploration
by: Lu, Cong, et al.
Published: (2025)
What Makes a Maze Look Like a Maze?
by: Hsu, Joy, et al.
Published: (2024)
Advancing Event Forecasting through Massive Training of Large Language Models: Challenges, Solutions, and Broader Impacts
by: Lee, Sang-Woo, et al.
Published: (2025)
Evaluating Copyright Takedown Methods for Language Models
by: Wei, Boyi, et al.
Published: (2024)
Merge to Learn: Efficiently Adding Skills to Language Models with Model Merging
by: Morrison, Jacob, et al.
Published: (2024)
Bayesian scaling laws for in-context learning
by: Arora, Aryaman, et al.
Published: (2024)
SuperBPE: Space Travel for Language Models
by: Liu, Alisa, et al.
Published: (2025)
Improve Mathematical Reasoning in Language Models by Automated Process Supervision
by: Luo, Liangchen, et al.
Published: (2024)
Sequences of Logits Reveal the Low Rank Structure of Language Models
by: Golowich, Noah, et al.
Published: (2025)
DiscoveryBench: Towards Data-Driven Discovery with Large Language Models
by: Majumder, Bodhisattwa Prasad, et al.
Published: (2024)
Automating Evaluation of Diffusion Model Unlearning with (Vision-) Language Model World Knowledge
by: Yeats, Eric, et al.
Published: (2025)
In-Context Symbolic Regression: Leveraging Large Language Models for Function Discovery
by: Merler, Matteo, et al.
Published: (2024)
Knowledge Boundary Discovery for Large Language Models
by: Wang, Ziquan, et al.
Published: (2026)
CLEVRER-Humans: Describing Physical and Causal Events the Human Way
by: Mao, Jiayuan, et al.
Published: (2023)
Efficient Post-Training Pruning of Large Language Models with Statistical Correction
by: Yu, Peiqi, et al.
Published: (2026)
ELMES: An Automated Framework for Evaluating Large Language Models in Educational Scenarios
by: Wei, Shou'ang, et al.
Published: (2025)
Sampling from Your Language Model One Byte at a Time
by: Hayase, Jonathan, et al.
Published: (2025)
Skill Set Optimization: Reinforcing Language Model Behavior via Transferable Skills
by: Nottingham, Kolby, et al.
Published: (2024)
Efficient Automated Circuit Discovery in Transformers using Contextual Decomposition
by: Hsu, Aliyah R., et al.
Published: (2024)
Towards Statistical Factuality Guarantee for Large Vision-Language Models
by: Li, Zhuohang, et al.
Published: (2025)
'Simulacrum of Stories': Examining Large Language Models as Qualitative Research Participants
by: Kapania, Shivani, et al.
Published: (2024)
Hallucination Detection and Mitigation with Diffusion in Multi-Variate Time-Series Foundation Models
by: Wichitwechkarn, Vijja, et al.
Published: (2025)