Saved in:
| Main Authors: | Hewitt, John, Chen, Sarah, Xie, Lanruo Lora, Adams, Edward, Liang, Percy, Manning, Christopher D. |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2402.06155 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Instruction Following without Instruction Tuning
by: Hewitt, John, et al.
Published: (2024)
by: Hewitt, John, et al.
Published: (2024)
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
by: Zhong, Zexuan, et al.
Published: (2023)
by: Zhong, Zexuan, et al.
Published: (2023)
Drop Dropout on Single-Epoch Language Model Pretraining
by: Liu, Houjun, et al.
Published: (2025)
by: Liu, Houjun, et al.
Published: (2025)
Improving Parametric Knowledge Access in Reasoning Language Models
by: Ma, Melody, et al.
Published: (2026)
by: Ma, Melody, et al.
Published: (2026)
Osiris: A Lightweight Open-Source Hallucination Detection System
by: Shan, Alex, et al.
Published: (2025)
by: Shan, Alex, et al.
Published: (2025)
Sneaking Syntax into Transformer Language Models with Tree Regularization
by: Nandi, Ananjan, et al.
Published: (2024)
by: Nandi, Ananjan, et al.
Published: (2024)
Humans and transformer LMs: Abstraction drives language learning
by: Jian, Jasper, et al.
Published: (2026)
by: Jian, Jasper, et al.
Published: (2026)
A New Pair of GloVes
by: Carlson, Riley, et al.
Published: (2025)
by: Carlson, Riley, et al.
Published: (2025)
Self-Verified Distillation: Your Language Model Is Secretly Its Own Synthetic Data Pipeline
by: Lee, Tony, et al.
Published: (2026)
by: Lee, Tony, et al.
Published: (2026)
ReCOGS: How Incidental Details of a Logical Form Overshadow an Evaluation of Semantic Interpretation
by: Wu, Zhengxuan, et al.
Published: (2023)
by: Wu, Zhengxuan, et al.
Published: (2023)
BioMedLM: A 2.7B Parameter Language Model Trained On Biomedical Text
by: Bolton, Elliot, et al.
Published: (2024)
by: Bolton, Elliot, et al.
Published: (2024)
Stronger Baselines for Retrieval-Augmented Generation with Long-Context Language Models
by: Laitenberger, Alex, et al.
Published: (2025)
by: Laitenberger, Alex, et al.
Published: (2025)
Trustworthy Social Bias Measurement
by: Bommasani, Rishi, et al.
Published: (2022)
by: Bommasani, Rishi, et al.
Published: (2022)
Language Models Prefer What They Know: Relative Confidence Estimation via Confidence Preferences
by: Shrivastava, Vaishnavi, et al.
Published: (2025)
by: Shrivastava, Vaishnavi, et al.
Published: (2025)
Improved Representation Steering for Language Models
by: Wu, Zhengxuan, et al.
Published: (2025)
by: Wu, Zhengxuan, et al.
Published: (2025)
Subliminal Steering: Stronger Encoding of Hidden Signals
by: Morgulis, George, et al.
Published: (2026)
by: Morgulis, George, et al.
Published: (2026)
Transcribe, Translate, or Transliterate: An Investigation of Intermediate Representations in Spoken Language Models
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)
by: Ògúnrèmí, Tolúlopé, et al.
Published: (2025)
Robust Distortion-free Watermarks for Language Models
by: Kuditipudi, Rohith, et al.
Published: (2023)
by: Kuditipudi, Rohith, et al.
Published: (2023)
Replaying pre-training data improves fine-tuning
by: Kotha, Suhas, et al.
Published: (2026)
by: Kotha, Suhas, et al.
Published: (2026)
Semgrex and Ssurgeon, Searching and Manipulating Dependency Graphs
by: Bauer, John, et al.
Published: (2024)
by: Bauer, John, et al.
Published: (2024)
On the Entropy Calibration of Language Models
by: Cao, Steven, et al.
Published: (2025)
by: Cao, Steven, et al.
Published: (2025)
Blackbox Model Provenance via Palimpsestic Membership Inference
by: Kuditipudi, Rohith, et al.
Published: (2025)
by: Kuditipudi, Rohith, et al.
Published: (2025)
LawInstruct: A Resource for Studying Language Model Adaptation to the Legal Domain
by: Niklaus, Joel, et al.
Published: (2024)
by: Niklaus, Joel, et al.
Published: (2024)
Independence Tests for Language Models
by: Zhu, Sally, et al.
Published: (2025)
by: Zhu, Sally, et al.
Published: (2025)
Do "English" Named Entity Recognizers Work Well on Global Englishes?
by: Shan, Alexander, et al.
Published: (2024)
by: Shan, Alexander, et al.
Published: (2024)
NNetNav: Unsupervised Learning of Browser Agents Through Environment Interaction in the Wild
by: Murty, Shikhar, et al.
Published: (2024)
by: Murty, Shikhar, et al.
Published: (2024)
Mechanisms vs. Outcomes: Probing for Syntax Fails to Explain Performance on Targeted Syntactic Evaluations
by: Agarwal, Ananth, et al.
Published: (2025)
by: Agarwal, Ananth, et al.
Published: (2025)
SpecEval: Evaluating Model Adherence to Behavior Specifications
by: Ahmed, Ahmed, et al.
Published: (2025)
by: Ahmed, Ahmed, et al.
Published: (2025)
MrT5: Dynamic Token Merging for Efficient Byte-level Language Models
by: Kallini, Julie, et al.
Published: (2024)
by: Kallini, Julie, et al.
Published: (2024)
Revealing the Deceptiveness of Knowledge Editing: A Mechanistic Analysis of Superficial Editing
by: Xie, Jiakuan, et al.
Published: (2025)
by: Xie, Jiakuan, et al.
Published: (2025)
Eliciting Language Model Behaviors with Investigator Agents
by: Li, Xiang Lisa, et al.
Published: (2025)
by: Li, Xiang Lisa, et al.
Published: (2025)
Lightweight Baselines for Medical Abstract Classification: DistilBERT with Cross-Entropy as a Strong Default
by: Liu, Jiaqi, et al.
Published: (2025)
by: Liu, Jiaqi, et al.
Published: (2025)
We Can't Understand AI Using our Existing Vocabulary
by: Hewitt, John, et al.
Published: (2025)
by: Hewitt, John, et al.
Published: (2025)
Learning to Retrieve In-Context Examples for Large Language Models
by: Wang, Liang, et al.
Published: (2023)
by: Wang, Liang, et al.
Published: (2023)
Consecutive Batch Model Editing with HooK Layers
by: Li, Shuaiyi, et al.
Published: (2024)
by: Li, Shuaiyi, et al.
Published: (2024)
Relative Scaling Laws for LLMs
by: Held, William, et al.
Published: (2025)
by: Held, William, et al.
Published: (2025)
Neologism Learning for Controllability and Self-Verbalization
by: Hewitt, John, et al.
Published: (2025)
by: Hewitt, John, et al.
Published: (2025)
On the Learnability of Watermarks for Language Models
by: Gu, Chenchen, et al.
Published: (2023)
by: Gu, Chenchen, et al.
Published: (2023)
Statistical Uncertainty in Word Embeddings: GloVe-V
by: Vallebueno, Andrea, et al.
Published: (2024)
by: Vallebueno, Andrea, et al.
Published: (2024)
RedPajama: an Open Dataset for Training Large Language Models
by: Weber, Maurice, et al.
Published: (2024)
by: Weber, Maurice, et al.
Published: (2024)
Similar Items
-
Instruction Following without Instruction Tuning
by: Hewitt, John, et al.
Published: (2024) -
MQuAKE: Assessing Knowledge Editing in Language Models via Multi-Hop Questions
by: Zhong, Zexuan, et al.
Published: (2023) -
Drop Dropout on Single-Epoch Language Model Pretraining
by: Liu, Houjun, et al.
Published: (2025) -
Improving Parametric Knowledge Access in Reasoning Language Models
by: Ma, Melody, et al.
Published: (2026) -
Osiris: A Lightweight Open-Source Hallucination Detection System
by: Shan, Alex, et al.
Published: (2025)