:: Library Catalog

Bibliographic Details
Main Authors:	Yoshida, Davis; Goyal, Kartik; Gimpel, Kevin
Format:	Preprint
Published:	2023
Subjects:	Computation and Language; Artificial Intelligence; Machine Learning
Online Access:	https://arxiv.org/abs/2311.08817

Similar Items

ABBEL: LLM Agents Acting through Belief Bottlenecks Expressed in Language
by: Lidayan, Aly, et al.
Published: (2025)

AI-AI Bias: large language models favor communications generated by large language models
by: Laurito, Walter, et al.
Published: (2024)

The language of time: a language model perspective on time-series foundation models
by: Xie, Yi, et al.
Published: (2025)

Don't throw away your value model! Generating more preferable text with Value-Guided Monte-Carlo Tree Search decoding
by: Liu, Jiacheng, et al.
Published: (2023)

RadLite: Multi-Task LoRA Fine-Tuning of Small Language Models for CPU-Deployable Radiology AI
by: Gupta, Pankaj, et al.
Published: (2026)

COLD-Steer: Steering Large Language Models via In-Context One-step Learning Dynamics
by: Sharma, Kartik, et al.
Published: (2026)

Representation in large language models
by: Yetman, Cameron
Published: (2025)

A comprehensive study of on-device NLP applications -- VQA, automated Form filling, Smart Replies for Linguistic Codeswitching
by: Goyal, Naman
Published: (2024)

Lightweight reranking for language model generations
by: Jain, Siddhartha, et al.
Published: (2023)

Alignment faking in large language models
by: Greenblatt, Ryan, et al.
Published: (2024)

Auditing language models for hidden objectives
by: Marks, Samuel, et al.
Published: (2025)

MAP-Neo: Highly Capable and Transparent Bilingual Large Language Model Series
by: Zhang, Ge, et al.
Published: (2024)

Long-form factuality in large language models
by: Wei, Jerry, et al.
Published: (2024)

Can large language models explore in-context?
by: Krishnamurthy, Akshay, et al.
Published: (2024)

Extracting books from production language models
by: Ahmed, Ahmed, et al.
Published: (2026)

Addressing LLM Diversity by Infusing Random Concepts
by: Agrawal, Pulin, et al.
Published: (2026)

DCRM: A Heuristic to Measure Response Pair Quality in Preference Optimization
by: Huang, Chengyu, et al.
Published: (2025)

Inner Speech as Behavior Guides: Steerable Imitation of Diverse Behaviors for Human-AI coordination
by: Trivedi, Rakshit, et al.
Published: (2026)

FCoReBench: Can Large Language Models Solve Challenging First-Order Combinatorial Reasoning Problems?
by: Mittal, Chinmay, et al.
Published: (2024)

Inducing anxiety in large language models can induce bias
by: Coda-Forno, Julian, et al.
Published: (2023)

An artificial intelligence framework for end-to-end rare disease phenotyping from clinical notes using large language models
by: Shyr, Cathy, et al.
Published: (2026)

Ayn: A Tiny yet Competitive Indian Legal Language Model Pretrained from Scratch
by: Niyogi, Mitodru, et al.
Published: (2024)

EE-Tuning: An Economical yet Scalable Solution for Tuning Early-Exit Large Language Models
by: Pan, Xuchen, et al.
Published: (2024)

Memorization vs. Reasoning: Updating LLMs with New Knowledge
by: Li, Aochong Oliver, et al.
Published: (2025)

Steering Safely or Off a Cliff? Rethinking Specificity and Robustness in Inference-Time Interventions
by: Goyal, Navita, et al.
Published: (2026)

Post-training makes large language models less human-like
by: Binz, Marcel, et al.
Published: (2026)

LitLLMs, LLMs for Literature Review: Are we there yet?
by: Agarwal, Shubham, et al.
Published: (2024)

Layer by Layer: Uncovering Hidden Representations in Language Models
by: Skean, Oscar, et al.
Published: (2025)

Uncovering Competency Gaps in Large Language Models and Their Benchmarks
by: Bohacek, Maty, et al.
Published: (2025)

SOLAR 10.7B: Scaling Large Language Models with Simple yet Effective Depth Up-Scaling
by: Kim, Dahyun, et al.
Published: (2023)

Debate Helps Weak Judges Reward Stronger Models
by: Elasky, Ethan, et al.
Published: (2026)

Fresh in memory: Training-order recency is linearly encoded in language model activations
by: Krasheninnikov, Dmitrii, et al.
Published: (2025)

Safety and accuracy follow different scaling laws in clinical large language models
by: Wind, Sebastian, et al.
Published: (2026)

On the generalization of language models from in-context learning and finetuning: a controlled study
by: Lampinen, Andrew K., et al.
Published: (2025)

A dataset and benchmark for hospital course summarization with adapted large language models
by: Aali, Asad, et al.
Published: (2024)

Efficient semantic uncertainty quantification in language models via diversity-steered sampling
by: Park, Ji Won, et al.
Published: (2025)

Vocabulary shapes cross-lingual variation of word-order learnability in language models
by: Martins, Jonas Mayer, et al.
Published: (2026)

CogBench: a large language model walks into a psychology lab
by: Coda-Forno, Julian, et al.
Published: (2024)

Perturbation: A simple and efficient adversarial tracer for representation learning in language models
by: Rozner, Joshua, et al.
Published: (2026)

Uncovering Customer Issues through Topological Natural Language Analysis
by: Pi, Shu-Ting, et al.
Published: (2024)