| Main Authors: | Lu, Christina, Gallagher, Jack, Michala, Jonathan, Fish, Kyle, Lindsey, Jack |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Online Access: | https://arxiv.org/abs/2601.10387 |
Similar Items
Emergent Introspective Awareness in Large Language Models
by: Lindsey, Jack
Published: (2026)
Persona Vectors: Monitoring and Controlling Character Traits in Language Models
by: Chen, Runjin, et al.
Published: (2025)
Beyond Static Personas: Situational Personality Steering for Large Language Models
by: Wei, Zesheng, et al.
Published: (2026)
Slot Machines: How LLMs Keep Track of Multiple Entities
by: Bogdan, Paul C., et al.
Published: (2026)
Emotion Concepts and their Function in a Large Language Model
by: Sofroniew, Nicholas, et al.
Published: (2026)
Improving Agent Interactions in Virtual Environments with Language Models
by: Zhang, Jack
Published: (2024)
Aligning VLM Assistants with Personalized Situated Cognition
by: Li, Yongqi, et al.
Published: (2025)
Generics and Default Reasoning in Large Language Models
by: Kirkpatrick, James Ravi, et al.
Published: (2025)
On Efficient Language and Vision Assistants for Visually-Situated Natural Language Understanding: What Matters in Reading and Reasoning
by: Kim, Geewook, et al.
Published: (2024)
Plant in Cupboard, Orange on Rably, Inat Aphone. Benchmarking Incremental Learning of Situation and Language Model using a Text-Simulated Situated Environment
by: Jordan, Jonathan, et al.
Published: (2025)
ALMs: Authorial Language Models for Authorship Attribution
by: Huang, Weihang, et al.
Published: (2024)
A Statistical Physics of Language Model Reasoning
by: Carson, Jack David, et al.
Published: (2025)
UNcommonsense Reasoning: Abductive Reasoning about Uncommon Situations
by: Zhao, Wenting, et al.
Published: (2023)
Mechanistic Decomposition of Sentence Representations
by: Tehenan, Matthieu, et al.
Published: (2025)
Leveraging Transformer-Based Models for Predicting Inflection Classes of Words in an Endangered Sami Language
by: Alnajjar, Khalid, et al.
Published: (2024)
S0 Tuning: Zero-Overhead Adaptation of Hybrid Recurrent-Attention Models
by: Young, Jack
Published: (2026)
Persona Jailbreaking in Large Language Models
by: Sandhan, Jivnesh, et al.
Published: (2026)
Circuit Component Reuse Across Tasks in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2023)
When "A Helpful Assistant" Is Not Really Helpful: Personas in System Prompts Do Not Improve Performances of Large Language Models
by: Zheng, Mingqian, et al.
Published: (2023)
Talking Heads: Understanding Inter-layer Communication in Transformer Language Models
by: Merullo, Jack, et al.
Published: (2024)
The Persona Paradox: Medical Personas as Behavioral Priors in Clinical Language Models
by: Abdullahi, Tassallah, et al.
Published: (2026)
Language Models Implement Simple Word2Vec-style Vector Arithmetic
by: Merullo, Jack, et al.
Published: (2023)
PersonaLens: A Benchmark for Personalization Evaluation in Conversational AI Assistants
by: Zhao, Zheng, et al.
Published: (2025)
SignBind-LLM: Multi-Stage Modality Fusion for Sign Language Translation
by: Thomas, Marshall, et al.
Published: (2025)
Transferring Linear Features Across Language Models With Model Stitching
by: Chen, Alan, et al.
Published: (2025)
Situated Natural Language Explanations
by: Zhu, Zining, et al.
Published: (2023)
Open Character Training: Shaping the Persona of AI Assistants through Constitutional AI
by: Maiya, Sharan, et al.
Published: (2025)
Can Large Language Models abstract Medical Coded Language?
by: Lee, Simon A., et al.
Published: (2024)
Large Language Model based Situational Dialogues for Second Language Learning
by: Xu, Shuyao, et al.
Published: (2024)
Targeted Syntactic Evaluation of Language Models on Georgian Case Alignment
by: Gallagher, Daniel, et al.
Published: (2026)
Probing Persona-Dependent Preferences in Language Models
by: Gilg, Oscar, et al.
Published: (2026)
Mixture-of-Personas Language Models for Population Simulation
by: Bui, Ngoc, et al.
Published: (2025)
On Linear Representations and Pretraining Data Frequency in Language Models
by: Merullo, Jack, et al.
Published: (2025)
Representational Curvature Modulates Behavioral Uncertainty in Large Language Models
by: King, Jack, et al.
Published: (2026)
EHRmonize: A Framework for Medical Concept Abstraction from Electronic Health Records using Large Language Models
by: Matos, João, et al.
Published: (2024)
Masculine Defaults via Gendered Discourse in Podcasts and Large Language Models
by: Teleki, Maria, et al.
Published: (2025)
Improving Language Model Personas via Rationalization with Psychological Scaffolds
by: Joshi, Brihi, et al.
Published: (2025)
Evaluating Large Language Model Biases in Persona-Steered Generation
by: Liu, Andy, et al.
Published: (2024)
TRACE: Real-Time Multimodal Common Ground Tracking in Situated Collaborative Dialogues
by: VanderHoeven, Hannah, et al.
Published: (2025)
Standard Occupation Classifier -- A Natural Language Processing Approach
by: Rony, Sidharth, et al.
Published: (2025)