Saved in:
| Main Authors: | Jafari, Mehdi, Xue, Hao, Salim, Flora |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01716 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction
by: Jafari, Mehdi, et al.
Published: (2025)
by: Jafari, Mehdi, et al.
Published: (2025)
Is my model "mind blurting"? Interpreting the dynamics of reasoning tokens with Recurrence Quantification Analysis (RQA)
by: Pham, Quoc Tuan, et al.
Published: (2026)
by: Pham, Quoc Tuan, et al.
Published: (2026)
MAPLE: Mobile App Prediction Leveraging Large Language Model Embeddings
by: Khaokaew, Yonchanok, et al.
Published: (2023)
by: Khaokaew, Yonchanok, et al.
Published: (2023)
SensorLLM: Aligning Large Language Models with Motion Sensors for Human Activity Recognition
by: Li, Zechen, et al.
Published: (2024)
by: Li, Zechen, et al.
Published: (2024)
What am I missing here?: Evaluating Large Language Models for Masked Sentence Prediction
by: Wyatt, Charlie, et al.
Published: (2025)
by: Wyatt, Charlie, et al.
Published: (2025)
Mechanistic Indicators of Understanding in Large Language Models
by: Beckmann, Pierre, et al.
Published: (2025)
by: Beckmann, Pierre, et al.
Published: (2025)
Prompt Mining for Language-based Human Mobility Forecasting
by: Xue, Hao, et al.
Published: (2024)
by: Xue, Hao, et al.
Published: (2024)
RIDE: Enhancing Large Language Model Alignment through Restyled In-Context Learning Demonstration Exemplars
by: Hua, Yuncheng, et al.
Published: (2025)
by: Hua, Yuncheng, et al.
Published: (2025)
Locate, Steer, and Improve: A Practical Survey of Actionable Mechanistic Interpretability in Large Language Models
by: Zhang, Hengyuan, et al.
Published: (2026)
by: Zhang, Hengyuan, et al.
Published: (2026)
ZARA: Training-Free Motion Time-Series Reasoning via Evidence-Grounded LLM Agents
by: Li, Zechen, et al.
Published: (2025)
by: Li, Zechen, et al.
Published: (2025)
RELOOP: Recursive Retrieval with Multi-Hop Reasoner and Planners for Heterogeneous QA
by: Yang, Ruiyi, et al.
Published: (2025)
by: Yang, Ruiyi, et al.
Published: (2025)
Psychological Steering of Large Language Models
by: Blas, Leonardo, et al.
Published: (2026)
by: Blas, Leonardo, et al.
Published: (2026)
The Effectiveness of Style Vectors for Steering Large Language Models: A Human Evaluation
by: Diallo, Diaoulé, et al.
Published: (2026)
by: Diallo, Diaoulé, et al.
Published: (2026)
Compositional Steering of Large Language Models with Steering Tokens
by: Radevski, Gorjan, et al.
Published: (2026)
by: Radevski, Gorjan, et al.
Published: (2026)
Steering Vector Fields for Context-Aware Inference-Time Control in Large Language Models
by: Li, Jiaqian, et al.
Published: (2026)
by: Li, Jiaqian, et al.
Published: (2026)
Steering Without Breaking: Mechanistically Informed Interventions for Discrete Diffusion Language Models
by: Zhou, Hanhan, et al.
Published: (2026)
by: Zhou, Hanhan, et al.
Published: (2026)
Self-Steering Optimization: Autonomous Preference Optimization for Large Language Models
by: Xiang, Hao, et al.
Published: (2024)
by: Xiang, Hao, et al.
Published: (2024)
Harnessing Test-time Adaptation for NLU tasks Involving Dialects of English
by: Nguyen, Duke, et al.
Published: (2025)
by: Nguyen, Duke, et al.
Published: (2025)
Alternatives To Next Token Prediction In Text Generation -- A Survey
by: Wyatt, Charlie, et al.
Published: (2025)
by: Wyatt, Charlie, et al.
Published: (2025)
Steering When Necessary: Flexible Steering Large Language Models with Backtracking
by: Cheng, Zifeng, et al.
Published: (2025)
by: Cheng, Zifeng, et al.
Published: (2025)
Preference Heads in Large Language Models: A Mechanistic Framework for Interpretable Personalization
by: Zhang, Weixu, et al.
Published: (2026)
by: Zhang, Weixu, et al.
Published: (2026)
Decomposing and Steering Functional Metacognition in Large Language Models
by: Li, Yanshi, et al.
Published: (2026)
by: Li, Yanshi, et al.
Published: (2026)
Prototype-Based Dynamic Steering for Large Language Models
by: Kayan, Ceyhun Efe, et al.
Published: (2025)
by: Kayan, Ceyhun Efe, et al.
Published: (2025)
Style Vectors for Steering Generative Large Language Model
by: Konen, Kai, et al.
Published: (2024)
by: Konen, Kai, et al.
Published: (2024)
EchoAtt: Attend, Copy, then Adjust for More Efficient Large Language Models
by: Rajabzadeh, Hossein, et al.
Published: (2024)
by: Rajabzadeh, Hossein, et al.
Published: (2024)
AnyMo: Geometry-Aware Setup-Agnostic Modeling of Human Motion in the Wild
by: Chen, Baiyu, et al.
Published: (2026)
by: Chen, Baiyu, et al.
Published: (2026)
LF-Steering: Latent Feature Activation Steering for Enhancing Semantic Consistency in Large Language Models
by: Yang, Jingyuan, et al.
Published: (2025)
by: Yang, Jingyuan, et al.
Published: (2025)
Why Supervised Fine-Tuning Fails to Learn: A Systematic Study of Incomplete Learning in Large Language Models
by: Xue, Chao, et al.
Published: (2026)
by: Xue, Chao, et al.
Published: (2026)
Language Steering for Multilingual In-Context Learning
by: Kirtane, Neeraja, et al.
Published: (2026)
by: Kirtane, Neeraja, et al.
Published: (2026)
Evaluating and Steering Modality Preferences in Multimodal Large Language Model
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
Controlling Large Language Model Agents with Entropic Activation Steering
by: Rahn, Nate, et al.
Published: (2024)
by: Rahn, Nate, et al.
Published: (2024)
Multilingual Political Views of Large Language Models: Identification and Steering
by: Gurgurov, Daniil, et al.
Published: (2025)
by: Gurgurov, Daniil, et al.
Published: (2025)
Evaluating Large Language Model Biases in Persona-Steered Generation
by: Liu, Andy, et al.
Published: (2024)
by: Liu, Andy, et al.
Published: (2024)
Mechanistic Circuit-Based Knowledge Editing in Large Language Models
by: Zhao, Tianyi, et al.
Published: (2026)
by: Zhao, Tianyi, et al.
Published: (2026)
What Drives Representation Steering? A Mechanistic Case Study on Steering Refusal
by: Cheng, Stephen, et al.
Published: (2026)
by: Cheng, Stephen, et al.
Published: (2026)
Differentially Private Steering for Large Language Model Alignment
by: Goel, Anmol, et al.
Published: (2025)
by: Goel, Anmol, et al.
Published: (2025)
Steering Large Language Models to Evaluate and Amplify Creativity
by: Olson, Matthew Lyle, et al.
Published: (2024)
by: Olson, Matthew Lyle, et al.
Published: (2024)
Prompt-Based Value Steering of Large Language Models
by: Abbo, Giulio Antonio, et al.
Published: (2025)
by: Abbo, Giulio Antonio, et al.
Published: (2025)
Corpus-Steered Query Expansion with Large Language Models
by: Lei, Yibin, et al.
Published: (2024)
by: Lei, Yibin, et al.
Published: (2024)
SOCIA-EVO: Automated Simulator Construction via Dual-Anchored Bi-Level Optimization
by: Hua, Yuncheng, et al.
Published: (2026)
by: Hua, Yuncheng, et al.
Published: (2026)
Similar Items
-
Enhancing Conversational Agents with Theory of Mind: Aligning Beliefs, Desires, and Intentions for Human-Like Interaction
by: Jafari, Mehdi, et al.
Published: (2025) -
Is my model "mind blurting"? Interpreting the dynamics of reasoning tokens with Recurrence Quantification Analysis (RQA)
by: Pham, Quoc Tuan, et al.
Published: (2026) -
MAPLE: Mobile App Prediction Leveraging Large Language Model Embeddings
by: Khaokaew, Yonchanok, et al.
Published: (2023) -
SensorLLM: Aligning Large Language Models with Motion Sensors for Human Activity Recognition
by: Li, Zechen, et al.
Published: (2024) -
What am I missing here?: Evaluating Large Language Models for Masked Sentence Prediction
by: Wyatt, Charlie, et al.
Published: (2025)