Saved in:
| Main Authors: | Kang, Xinyue, Shi, Diwei, Chen, Li |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2512.04748 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Style Vectors for Steering Generative Large Language Model
by: Konen, Kai, et al.
Published: (2024)
by: Konen, Kai, et al.
Published: (2024)
Steering Vector Fields for Context-Aware Inference-Time Control in Large Language Models
by: Li, Jiaqian, et al.
Published: (2026)
by: Li, Jiaqian, et al.
Published: (2026)
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
by: Cao, Yuanpu, et al.
Published: (2024)
by: Cao, Yuanpu, et al.
Published: (2024)
Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research
by: Xu, Boyan, et al.
Published: (2024)
by: Xu, Boyan, et al.
Published: (2024)
The Impact of Steering Large Language Models with Persona Vectors in Educational Applications
by: Wu, Yongchao, et al.
Published: (2026)
by: Wu, Yongchao, et al.
Published: (2026)
Whispers that Shake Foundations: Analyzing and Mitigating False Premise Hallucinations in Large Language Models
by: Yuan, Hongbang, et al.
Published: (2024)
by: Yuan, Hongbang, et al.
Published: (2024)
Youtu-LLM: Unlocking the Native Agentic Potential for Lightweight Large Language Models
by: Lu, Junru, et al.
Published: (2025)
by: Lu, Junru, et al.
Published: (2025)
BILLY: Steering Large Language Models via Merging Persona Vectors for Creative Generation
by: Pai, Tsung-Min, et al.
Published: (2025)
by: Pai, Tsung-Min, et al.
Published: (2025)
The Effectiveness of Style Vectors for Steering Large Language Models: A Human Evaluation
by: Diallo, Diaoulé, et al.
Published: (2026)
by: Diallo, Diaoulé, et al.
Published: (2026)
Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation
by: Zhang, Yeqin, et al.
Published: (2025)
by: Zhang, Yeqin, et al.
Published: (2025)
Unlocking the Potential of Large Language Models in the Nuclear Industry with Synthetic Data
by: Anwar, Muhammad, et al.
Published: (2025)
by: Anwar, Muhammad, et al.
Published: (2025)
Psychological Steering of Large Language Models
by: Blas, Leonardo, et al.
Published: (2026)
by: Blas, Leonardo, et al.
Published: (2026)
Prototype-Based Dynamic Steering for Large Language Models
by: Kayan, Ceyhun Efe, et al.
Published: (2025)
by: Kayan, Ceyhun Efe, et al.
Published: (2025)
RepIt: Steering Language Models with Concept-Specific Refusal Vectors
by: Siu, Vincent, et al.
Published: (2025)
by: Siu, Vincent, et al.
Published: (2025)
Unlocking the Power of Large Language Models for Entity Alignment
by: Jiang, Xuhui, et al.
Published: (2024)
by: Jiang, Xuhui, et al.
Published: (2024)
Textual Steering Vectors Can Improve Visual Understanding in Multimodal Large Language Models
by: Gan, Woody Haosheng, et al.
Published: (2025)
by: Gan, Woody Haosheng, et al.
Published: (2025)
Decomposing and Steering Functional Metacognition in Large Language Models
by: Li, Yanshi, et al.
Published: (2026)
by: Li, Yanshi, et al.
Published: (2026)
Unlocking the Potential of Model Merging for Low-Resource Languages
by: Tao, Mingxu, et al.
Published: (2024)
by: Tao, Mingxu, et al.
Published: (2024)
Unlocking the Future: Exploring Look-Ahead Planning Mechanistic Interpretability in Large Language Models
by: Men, Tianyi, et al.
Published: (2024)
by: Men, Tianyi, et al.
Published: (2024)
Denoising Concept Vectors with Sparse Autoencoders for Improved Language Model Steering
by: Zhao, Haiyan, et al.
Published: (2025)
by: Zhao, Haiyan, et al.
Published: (2025)
Compositional Steering of Large Language Models with Steering Tokens
by: Radevski, Gorjan, et al.
Published: (2026)
by: Radevski, Gorjan, et al.
Published: (2026)
Unlocking the Potentials of Retrieval-Augmented Generation for Diffusion Language Models
by: Yu, Chuanyue, et al.
Published: (2026)
by: Yu, Chuanyue, et al.
Published: (2026)
Adaptive Task Vectors for Large Language Models
by: Kang, Joonseong, et al.
Published: (2025)
by: Kang, Joonseong, et al.
Published: (2025)
LF-Steering: Latent Feature Activation Steering for Enhancing Semantic Consistency in Large Language Models
by: Yang, Jingyuan, et al.
Published: (2025)
by: Yang, Jingyuan, et al.
Published: (2025)
Sorted LLaMA: Unlocking the Potential of Intermediate Layers of Large Language Models for Dynamic Inference
by: Kavehzadeh, Parsa, et al.
Published: (2023)
by: Kavehzadeh, Parsa, et al.
Published: (2023)
Evaluating and Steering Modality Preferences in Multimodal Large Language Model
by: Zhang, Yu, et al.
Published: (2025)
by: Zhang, Yu, et al.
Published: (2025)
Skills-in-Context Prompting: Unlocking Compositionality in Large Language Models
by: Chen, Jiaao, et al.
Published: (2023)
by: Chen, Jiaao, et al.
Published: (2023)
Steering When Necessary: Flexible Steering Large Language Models with Backtracking
by: Cheng, Zifeng, et al.
Published: (2025)
by: Cheng, Zifeng, et al.
Published: (2025)
Mechanistic Indicators of Steering Effectiveness in Large Language Models
by: Jafari, Mehdi, et al.
Published: (2026)
by: Jafari, Mehdi, et al.
Published: (2026)
CoSToM:Causal-oriented Steering for Intrinsic Theory-of-Mind Alignment in Large Language Models
by: Li, Mengfan, et al.
Published: (2026)
by: Li, Mengfan, et al.
Published: (2026)
Steering Large Language Models between Code Execution and Textual Reasoning
by: Chen, Yongchao, et al.
Published: (2024)
by: Chen, Yongchao, et al.
Published: (2024)
Safety Arithmetic: A Framework for Test-time Safety Alignment of Language Models by Steering Parameters and Activations
by: Hazra, Rima, et al.
Published: (2024)
by: Hazra, Rima, et al.
Published: (2024)
Beyond Static Personas: Situational Personality Steering for Large Language Models
by: Wei, Zesheng, et al.
Published: (2026)
by: Wei, Zesheng, et al.
Published: (2026)
Towards Inference-time Category-wise Safety Steering for Large Language Models
by: Bhattacharjee, Amrita, et al.
Published: (2024)
by: Bhattacharjee, Amrita, et al.
Published: (2024)
Function Vectors in Large Language Models
by: Todd, Eric, et al.
Published: (2023)
by: Todd, Eric, et al.
Published: (2023)
TransportationGames: Benchmarking Transportation Knowledge of (Multimodal) Large Language Models
by: Zhang, Xue, et al.
Published: (2024)
by: Zhang, Xue, et al.
Published: (2024)
Merlin's Whisper: Enabling Efficient Reasoning in Large Language Models via Black-box Persuasive Prompting
by: Xia, Heming, et al.
Published: (2025)
by: Xia, Heming, et al.
Published: (2025)
RIRO: Reshaping Inputs, Refining Outputs Unlocking the Potential of Large Language Models in Data-Scarce Contexts
by: Hamdi, Ali, et al.
Published: (2024)
by: Hamdi, Ali, et al.
Published: (2024)
CogSteer: Cognition-Inspired Selective Layer Intervention for Efficiently Steering Large Language Models
by: Wang, Xintong, et al.
Published: (2024)
by: Wang, Xintong, et al.
Published: (2024)
Spontaneous Speech-Based Suicide Risk Detection Using Whisper and Large Language Models
by: Cui, Ziyun, et al.
Published: (2024)
by: Cui, Ziyun, et al.
Published: (2024)
Similar Items
-
Style Vectors for Steering Generative Large Language Model
by: Konen, Kai, et al.
Published: (2024) -
Steering Vector Fields for Context-Aware Inference-Time Control in Large Language Models
by: Li, Jiaqian, et al.
Published: (2026) -
Personalized Steering of Large Language Models: Versatile Steering Vectors Through Bi-directional Preference Optimization
by: Cao, Yuanpu, et al.
Published: (2024) -
Unlocking the Potential: Benchmarking Large Language Models in Water Engineering and Research
by: Xu, Boyan, et al.
Published: (2024) -
The Impact of Steering Large Language Models with Persona Vectors in Educational Applications
by: Wu, Yongchao, et al.
Published: (2026)