| Main Authors: | Rahman, Md Mahbubur, Guha, Arjun, Menon, Harshitha |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2603.23629 |
Similar Items
More Than a Score: Probing the Impact of Prompt Specificity on LLM Code Generation
by: Zi, Yangtian, et al.
Published: (2025)
Understanding How CodeLLMs (Mis)Predict Types with Activation Steering
by: Lucchetti, Francesca, et al.
Published: (2024)
Emotion Detection From Social Media Posts
by: Rahman, Md Mahbubur, et al.
Published: (2023)
Substance Beats Style: Why Beginning Students Fail to Code with LLMs
by: Lucchetti, Francesca, et al.
Published: (2024)
Guiding Giants: Lightweight Controllers for Weighted Activation Steering in LLMs
by: Hegazy, Amr, et al.
Published: (2025)
Relative Positioning Based Code Chunking Method For Rich Context Retrieval In Repository Level Code Completion Task With Code Language Model
by: Rahman, Imranur, et al.
Published: (2025)
Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs
by: Cassano, Federico, et al.
Published: (2023)
Activation Steering with a Feedback Controller
by: Nguyen, Dung V., et al.
Published: (2025)
Elevating Intrusion Detection and Security Fortification in Intelligent Networks through Cutting-Edge Machine Learning Paradigms
by: Munna, Md Minhazul Islam, et al.
Published: (2025)
Steering Language Models With Activation Engineering
by: Turner, Alexander Matt, et al.
Published: (2023)
SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models
by: Sivakumar, Anushka, et al.
Published: (2025)
Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment
by: Boruch-Gruszecki, Aleksander, et al.
Published: (2025)
Steering MoE LLMs via Expert (De)Activation
by: Fayyaz, Mohsen, et al.
Published: (2025)
Depth-Wise Activation Steering for Honest Language Models
by: Góral, Gracjan, et al.
Published: (2025)
Extracting Unlearned Information from LLMs with Activation Steering
by: Seyitoğlu, Atakan, et al.
Published: (2024)
Automated Neuron Labelling Enables Generative Steering and Interpretability in Protein Language Models
by: Banerjee, Arjun, et al.
Published: (2025)
Global Evolutionary Steering: Refining Activation Steering Control via Cross-Layer Consistency
by: Jiang, Xinyan, et al.
Published: (2026)
A Unifying Human-Centered AI Fairness Framework
by: Rahman, Munshi Mahbubur, et al.
Published: (2025)
Enhancing Instruction Following of LLMs via Activation Steering with Dynamic Rejection
by: Kang, Minjae, et al.
Published: (2026)
Towards Causal Deep Learning for Vulnerability Detection
by: Rahman, Md Mahbubur, et al.
Published: (2023)
Steering Large Language Model Activations in Sparse Spaces
by: Bayat, Reza, et al.
Published: (2025)
Endogenous Resistance to Activation Steering in Language Models
by: McKenzie, Alex, et al.
Published: (2026)
Angular Steering: Behavior Control via Rotation in Activation Space
by: Vu, Hieu M., et al.
Published: (2025)
Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control
by: Skifstad, Julian, et al.
Published: (2026)
HyperSteer: Activation Steering at Scale with Hypernetworks
by: Sun, Jiuding, et al.
Published: (2025)
Spherical Steering: Geometry-Aware Activation Rotation for Language Models
by: You, Zejia, et al.
Published: (2026)
LeMo-NADe: Multi-Parameter Neural Architecture Discovery with LLMs
by: Rahman, Md Hafizur, et al.
Published: (2024)
Dynamically Scaled Activation Steering
by: Ferrando, Alex, et al.
Published: (2025)
SteerConf: Steering LLMs for Confidence Elicitation
by: Zhou, Ziang, et al.
Published: (2025)
SafeSteer: Interpretable Safety Steering with Refusal-Evasion in LLMs
by: Ghosh, Shaona, et al.
Published: (2025)
Steering to Say No: Configurable Refusal via Activation Steering in Vision Language Models
by: Yang, Jiaxi, et al.
Published: (2026)
Steer Like the LLM: Activation Steering that Mimics Prompting
by: Heyman, Geert, et al.
Published: (2026)
How a Bit Becomes a Story: Semantic Steering via Differentiable Fault Injection
by: Haider, Zafaryab, et al.
Published: (2025)
Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
by: Jones, Erik, et al.
Published: (2025)
Minimizing Collateral Damage in Activation Steering
by: Nguyen, Tam, et al.
Published: (2026)
Steered LLM Activations are Non-Surjective
by: Mishra, Aayush, et al.
Published: (2026)
Activation Steering for Chain-of-Thought Compression
by: Azizi, Seyedarmin, et al.
Published: (2025)
Improving Instruction-Following in Language Models through Activation Steering
by: Stolfo, Alessandro, et al.
Published: (2024)
GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time
by: Handa, Divij, et al.
Published: (2025)
Beyond Steering Vector: Flow-based Activation Steering for Inference-Time Intervention
by: Jin, Zehao, et al.
Published: (2026)