:: Library Catalog

Bibliographic Details
Main Authors:	Rahman, Md Mahbubur; Guha, Arjun; Menon, Harshitha
Format:	Preprint
Published:	2026
Subjects:	Machine Learning
Online Access:	https://arxiv.org/abs/2603.23629

Similar Items

More Than a Score: Probing the Impact of Prompt Specificity on LLM Code Generation
by: Zi, Yangtian, et al.
Published: (2025)

Understanding How CodeLLMs (Mis)Predict Types with Activation Steering
by: Lucchetti, Francesca, et al.
Published: (2024)

Emotion Detection From Social Media Posts
by: Rahman, Md Mahbubur, et al.
Published: (2023)

Substance Beats Style: Why Beginning Students Fail to Code with LLMs
by: Lucchetti, Francesca, et al.
Published: (2024)

Guiding Giants: Lightweight Controllers for Weighted Activation Steering in LLMs
by: Hegazy, Amr, et al.
Published: (2025)

Relative Positioning Based Code Chunking Method For Rich Context Retrieval In Repository Level Code Completion Task With Code Language Model
by: Rahman, Imranur, et al.
Published: (2025)

Knowledge Transfer from High-Resource to Low-Resource Programming Languages for Code LLMs
by: Cassano, Federico, et al.
Published: (2023)

Activation Steering with a Feedback Controller
by: Nguyen, Dung V., et al.
Published: (2025)

Elevating Intrusion Detection and Security Fortification in Intelligent Networks through Cutting-Edge Machine Learning Paradigms
by: Munna, Md Minhazul Islam, et al.
Published: (2025)

Steering Language Models With Activation Engineering
by: Turner, Alexander Matt, et al.
Published: (2023)

SteerVLM: Robust Model Control through Lightweight Activation Steering for Vision Language Models
by: Sivakumar, Anushka, et al.
Published: (2025)

Agnostics: Learning to Code in Any Programming Language via Reinforcement with a Universal Learning Environment
by: Boruch-Gruszecki, Aleksander, et al.
Published: (2025)

Steering MoE LLMs via Expert (De)Activation
by: Fayyaz, Mohsen, et al.
Published: (2025)

Depth-Wise Activation Steering for Honest Language Models
by: Góral, Gracjan, et al.
Published: (2025)

Extracting Unlearned Information from LLMs with Activation Steering
by: Seyitoğlu, Atakan, et al.
Published: (2024)

Automated Neuron Labelling Enables Generative Steering and Interpretability in Protein Language Models
by: Banerjee, Arjun, et al.
Published: (2025)

Global Evolutionary Steering: Refining Activation Steering Control via Cross-Layer Consistency
by: Jiang, Xinyan, et al.
Published: (2026)

A Unifying Human-Centered AI Fairness Framework
by: Rahman, Munshi Mahbubur, et al.
Published: (2025)

Enhancing Instruction Following of LLMs via Activation Steering with Dynamic Rejection
by: Kang, Minjae, et al.
Published: (2026)

Towards Causal Deep Learning for Vulnerability Detection
by: Rahman, Md Mahbubur, et al.
Published: (2023)

Steering Large Language Model Activations in Sparse Spaces
by: Bayat, Reza, et al.
Published: (2025)

Endogenous Resistance to Activation Steering in Language Models
by: McKenzie, Alex, et al.
Published: (2026)

Angular Steering: Behavior Control via Rotation in Activation Space
by: Vu, Hieu M., et al.
Published: (2025)

Local Linearity of LLMs Enables Activation Steering via Model-Based Linear Optimal Control
by: Skifstad, Julian, et al.
Published: (2026)

HyperSteer: Activation Steering at Scale with Hypernetworks
by: Sun, Jiuding, et al.
Published: (2025)

Spherical Steering: Geometry-Aware Activation Rotation for Language Models
by: You, Zejia, et al.
Published: (2026)

LeMo-NADe: Multi-Parameter Neural Architecture Discovery with LLMs
by: Rahman, Md Hafizur, et al.
Published: (2024)

Dynamically Scaled Activation Steering
by: Ferrando, Alex, et al.
Published: (2025)

SteerConf: Steering LLMs for Confidence Elicitation
by: Zhou, Ziang, et al.
Published: (2025)

SafeSteer: Interpretable Safety Steering with Refusal-Evasion in LLMs
by: Ghosh, Shaona, et al.
Published: (2025)

Steering to Say No: Configurable Refusal via Activation Steering in Vision Language Models
by: Yang, Jiaxi, et al.
Published: (2026)

Steer Like the LLM: Activation Steering that Mimics Prompting
by: Heyman, Geert, et al.
Published: (2026)

How a Bit Becomes a Story: Semantic Steering via Differentiable Fault Injection
by: Haider, Zafaryab, et al.
Published: (2025)

Uncovering Gaps in How Humans and LLMs Interpret Subjective Language
by: Jones, Erik, et al.
Published: (2025)

Minimizing Collateral Damage in Activation Steering
by: Nguyen, Tam, et al.
Published: (2026)

Steered LLM Activations are Non-Surjective
by: Mishra, Aayush, et al.
Published: (2026)

Activation Steering for Chain-of-Thought Compression
by: Azizi, Seyedarmin, et al.
Published: (2025)

Improving Instruction-Following in Language Models through Activation Steering
by: Stolfo, Alessandro, et al.
Published: (2024)

GuidedSampling: Steering LLMs Towards Diverse Candidate Solutions at Inference-Time
by: Handa, Divij, et al.
Published: (2025)

Beyond Steering Vector: Flow-based Activation Steering for Inference-Time Intervention
by: Jin, Zehao, et al.
Published: (2026)