Saved in:
Bibliographic Details
Main Authors: Zhu, Dongxuan, Khanh, Ly Tran Ho, Cheung, Andy Yat-Ming, Yue, Man-Chung, Nguyen, Viet Anh
Format: Preprint
Published: 2026
Subjects:
Online Access:https://arxiv.org/abs/2601.22010
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911407286517760
author Zhu, Dongxuan
Khanh, Ly Tran Ho
Cheung, Andy Yat-Ming
Yue, Man-Chung
Nguyen, Viet Anh
author_facet Zhu, Dongxuan
Khanh, Ly Tran Ho
Cheung, Andy Yat-Ming
Yue, Man-Chung
Nguyen, Viet Anh
contents Language models often default to a narrow set of high-probability outputs, leaving their generation paths homogeneous and prone to mode collapse. Sampling-based strategies inject randomness but still struggle to guarantee diversity across multiple concurrent generation runs. We address this limitation by introducing STARS ($\textbf{St}$iefel-based $\textbf{A}$ctivation Steering for Diverse $\textbf{R}$ea$\textbf{S}$oning), a training-free, inference-time intervention method that transforms activation steering into an exploration engine. At each token, STARS collects the hidden activations of concurrent generation runs and optimizes multiple additive steering directions jointly on the Stiefel manifold. STARS maximizes the geometric volume of the steered activations, while the Stiefel manifold induces orthogonality of the steering interventions. This formulation explicitly promotes divergent activation vectors of concurrent generation runs, and implicitly promotes divergent generation trajectories. This manifold optimization formulation can be solved using a Riemannian gradient descent algorithm with convergence guarantees, but this algorithm is too time-consuming for real-time inference. To guarantee low latency, we further design a lightweight one-step update with an aggressive, closed-form stepsize. For test case generation and scientific discovery benchmarks, STARS consistently outperforms standard sampling methods, achieving greater diversity without sacrificing qualitative performance.
format Preprint
id arxiv_https___arxiv_org_abs_2601_22010
institution arXiv
publishDate 2026
record_format arxiv
spellingShingle Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering
Zhu, Dongxuan
Khanh, Ly Tran Ho
Cheung, Andy Yat-Ming
Yue, Man-Chung
Nguyen, Viet Anh
Machine Learning
Language models often default to a narrow set of high-probability outputs, leaving their generation paths homogeneous and prone to mode collapse. Sampling-based strategies inject randomness but still struggle to guarantee diversity across multiple concurrent generation runs. We address this limitation by introducing STARS ($\textbf{St}$iefel-based $\textbf{A}$ctivation Steering for Diverse $\textbf{R}$ea$\textbf{S}$oning), a training-free, inference-time intervention method that transforms activation steering into an exploration engine. At each token, STARS collects the hidden activations of concurrent generation runs and optimizes multiple additive steering directions jointly on the Stiefel manifold. STARS maximizes the geometric volume of the steered activations, while the Stiefel manifold induces orthogonality of the steering interventions. This formulation explicitly promotes divergent activation vectors of concurrent generation runs, and implicitly promotes divergent generation trajectories. This manifold optimization formulation can be solved using a Riemannian gradient descent algorithm with convergence guarantees, but this algorithm is too time-consuming for real-time inference. To guarantee low latency, we further design a lightweight one-step update with an aggressive, closed-form stepsize. For test case generation and scientific discovery benchmarks, STARS consistently outperforms standard sampling methods, achieving greater diversity without sacrificing qualitative performance.
title Exploring Diverse Generation Paths via Inference-time Stiefel Activation Steering
topic Machine Learning
url https://arxiv.org/abs/2601.22010