:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Li, Yubin, Liu, Xingyu, Chen, Guozhang
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.13011
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Decoding Cortical Microcircuits: A Generative Model for Latent Space Exploration and Controlled Synthesis
by: Liu, Xingyu, et al.
Published: (2025)

Seeing It or Not? Interpretable Vision-aware Latent Steering to Mitigate Object Hallucinations
by: Chen, Boxu, et al.
Published: (2025)

Characterizing Continuous and Discrete Hybrid Latent Spaces for Structural Connectomes
by: Rudravaram, Gaurav, et al.
Published: (2025)

LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation
by: Shu, Huizhen, et al.
Published: (2025)

SLIM: Sparse Latent Steering for Interpretable and Property-Directed LLM-Based Molecular Editing
by: Zhang, Mingxu, et al.
Published: (2026)

Exploring the Personality Traits of LLMs through Latent Features Steering
by: Yang, Shu, et al.
Published: (2024)

Steer LLM Latents for Hallucination Detection
by: Park, Seongheon, et al.
Published: (2025)

ConnectomeBench: Can LLMs Proofread the Connectome?
by: Brown, Jeff, et al.
Published: (2025)

Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation
by: Prokopiou, Ioannis, et al.
Published: (2026)

ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation
by: Hosseini, Hesam, et al.
Published: (2024)

RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering
by: Ye, Wencheng, et al.
Published: (2026)

Activation Scaling for Steering and Interpreting Language Models
by: Stoehr, Niklas, et al.
Published: (2024)

Latent Reward Steering: An Adaptive Inference-Time Framework that Implicitly Promotes Cognitive Behaviors in Reasoning LLMs
by: Li, Jiakang, et al.
Published: (2026)

Memory Inception: Latent-Space KV Cache Manipulation for Steering LLMs
by: Liu, Andy Zeyi, et al.
Published: (2026)

Scalable Random Feature Latent Variable Models
by: Li, Ying, et al.
Published: (2024)

Activation Steering for Bias Mitigation: An Interpretable Approach to Safer LLMs
by: Dubey, Shivam
Published: (2025)

Large Connectome Model: An fMRI Foundation Model of Brain Connectomes Empowered by Brain-Environment Interaction in Multitask Learning Landscape
by: Wei, Ziquan, et al.
Published: (2025)

Feedforward Ordering in Neural Connectomes via Feedback Arc Minimization
by: Vahidi, Soroush
Published: (2025)

Local Causal Structure Learning in the Presence of Latent Variables
by: Xie, Feng, et al.
Published: (2024)

Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute
by: Liu, Sheng, et al.
Published: (2025)

Preemptive Detection and Steering of LLM Misalignment via Latent Reachability
by: Karnik, Sathwik, et al.
Published: (2025)

On Effects of Steering Latent Representation for Large Language Model Unlearning
by: Huu-Tien, Dang, et al.
Published: (2024)

DeepMultiConnectome: Deep Multi-Task Prediction of Structural Connectomes Directly from Diffusion MRI Tractography
by: Vroemen, Marcus J., et al.
Published: (2025)

Improving LLM Reasoning through Interpretable Role-Playing Steering
by: Wang, Anyi, et al.
Published: (2025)

Latent Policy Steering with Embodiment-Agnostic Pretrained World Models
by: Wang, Yiqi, et al.
Published: (2025)

Scaling Up Probabilistic Circuits by Latent Variable Distillation
by: Liu, Anji, et al.
Published: (2022)

Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization
by: Egbuna, Nathan, et al.
Published: (2025)

Interpretable Steering of Large Language Models with Feature Guided Activation Additions
by: Soo, Samuel, et al.
Published: (2025)

Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks
by: Wang, Yuwen, et al.
Published: (2024)

Hypergraph and Latent ODE Learning for Multimodal Root Cause Localization in Microservices
by: Liu, Xin, et al.
Published: (2026)

Fair In-Context Learning via Latent Concept Variables
by: Bhaila, Karuna, et al.
Published: (2024)

FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering
by: Ding, Zikang, et al.
Published: (2026)

Reducing Hallucinations in Vision-Language Models via Latent Space Steering
by: Liu, Sheng, et al.
Published: (2024)

Temporal Latent Variable Structural Causal Model for Causal Discovery under External Interferences
by: Cai, Ruichu, et al.
Published: (2025)

Model Connectomes: A Generational Approach to Data-Efficient Language Models
by: Kotar, Klemen, et al.
Published: (2025)

SketchJudge: A Diagnostic Benchmark for Grading Hand-drawn Diagrams with Multimodal Large Language Models
by: Su, Yuhang, et al.
Published: (2026)

DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation
by: Mao, Xiaowei, et al.
Published: (2024)

Two Heads Are Better than One: Model-Weight and Latent-Space Analysis for Federated Learning on Non-iid Data against Poisoning Attacks
by: Lyu, Xingyu, et al.
Published: (2025)

In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
by: Liu, Sheng, et al.
Published: (2023)

Natural Latents: Latent Variables Stable Across Ontologies
by: Wentworth, John, et al.
Published: (2025)