Saved in:
| Main Authors: | Li, Yubin, Liu, Xingyu, Chen, Guozhang |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2505.13011 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Decoding Cortical Microcircuits: A Generative Model for Latent Space Exploration and Controlled Synthesis
by: Liu, Xingyu, et al.
Published: (2025)
by: Liu, Xingyu, et al.
Published: (2025)
Seeing It or Not? Interpretable Vision-aware Latent Steering to Mitigate Object Hallucinations
by: Chen, Boxu, et al.
Published: (2025)
by: Chen, Boxu, et al.
Published: (2025)
Characterizing Continuous and Discrete Hybrid Latent Spaces for Structural Connectomes
by: Rudravaram, Gaurav, et al.
Published: (2025)
by: Rudravaram, Gaurav, et al.
Published: (2025)
LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation
by: Shu, Huizhen, et al.
Published: (2025)
by: Shu, Huizhen, et al.
Published: (2025)
SLIM: Sparse Latent Steering for Interpretable and Property-Directed LLM-Based Molecular Editing
by: Zhang, Mingxu, et al.
Published: (2026)
by: Zhang, Mingxu, et al.
Published: (2026)
Exploring the Personality Traits of LLMs through Latent Features Steering
by: Yang, Shu, et al.
Published: (2024)
by: Yang, Shu, et al.
Published: (2024)
Steer LLM Latents for Hallucination Detection
by: Park, Seongheon, et al.
Published: (2025)
by: Park, Seongheon, et al.
Published: (2025)
ConnectomeBench: Can LLMs Proofread the Connectome?
by: Brown, Jeff, et al.
Published: (2025)
by: Brown, Jeff, et al.
Published: (2025)
Latent Space Disentanglement via Activation Steering for Interpretable Attribute Control in Symbolic Music Generation
by: Prokopiou, Ioannis, et al.
Published: (2026)
by: Prokopiou, Ioannis, et al.
Published: (2026)
ULTra: Unveiling Latent Token Interpretability in Transformer-Based Understanding and Segmentation
by: Hosseini, Hesam, et al.
Published: (2024)
by: Hosseini, Hesam, et al.
Published: (2024)
RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering
by: Ye, Wencheng, et al.
Published: (2026)
by: Ye, Wencheng, et al.
Published: (2026)
Activation Scaling for Steering and Interpreting Language Models
by: Stoehr, Niklas, et al.
Published: (2024)
by: Stoehr, Niklas, et al.
Published: (2024)
Latent Reward Steering: An Adaptive Inference-Time Framework that Implicitly Promotes Cognitive Behaviors in Reasoning LLMs
by: Li, Jiakang, et al.
Published: (2026)
by: Li, Jiakang, et al.
Published: (2026)
Memory Inception: Latent-Space KV Cache Manipulation for Steering LLMs
by: Liu, Andy Zeyi, et al.
Published: (2026)
by: Liu, Andy Zeyi, et al.
Published: (2026)
Scalable Random Feature Latent Variable Models
by: Li, Ying, et al.
Published: (2024)
by: Li, Ying, et al.
Published: (2024)
Activation Steering for Bias Mitigation: An Interpretable Approach to Safer LLMs
by: Dubey, Shivam
Published: (2025)
by: Dubey, Shivam
Published: (2025)
Large Connectome Model: An fMRI Foundation Model of Brain Connectomes Empowered by Brain-Environment Interaction in Multitask Learning Landscape
by: Wei, Ziquan, et al.
Published: (2025)
by: Wei, Ziquan, et al.
Published: (2025)
Feedforward Ordering in Neural Connectomes via Feedback Arc Minimization
by: Vahidi, Soroush
Published: (2025)
by: Vahidi, Soroush
Published: (2025)
Local Causal Structure Learning in the Presence of Latent Variables
by: Xie, Feng, et al.
Published: (2024)
by: Xie, Feng, et al.
Published: (2024)
Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute
by: Liu, Sheng, et al.
Published: (2025)
by: Liu, Sheng, et al.
Published: (2025)
Preemptive Detection and Steering of LLM Misalignment via Latent Reachability
by: Karnik, Sathwik, et al.
Published: (2025)
by: Karnik, Sathwik, et al.
Published: (2025)
On Effects of Steering Latent Representation for Large Language Model Unlearning
by: Huu-Tien, Dang, et al.
Published: (2024)
by: Huu-Tien, Dang, et al.
Published: (2024)
DeepMultiConnectome: Deep Multi-Task Prediction of Structural Connectomes Directly from Diffusion MRI Tractography
by: Vroemen, Marcus J., et al.
Published: (2025)
by: Vroemen, Marcus J., et al.
Published: (2025)
Improving LLM Reasoning through Interpretable Role-Playing Steering
by: Wang, Anyi, et al.
Published: (2025)
by: Wang, Anyi, et al.
Published: (2025)
Latent Policy Steering with Embodiment-Agnostic Pretrained World Models
by: Wang, Yiqi, et al.
Published: (2025)
by: Wang, Yiqi, et al.
Published: (2025)
Scaling Up Probabilistic Circuits by Latent Variable Distillation
by: Liu, Anji, et al.
Published: (2022)
by: Liu, Anji, et al.
Published: (2022)
Amortized Latent Steering: Low-Cost Alternative to Test-Time Optimization
by: Egbuna, Nathan, et al.
Published: (2025)
by: Egbuna, Nathan, et al.
Published: (2025)
Interpretable Steering of Large Language Models with Feature Guided Activation Additions
by: Soo, Samuel, et al.
Published: (2025)
by: Soo, Samuel, et al.
Published: (2025)
Unveiling Global Interactive Patterns across Graphs: Towards Interpretable Graph Neural Networks
by: Wang, Yuwen, et al.
Published: (2024)
by: Wang, Yuwen, et al.
Published: (2024)
Hypergraph and Latent ODE Learning for Multimodal Root Cause Localization in Microservices
by: Liu, Xin, et al.
Published: (2026)
by: Liu, Xin, et al.
Published: (2026)
Fair In-Context Learning via Latent Concept Variables
by: Bhaila, Karuna, et al.
Published: (2024)
by: Bhaila, Karuna, et al.
Published: (2024)
FaithSteer-BENCH: A Deployment-Aligned Stress-Testing Benchmark for Inference-Time Steering
by: Ding, Zikang, et al.
Published: (2026)
by: Ding, Zikang, et al.
Published: (2026)
Reducing Hallucinations in Vision-Language Models via Latent Space Steering
by: Liu, Sheng, et al.
Published: (2024)
by: Liu, Sheng, et al.
Published: (2024)
Temporal Latent Variable Structural Causal Model for Causal Discovery under External Interferences
by: Cai, Ruichu, et al.
Published: (2025)
by: Cai, Ruichu, et al.
Published: (2025)
Model Connectomes: A Generational Approach to Data-Efficient Language Models
by: Kotar, Klemen, et al.
Published: (2025)
by: Kotar, Klemen, et al.
Published: (2025)
SketchJudge: A Diagnostic Benchmark for Grading Hand-drawn Diagrams with Multimodal Large Language Models
by: Su, Yuhang, et al.
Published: (2026)
by: Su, Yuhang, et al.
Published: (2026)
DutyTTE: Deciphering Uncertainty in Origin-Destination Travel Time Estimation
by: Mao, Xiaowei, et al.
Published: (2024)
by: Mao, Xiaowei, et al.
Published: (2024)
Two Heads Are Better than One: Model-Weight and Latent-Space Analysis for Federated Learning on Non-iid Data against Poisoning Attacks
by: Lyu, Xingyu, et al.
Published: (2025)
by: Lyu, Xingyu, et al.
Published: (2025)
In-context Vectors: Making In Context Learning More Effective and Controllable Through Latent Space Steering
by: Liu, Sheng, et al.
Published: (2023)
by: Liu, Sheng, et al.
Published: (2023)
Natural Latents: Latent Variables Stable Across Ontologies
by: Wentworth, John, et al.
Published: (2025)
by: Wentworth, John, et al.
Published: (2025)
Similar Items
-
Decoding Cortical Microcircuits: A Generative Model for Latent Space Exploration and Controlled Synthesis
by: Liu, Xingyu, et al.
Published: (2025) -
Seeing It or Not? Interpretable Vision-aware Latent Steering to Mitigate Object Hallucinations
by: Chen, Boxu, et al.
Published: (2025) -
Characterizing Continuous and Discrete Hybrid Latent Spaces for Structural Connectomes
by: Rudravaram, Gaurav, et al.
Published: (2025) -
LatentGuard: Controllable Latent Steering for Robust Refusal of Attacks and Reliable Response Generation
by: Shu, Huizhen, et al.
Published: (2025) -
SLIM: Sparse Latent Steering for Interpretable and Property-Directed LLM-Based Molecular Editing
by: Zhang, Mingxu, et al.
Published: (2026)