Guardado en:
| Autores principales: | Li, Jiakang, Zhu, Guanyu, Jin, Can, Huang, Chenxi, Yu, Dexu, Chen, Ronghao, Zhou, Yang, Peng, Hongwu, Lan, Xuanqi, Metaxas, Dimitris N., Li, Youhua |
|---|---|
| Formato: | Preprint |
| Publicado: |
2026
|
| Materias: | |
| Acceso en línea: | https://arxiv.org/abs/2606.00726 |
| Etiquetas: |
Agregar Etiqueta
Sin Etiquetas, Sea el primero en etiquetar este registro!
|
Ejemplares similares
Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models
por: Dafnis, Konstantinos M., et al.
Publicado: (2025)
por: Dafnis, Konstantinos M., et al.
Publicado: (2025)
Weak Critics Make Strong Learners: On-Policy Critique Distillation for Scalable Oversight
por: Jin, Can, et al.
Publicado: (2026)
por: Jin, Can, et al.
Publicado: (2026)
Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning
por: Jin, Can, et al.
Publicado: (2025)
por: Jin, Can, et al.
Publicado: (2025)
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
por: Jin, Can, et al.
Publicado: (2024)
por: Jin, Can, et al.
Publicado: (2024)
GSS: Gated Subspace Steering for Selective Memorization Mitigation in LLMs
por: Zhang, Xuanqi, et al.
Publicado: (2026)
por: Zhang, Xuanqi, et al.
Publicado: (2026)
Chain of Mindset: Reasoning with Adaptive Cognitive Modes
por: Jiang, Tianyi, et al.
Publicado: (2026)
por: Jiang, Tianyi, et al.
Publicado: (2026)
On the Role of Language Representations in Auto-Bidding: Findings and Implications
por: Zhu, Guanyu, et al.
Publicado: (2026)
por: Zhu, Guanyu, et al.
Publicado: (2026)
Distribution-Aware Reward Estimation for Test-Time Reinforcement Learning
por: Du, Bodong, et al.
Publicado: (2026)
por: Du, Bodong, et al.
Publicado: (2026)
ATLAS: Adaptive Test-Time Latent Steering with External Verifiers for Enhancing LLMs Reasoning
por: Nguyen, Tuc, et al.
Publicado: (2026)
por: Nguyen, Tuc, et al.
Publicado: (2026)
Beyond Explicit Edges: Robust Reasoning over Noisy and Sparse Knowledge Graphs
por: Gao, Hang, et al.
Publicado: (2026)
por: Gao, Hang, et al.
Publicado: (2026)
Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS
por: Jin, Can, et al.
Publicado: (2025)
por: Jin, Can, et al.
Publicado: (2025)
Steering Rectified Flow Models in the Vector Field for Controlled Image Generation
por: Patel, Maitreya, et al.
Publicado: (2024)
por: Patel, Maitreya, et al.
Publicado: (2024)
Thinking in Latents: Adaptive Anchor Refinement for Implicit Reasoning in LLMs
por: Sheshanarayana, Disha, et al.
Publicado: (2026)
por: Sheshanarayana, Disha, et al.
Publicado: (2026)
Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time
por: Zhang, Zhenyu, et al.
Publicado: (2025)
por: Zhang, Zhenyu, et al.
Publicado: (2025)
Beyond Interpretability: When, Why, and How Sparse Autoencoders Enable Label-Free Visual Steering
por: Chatzoudis, Gerasimos, et al.
Publicado: (2025)
por: Chatzoudis, Gerasimos, et al.
Publicado: (2025)
FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering
por: Li, Yichen, et al.
Publicado: (2025)
por: Li, Yichen, et al.
Publicado: (2025)
Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute
por: Liu, Sheng, et al.
Publicado: (2025)
por: Liu, Sheng, et al.
Publicado: (2025)
MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
por: Dao, Quan, et al.
Publicado: (2026)
por: Dao, Quan, et al.
Publicado: (2026)
Implicit In-context Learning
por: Li, Zhuowei, et al.
Publicado: (2024)
por: Li, Zhuowei, et al.
Publicado: (2024)
Improved Training Technique for Latent Consistency Models
por: Dao, Quan, et al.
Publicado: (2025)
por: Dao, Quan, et al.
Publicado: (2025)
Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety
por: Jin, Can, et al.
Publicado: (2026)
por: Jin, Can, et al.
Publicado: (2026)
Token-Controlled Re-ranking for Sequential Recommendation via LLMs
por: Dai, Wenxi, et al.
Publicado: (2025)
por: Dai, Wenxi, et al.
Publicado: (2025)
RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
por: Jin, Can, et al.
Publicado: (2025)
por: Jin, Can, et al.
Publicado: (2025)
Latent Implicit Visual Reasoning
por: Li, Kelvin, et al.
Publicado: (2025)
por: Li, Kelvin, et al.
Publicado: (2025)
DTop-p MoE: Sparsity-Controlled Dynamic Top-p MoE for Foundation Model Pre-training
por: Jin, Can, et al.
Publicado: (2025)
por: Jin, Can, et al.
Publicado: (2025)
APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking
por: Jin, Can, et al.
Publicado: (2024)
por: Jin, Can, et al.
Publicado: (2024)
RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering
por: Ye, Wencheng, et al.
Publicado: (2026)
por: Ye, Wencheng, et al.
Publicado: (2026)
CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards
por: Liu, Cheng, et al.
Publicado: (2025)
por: Liu, Cheng, et al.
Publicado: (2025)
Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
por: Wen, Xumeng, et al.
Publicado: (2025)
por: Wen, Xumeng, et al.
Publicado: (2025)
Mitigating Cognitive Inertia in Large Reasoning Models via Latent Spike Steering
por: Lee, Seojin, et al.
Publicado: (2026)
por: Lee, Seojin, et al.
Publicado: (2026)
Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR
por: Wang, Jiakang, et al.
Publicado: (2025)
por: Wang, Jiakang, et al.
Publicado: (2025)
Improving Visual Reasoning with Iterative Evidence Refinement
por: Shi, Zeru, et al.
Publicado: (2026)
por: Shi, Zeru, et al.
Publicado: (2026)
Score-Guided Diffusion for 3D Human Recovery
por: Stathopoulos, Anastasis, et al.
Publicado: (2024)
por: Stathopoulos, Anastasis, et al.
Publicado: (2024)
Adaptive Predefined‐Time Stabilization for A Class of Nonlinear Time‐Delay Systems With Input Unmodeled Dynamics
por: Qiang Li, et al.
Publicado: (2025)
por: Qiang Li, et al.
Publicado: (2025)
Learning Persistent Community Structures in Dynamic Networks via Topological Data Analysis
por: Kong, Dexu, et al.
Publicado: (2024)
por: Kong, Dexu, et al.
Publicado: (2024)
How to Trace Latent Generative Model Generated Images without Artificial Watermark?
por: Wang, Zhenting, et al.
Publicado: (2024)
por: Wang, Zhenting, et al.
Publicado: (2024)
HabitAction: A Video Dataset for Human Habitual Behavior Recognition
por: Li, Hongwu, et al.
Publicado: (2024)
por: Li, Hongwu, et al.
Publicado: (2024)
DARE: Difficulty-Adaptive Reinforcement Learning with Co-Evolved Difficulty Estimation
por: Zhou, Yang, et al.
Publicado: (2026)
por: Zhou, Yang, et al.
Publicado: (2026)
iCLP: Large Language Model Reasoning with Implicit Cognition Latent Planning
por: Chen, Sijia, et al.
Publicado: (2025)
por: Chen, Sijia, et al.
Publicado: (2025)
Adaptive Fuzzy‐Based Event‐Triggered Consensus of Switched Nonlinear Multiagent Systems With Communication Faults and State‐Dependent Switchings
por: Ronghao Zhang, et al.
Publicado: (2025)
por: Ronghao Zhang, et al.
Publicado: (2025)
Ejemplares similares
-
Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models
por: Dafnis, Konstantinos M., et al.
Publicado: (2025) -
Weak Critics Make Strong Learners: On-Policy Critique Distillation for Scalable Oversight
por: Jin, Can, et al.
Publicado: (2026) -
Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning
por: Jin, Can, et al.
Publicado: (2025) -
Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
por: Jin, Can, et al.
Publicado: (2024) -
GSS: Gated Subspace Steering for Selective Memorization Mitigation in LLMs
por: Zhang, Xuanqi, et al.
Publicado: (2026)