:: Library Catalog

Bibliographic Details
Main Authors:	Li, Jiakang; Zhu, Guanyu; Jin, Can; Huang, Chenxi; Yu, Dexu; Chen, Ronghao; Zhou, Yang; Peng, Hongwu; Lan, Xuanqi; Metaxas, Dimitris N.; Li, Youhua
Format:	Preprint
Published:	2026
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2606.00726

Similar Items

Test-Time Spectrum-Aware Latent Steering for Zero-Shot Generalization in Vision-Language Models
by: Dafnis, Konstantinos M., et al.
Published: (2025)

Weak Critics Make Strong Learners: On-Policy Critique Distillation for Scalable Oversight
by: Jin, Can, et al.
Published: (2026)

Two Heads are Better Than One: Test-time Scaling of Multi-agent Collaborative Reasoning
by: Jin, Can, et al.
Published: (2025)

Learning from Teaching Regularization: Generalizable Correlations Should be Easy to Imitate
by: Jin, Can, et al.
Published: (2024)

GSS: Gated Subspace Steering for Selective Memorization Mitigation in LLMs
by: Zhang, Xuanqi, et al.
Published: (2026)

Chain of Mindset: Reasoning with Adaptive Cognitive Modes
by: Jiang, Tianyi, et al.
Published: (2026)

On the Role of Language Representations in Auto-Bidding: Findings and Implications
by: Zhu, Guanyu, et al.
Published: (2026)

Distribution-Aware Reward Estimation for Test-Time Reinforcement Learning
by: Du, Bodong, et al.
Published: (2026)

ATLAS: Adaptive Test-Time Latent Steering with External Verifiers for Enhancing LLMs Reasoning
by: Nguyen, Tuc, et al.
Published: (2026)

Beyond Explicit Edges: Robust Reasoning over Noisy and Sparse Knowledge Graphs
by: Gao, Hang, et al.
Published: (2026)

Your Reward Function for RL is Your Best PRM for Search: Unifying RL and Search-Based TTS
by: Jin, Can, et al.
Published: (2025)

Steering Rectified Flow Models in the Vector Field for Controlled Image Generation
by: Patel, Maitreya, et al.
Published: (2024)

Thinking in Latents: Adaptive Anchor Refinement for Implicit Reasoning in LLMs
by: Sheshanarayana, Disha, et al.
Published: (2026)

Understanding and Steering the Cognitive Behaviors of Reasoning Models at Test-Time
by: Zhang, Zhenyu, et al.
Published: (2025)

Beyond Interpretability: When, Why, and How Sparse Autoencoders Enable Label-Free Visual Steering
by: Chatzoudis, Gerasimos, et al.
Published: (2025)

FairSteer: Inference Time Debiasing for LLMs with Dynamic Activation Steering
by: Li, Yichen, et al.
Published: (2025)

Fractional Reasoning via Latent Steering Vectors Improves Inference Time Compute
by: Liu, Sheng, et al.
Published: (2025)

MPDiT: Multi-Patch Global-to-Local Transformer Architecture For Efficient Flow Matching and Diffusion Model
by: Dao, Quan, et al.
Published: (2026)

Implicit In-context Learning
by: Li, Zhuowei, et al.
Published: (2024)

Improved Training Technique for Latent Consistency Models
by: Dao, Quan, et al.
Published: (2025)

Reasoning over Precedents Alongside Statutes: Case-Augmented Deliberative Alignment for LLM Safety
by: Jin, Can, et al.
Published: (2026)

Token-Controlled Re-ranking for Sequential Recommendation via LLMs
by: Dai, Wenxi, et al.
Published: (2025)

RankFlow: A Multi-Role Collaborative Reranking Workflow Utilizing Large Language Models
by: Jin, Can, et al.
Published: (2025)

Latent Implicit Visual Reasoning
by: Li, Kelvin, et al.
Published: (2025)

DTop-p MoE: Sparsity-Controlled Dynamic Top-p MoE for Foundation Model Pre-training
by: Jin, Can, et al.
Published: (2025)

APEER: Automatic Prompt Engineering Enhances Large Language Model Reranking
by: Jin, Can, et al.
Published: (2024)

RISER: Orchestrating Latent Reasoning Skills for Adaptive Activation Steering
by: Ye, Wencheng, et al.
Published: (2026)

CogDual: Enhancing Dual Cognition of LLMs via Reinforcement Learning with Implicit Rule-Based Rewards
by: Liu, Cheng, et al.
Published: (2025)

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs
by: Wen, Xumeng, et al.
Published: (2025)

Mitigating Cognitive Inertia in Large Reasoning Models via Latent Spike Steering
by: Lee, Seojin, et al.
Published: (2026)

Stabilizing Knowledge, Promoting Reasoning: Dual-Token Constraints for RLVR
by: Wang, Jiakang, et al.
Published: (2025)

Improving Visual Reasoning with Iterative Evidence Refinement
by: Shi, Zeru, et al.
Published: (2026)

Score-Guided Diffusion for 3D Human Recovery
by: Stathopoulos, Anastasis, et al.
Published: (2024)

Adaptive Predefined‐Time Stabilization for A Class of Nonlinear Time‐Delay Systems With Input Unmodeled Dynamics
by: Qiang Li, et al.
Published: (2025)

Learning Persistent Community Structures in Dynamic Networks via Topological Data Analysis
by: Kong, Dexu, et al.
Published: (2024)

How to Trace Latent Generative Model Generated Images without Artificial Watermark?
by: Wang, Zhenting, et al.
Published: (2024)

HabitAction: A Video Dataset for Human Habitual Behavior Recognition
by: Li, Hongwu, et al.
Published: (2024)

DARE: Difficulty-Adaptive Reinforcement Learning with Co-Evolved Difficulty Estimation
by: Zhou, Yang, et al.
Published: (2026)

iCLP: Large Language Model Reasoning with Implicit Cognition Latent Planning
by: Chen, Sijia, et al.
Published: (2025)

Adaptive Fuzzy‐Based Event‐Triggered Consensus of Switched Nonlinear Multiagent Systems With Communication Faults and State‐Dependent Switchings
by: Ronghao Zhang, et al.
Published: (2025)