Saved in:
| Main Authors: | Lin, Dongding, Wang, Jian, Li, Yongqi, Li, Wenjie |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2604.20749 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
by: Wang, Jian, et al.
Published: (2024)
by: Wang, Jian, et al.
Published: (2024)
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
by: Wang, Jian, et al.
Published: (2024)
by: Wang, Jian, et al.
Published: (2024)
R$^2$ec: Towards Large Recommender Models with Reasoning
by: You, Runyang, et al.
Published: (2025)
by: You, Runyang, et al.
Published: (2025)
Scaling over Scaling: Exploring Test-Time Scaling Plateau in Large Reasoning Models
by: Wang, Jian, et al.
Published: (2025)
by: Wang, Jian, et al.
Published: (2025)
Parallel Test-Time Scaling for Latent Reasoning Models
by: You, Runyang, et al.
Published: (2025)
by: You, Runyang, et al.
Published: (2025)
Aligning Deep Implicit Preferences by Learning to Reason Defensively
by: Li, Peiming, et al.
Published: (2025)
by: Li, Peiming, et al.
Published: (2025)
Omni-R1: Towards the Unified Generative Paradigm for Multimodal Reasoning
by: Cheng, Dongjie, et al.
Published: (2026)
by: Cheng, Dongjie, et al.
Published: (2026)
TInR: Exploring Tool-Internalized Reasoning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2026)
by: Xu, Qiancheng, et al.
Published: (2026)
TokenSkip: Controllable Chain-of-Thought Compression in LLMs
by: Xia, Heming, et al.
Published: (2025)
by: Xia, Heming, et al.
Published: (2025)
DynamicPO: Dynamic Preference Optimization for Recommendation
by: Hu, Xingyu, et al.
Published: (2026)
by: Hu, Xingyu, et al.
Published: (2026)
ImplicitRM: Unbiased Reward Modeling from Implicit Preference Data for LLM alignment
by: Wang, Hao, et al.
Published: (2026)
by: Wang, Hao, et al.
Published: (2026)
Enhancing Tool Retrieval with Iterative Feedback from Large Language Models
by: Xu, Qiancheng, et al.
Published: (2024)
by: Xu, Qiancheng, et al.
Published: (2024)
Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
by: Chen, Chao, et al.
Published: (2025)
by: Chen, Chao, et al.
Published: (2025)
Implicit Safety Alignment from Crowd Preferences
by: Lin, Qian, et al.
Published: (2026)
by: Lin, Qian, et al.
Published: (2026)
Reasoning over User Preferences: Knowledge Graph-Augmented LLMs for Explainable Conversational Recommendations
by: Qiu, Zhangchi, et al.
Published: (2024)
by: Qiu, Zhangchi, et al.
Published: (2024)
ImpReSS: Implicit Recommender System for Support Conversations
by: Haller, Omri, et al.
Published: (2025)
by: Haller, Omri, et al.
Published: (2025)
Conditional Quantile Estimation for Uncertain Watch Time in Short-Video Recommendation
by: Lin, Chengzhi, et al.
Published: (2024)
by: Lin, Chengzhi, et al.
Published: (2024)
One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment
by: Cai, Hongru, et al.
Published: (2026)
by: Cai, Hongru, et al.
Published: (2026)
Distillation Enhanced Generative Retrieval
by: Li, Yongqi, et al.
Published: (2024)
by: Li, Yongqi, et al.
Published: (2024)
Reinforced Latent Reasoning for LLM-based Recommendation
by: Zhang, Yang, et al.
Published: (2025)
by: Zhang, Yang, et al.
Published: (2025)
Aligning VLM Assistants with Personalized Situated Cognition
by: Li, Yongqi, et al.
Published: (2025)
by: Li, Yongqi, et al.
Published: (2025)
Controlling Multimodal Conversational Agents with Coverage-Enhanced Latent Actions
by: Li, Yongqi, et al.
Published: (2026)
by: Li, Yongqi, et al.
Published: (2026)
Interleaved-Modal Chain-of-Thought
by: Gao, Jun, et al.
Published: (2024)
by: Gao, Jun, et al.
Published: (2024)
Personalized Large Language Model Assistant with Evolving Conditional Memory
by: Yuan, Ruifeng, et al.
Published: (2023)
by: Yuan, Ruifeng, et al.
Published: (2023)
Large Language Models Empowered Personalized Web Agents
by: Cai, Hongru, et al.
Published: (2024)
by: Cai, Hongru, et al.
Published: (2024)
RecNet: Self-Evolving Preference Propagation for Agentic Recommender Systems
by: Li, Bingqian, et al.
Published: (2026)
by: Li, Bingqian, et al.
Published: (2026)
Safe Semantics, Unsafe Interpretations: Tackling Implicit Reasoning Safety in Large Vision-Language Models
by: Cai, Wei, et al.
Published: (2025)
by: Cai, Wei, et al.
Published: (2025)
AR-Omni: A Unified Autoregressive Model for Any-to-Any Generation
by: Cheng, Dongjie, et al.
Published: (2026)
by: Cheng, Dongjie, et al.
Published: (2026)
PEToolLLM: Towards Personalized Tool Learning in Large Language Models
by: Xu, Qiancheng, et al.
Published: (2025)
by: Xu, Qiancheng, et al.
Published: (2025)
Subtle Errors in Reasoning: Preference Learning via Error-injected Self-editing
by: Xu, Kaishuai, et al.
Published: (2024)
by: Xu, Kaishuai, et al.
Published: (2024)
Auto-Rubric as Reward: From Implicit Preferences to Explicit Multimodal Generative Criteria
by: Tian, Juanxi, et al.
Published: (2026)
by: Tian, Juanxi, et al.
Published: (2026)
Extended Inductive Reasoning for Personalized Preference Inference from Behavioral Signals
by: Li, Jia-Nan, et al.
Published: (2025)
by: Li, Jia-Nan, et al.
Published: (2025)
Validating Generalist Robots with Situation Calculus and STL Falsification
by: Li, Changwen, et al.
Published: (2026)
by: Li, Changwen, et al.
Published: (2026)
Generative Cross-Modal Retrieval: Memorizing Images in Multimodal Language Models for Retrieval and Beyond
by: Li, Yongqi, et al.
Published: (2024)
by: Li, Yongqi, et al.
Published: (2024)
Generating Usage-related Questions for Preference Elicitation in Conversational Recommender Systems
by: Kostric, Ivica, et al.
Published: (2021)
by: Kostric, Ivica, et al.
Published: (2021)
Implicit Neural Differential Model for Spatiotemporal Dynamics
by: Akhare, Deepak, et al.
Published: (2025)
by: Akhare, Deepak, et al.
Published: (2025)
Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts
by: Cao, Xianwei, et al.
Published: (2026)
by: Cao, Xianwei, et al.
Published: (2026)
Reason-to-Recommend: Using Interaction-of-Thought Reasoning to Enhance LLM Recommendation
by: Zhao, Keyu, et al.
Published: (2025)
by: Zhao, Keyu, et al.
Published: (2025)
Finding RELIEF: Shaping Reasoning Behavior without Reasoning Supervision via Belief Engineering
by: Leong, Chak Tou, et al.
Published: (2026)
by: Leong, Chak Tou, et al.
Published: (2026)
Decision-aware User Simulation Agent for Evaluating Conversational Recommender Systems
by: Li, Yuan-Chi, et al.
Published: (2026)
by: Li, Yuan-Chi, et al.
Published: (2026)
Similar Items
-
Target-constrained Bidirectional Planning for Generation of Target-oriented Proactive Dialogue
by: Wang, Jian, et al.
Published: (2024) -
Instruct Once, Chat Consistently in Multiple Rounds: An Efficient Tuning Framework for Dialogue
by: Wang, Jian, et al.
Published: (2024) -
R$^2$ec: Towards Large Recommender Models with Reasoning
by: You, Runyang, et al.
Published: (2025) -
Scaling over Scaling: Exploring Test-Time Scaling Plateau in Large Reasoning Models
by: Wang, Jian, et al.
Published: (2025) -
Parallel Test-Time Scaling for Latent Reasoning Models
by: You, Runyang, et al.
Published: (2025)