Saved in:
| Main Authors: | Yang, Wei, Xie, Hong, Tan, Tao, Li, Xin, Lian, Defu, Chen, Enhong |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01346 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective
by: Xie, Hong, et al.
Published: (2026)
by: Xie, Hong, et al.
Published: (2026)
Analytical and Empirical Study of Herding Effects in Recommendation Systems
by: Xie, Hong, et al.
Published: (2024)
by: Xie, Hong, et al.
Published: (2024)
Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities
by: Xie, Hong, et al.
Published: (2024)
by: Xie, Hong, et al.
Published: (2024)
Securing Recommender System via Cooperative Training
by: Wang, Qingyang, et al.
Published: (2024)
by: Wang, Qingyang, et al.
Published: (2024)
Efficient Machine Unlearning via Influence Approximation
by: Liu, Jiawei, et al.
Published: (2025)
by: Liu, Jiawei, et al.
Published: (2025)
Understanding Privacy Risks of Embeddings Induced by Large Language Models
by: Zhu, Zhihao, et al.
Published: (2024)
by: Zhu, Zhihao, et al.
Published: (2024)
Refine Large Language Model Fine-tuning via Instruction Vector
by: Jiang, Gangwei, et al.
Published: (2024)
by: Jiang, Gangwei, et al.
Published: (2024)
Multiple-play Stochastic Bandits with Prioritized Arm Capacity Sharing
by: Xie, Hong, et al.
Published: (2025)
by: Xie, Hong, et al.
Published: (2025)
UniMEL: A Unified Framework for Multimodal Entity Linking with Large Language Models
by: Qi, Liu, et al.
Published: (2024)
by: Qi, Liu, et al.
Published: (2024)
Federated Contextual Cascading Bandits with Asynchronous Communication and Heterogeneous Users
by: Yang, Hantao, et al.
Published: (2024)
by: Yang, Hantao, et al.
Published: (2024)
Rethinking Reinforcement fine-tuning of LLMs: A Multi-armed Bandit Learning Perspective
by: Hu, Xiao, et al.
Published: (2026)
by: Hu, Xiao, et al.
Published: (2026)
PiXTime: A Model for Federated Time Series Forecasting with Heterogeneous Data across Nodes
by: Zhou, Yiming, et al.
Published: (2026)
by: Zhou, Yiming, et al.
Published: (2026)
RecExplainer: Aligning Large Language Models for Explaining Recommendation Models
by: Lei, Yuxuan, et al.
Published: (2023)
by: Lei, Yuxuan, et al.
Published: (2023)
Do All Individual Layers Help? An Empirical Study of Task-Interfering Layers in Vision-Language Models
by: Liu, Zhiming, et al.
Published: (2026)
by: Liu, Zhiming, et al.
Published: (2026)
Recommender AI Agent: Integrating Large Language Models for Interactive Recommendations
by: Huang, Xu, et al.
Published: (2023)
by: Huang, Xu, et al.
Published: (2023)
Mitigate Negative Transfer with Similarity Heuristic Lifelong Prompt Tuning
by: Wu, Chenyuan, et al.
Published: (2024)
by: Wu, Chenyuan, et al.
Published: (2024)
Efficient and Stable Reinforcement Learning for Diffusion Language Models
by: Liu, Jiawei, et al.
Published: (2026)
by: Liu, Jiawei, et al.
Published: (2026)
AgentCAT: An LLM Agent for Extracting and Analyzing Catalytic Reaction Data from Chemical Engineering Literature
by: Yang, Wei, et al.
Published: (2026)
by: Yang, Wei, et al.
Published: (2026)
Learning from Emptiness: De-biasing Listwise Rerankers with Content-Agnostic Probability Calibration
by: Lv, Hang, et al.
Published: (2026)
by: Lv, Hang, et al.
Published: (2026)
Foundations and Frontiers of Graph Learning Theory
by: Huang, Yu, et al.
Published: (2024)
by: Huang, Yu, et al.
Published: (2024)
A Unified Frequency Domain Decomposition Framework for Interpretable and Robust Time Series Forecasting
by: He, Cheng, et al.
Published: (2025)
by: He, Cheng, et al.
Published: (2025)
CoSteer: Collaborative Decoding-Time Personalization via Local Delta Steering
by: Lv, Hang, et al.
Published: (2025)
by: Lv, Hang, et al.
Published: (2025)
Verifying Large Language Models' Reasoning Paths via Correlation Matrix Rank
by: Liu, Jiayu, et al.
Published: (2025)
by: Liu, Jiayu, et al.
Published: (2025)
When Large Language Models Meet Personalization: Perspectives of Challenges and Opportunities
by: Chen, Jin, et al.
Published: (2023)
by: Chen, Jin, et al.
Published: (2023)
CeProAgents: A Hierarchical Agents System for Automated Chemical Process Development
by: Yang, Yuhang, et al.
Published: (2026)
by: Yang, Yuhang, et al.
Published: (2026)
Foundation Models for Demand Forecasting via Dual-Strategy Ensembling
by: Yang, Wei, et al.
Published: (2025)
by: Yang, Wei, et al.
Published: (2025)
InstructTime++: Time Series Classification with Multimodal Language Modeling via Implicit Feature Enhancement
by: Cheng, Mingyue, et al.
Published: (2026)
by: Cheng, Mingyue, et al.
Published: (2026)
Learning Complete Topology-Aware Correlations Between Relations for Inductive Link Prediction
by: Wang, Jie, et al.
Published: (2023)
by: Wang, Jie, et al.
Published: (2023)
Invariant Representation via Decoupling Style and Spurious Features from Images
by: Li, Ruimeng, et al.
Published: (2023)
by: Li, Ruimeng, et al.
Published: (2023)
Learning Partially Aligned Item Representation for Cross-Domain Sequential Recommendation
by: Yin, Mingjia, et al.
Published: (2024)
by: Yin, Mingjia, et al.
Published: (2024)
CastFlow: Learning Role-Specialized Agentic Workflows for Time Series Forecasting
by: Pan, Bokai, et al.
Published: (2026)
by: Pan, Bokai, et al.
Published: (2026)
HyperTree Planning: Enhancing LLM Reasoning via Hierarchical Thinking
by: Gui, Runquan, et al.
Published: (2025)
by: Gui, Runquan, et al.
Published: (2025)
Evaluating Small Language Models for Agentic On-Farm Decision Support Systems
by: Liu, Enhong, et al.
Published: (2025)
by: Liu, Enhong, et al.
Published: (2025)
SwiftVLM: Efficient Vision-Language Model Inference via Cross-Layer Token Bypass
by: Qian, Chen, et al.
Published: (2026)
by: Qian, Chen, et al.
Published: (2026)
WESE: Weak Exploration to Strong Exploitation for LLM Agents
by: Huang, Xu, et al.
Published: (2024)
by: Huang, Xu, et al.
Published: (2024)
Cross-Layer Vision Smoothing: Enhancing Visual Understanding via Sustained Focus on Key Objects in Large Vision-Language Models
by: Zhao, Jianfei, et al.
Published: (2025)
by: Zhao, Jianfei, et al.
Published: (2025)
Denoising Pre-Training and Customized Prompt Learning for Efficient Multi-Behavior Sequential Recommendation
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
Vision-Language Model Selection and Reuse for Downstream Adaptation
by: Tan, Hao-Zhe, et al.
Published: (2025)
by: Tan, Hao-Zhe, et al.
Published: (2025)
Distilling to Hybrid Attention Models via KL-Guided Layer Selection
by: Li, Yanhong, et al.
Published: (2025)
by: Li, Yanhong, et al.
Published: (2025)
AHAMask: Reliable Task Specification for Large Audio Language Models without Instructions
by: Guo, Yiwei, et al.
Published: (2025)
by: Guo, Yiwei, et al.
Published: (2025)
Similar Items
-
Demystifying Design Choices of Reinforcement Fine-tuning: A Batched Contextual Bandit Learning Perspective
by: Xie, Hong, et al.
Published: (2026) -
Analytical and Empirical Study of Herding Effects in Recommendation Systems
by: Xie, Hong, et al.
Published: (2024) -
Multi-agent Multi-armed Bandits with Stochastic Sharable Arm Capacities
by: Xie, Hong, et al.
Published: (2024) -
Securing Recommender System via Cooperative Training
by: Wang, Qingyang, et al.
Published: (2024) -
Efficient Machine Unlearning via Influence Approximation
by: Liu, Jiawei, et al.
Published: (2025)