Saved in:
| Main Authors: | Ni, Juntong, Wang, Shiyu, He, Qi, Jin, Ming, Jin, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.03248 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation
by: Ni, Juntong, et al.
Published: (2025)
by: Ni, Juntong, et al.
Published: (2025)
How Long Reasoning Chains Influence LLMs' Judgment of Answer Factuality
by: Tu, Minzhu, et al.
Published: (2026)
by: Tu, Minzhu, et al.
Published: (2026)
STDec: Spatio-Temporal Stability Guided Decoding for dLLMs
by: Chen, Yuzhe, et al.
Published: (2026)
by: Chen, Yuzhe, et al.
Published: (2026)
TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2024)
by: Gu, Shangding, et al.
Published: (2024)
KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning
by: Wang, Shuai, et al.
Published: (2026)
by: Wang, Shuai, et al.
Published: (2026)
RePST: Language Model Empowered Spatio-Temporal Forecasting via Semantic-Oriented Reprogramming
by: Wang, Hao, et al.
Published: (2024)
by: Wang, Hao, et al.
Published: (2024)
Time Series Forecasting with LLMs: Understanding and Enhancing Model Capabilities
by: Tang, Hua, et al.
Published: (2024)
by: Tang, Hua, et al.
Published: (2024)
Ever: Mitigating Hallucination in Large Language Models through Real-Time Verification and Rectification
by: Kang, Haoqiang, et al.
Published: (2023)
by: Kang, Haoqiang, et al.
Published: (2023)
RLKD: Distilling LLMs' Reasoning via Reinforcement Learning
by: Xu, Shicheng, et al.
Published: (2025)
by: Xu, Shicheng, et al.
Published: (2025)
Tool-Star: Empowering LLM-Brained Multi-Tool Reasoner via Reinforcement Learning
by: Dong, Guanting, et al.
Published: (2025)
by: Dong, Guanting, et al.
Published: (2025)
Sonar-TS: Search-Then-Verify Natural Language Querying for Time Series Databases
by: Tan, Zhao, et al.
Published: (2026)
by: Tan, Zhao, et al.
Published: (2026)
R-Search: Empowering LLM Reasoning with Search via Multi-Reward Reinforcement Learning
by: Zhao, Qingfei, et al.
Published: (2025)
by: Zhao, Qingfei, et al.
Published: (2025)
Hybrid Latent Reasoning via Reinforcement Learning
by: Yue, Zhenrui, et al.
Published: (2025)
by: Yue, Zhenrui, et al.
Published: (2025)
Rectify Evaluation Preference: Improving LLMs' Critique on Math Reasoning via Perplexity-aware Reinforcement Learning
by: Tian, Changyuan, et al.
Published: (2025)
by: Tian, Changyuan, et al.
Published: (2025)
Tagging the Thought: Unlocking Personalization Reasoning via Reinforcement Learning
by: Jin, Song, et al.
Published: (2025)
by: Jin, Song, et al.
Published: (2025)
U-Cast: Learning Hierarchical Structures for High-Dimensional Time Series Forecasting
by: Ni, Juntong, et al.
Published: (2025)
by: Ni, Juntong, et al.
Published: (2025)
Amplification Effects in Test-Time Reinforcement Learning: Safety and Reasoning Vulnerabilities
by: Khattar, Vanshaj, et al.
Published: (2026)
by: Khattar, Vanshaj, et al.
Published: (2026)
R3-RAG: Learning Step-by-Step Reasoning and Retrieval for LLMs via Reinforcement Learning
by: Li, Yuan, et al.
Published: (2025)
by: Li, Yuan, et al.
Published: (2025)
Graph Reasoning Paradigm: Structured and Symbolic Reasoning with Topology-Aware Reinforcement Learning for Large Language Models
by: Liu, Runxuan, et al.
Published: (2026)
by: Liu, Runxuan, et al.
Published: (2026)
ThinkPrune: Pruning Long Chain-of-Thought of LLMs via Reinforcement Learning
by: Hou, Bairu, et al.
Published: (2025)
by: Hou, Bairu, et al.
Published: (2025)
Unlocking Reasoning Capabilities in LLMs via Reinforcement Learning Exploration
by: Deng, Wenhao, et al.
Published: (2025)
by: Deng, Wenhao, et al.
Published: (2025)
When Do LLMs Need Retrieval Augmentation? Mitigating LLMs' Overconfidence Helps Retrieval Augmentation
by: Ni, Shiyu, et al.
Published: (2024)
by: Ni, Shiyu, et al.
Published: (2024)
Adapting LLMs to Time Series Forecasting via Temporal Heterogeneity Modeling and Semantic Alignment
by: Sun, Yanru, et al.
Published: (2025)
by: Sun, Yanru, et al.
Published: (2025)
SpatialThinker: Reinforcing 3D Reasoning in Multimodal LLMs via Spatial Rewards
by: Batra, Hunar, et al.
Published: (2025)
by: Batra, Hunar, et al.
Published: (2025)
SRPO: Enhancing Multimodal LLM Reasoning via Reflection-Aware Reinforcement Learning
by: Wan, Zhongwei, et al.
Published: (2025)
by: Wan, Zhongwei, et al.
Published: (2025)
ReSearch: Learning to Reason with Search for LLMs via Reinforcement Learning
by: Chen, Mingyang, et al.
Published: (2025)
by: Chen, Mingyang, et al.
Published: (2025)
LiveThinking: Enabling Real-Time Efficient Reasoning for AI-Powered Livestreaming via Reinforcement Learning
by: Sun, Yuhan, et al.
Published: (2025)
by: Sun, Yuhan, et al.
Published: (2025)
GUI-KV: Efficient GUI Agents via KV Cache with Spatio-Temporal Awareness
by: Huang, Kung-Hsiang, et al.
Published: (2025)
by: Huang, Kung-Hsiang, et al.
Published: (2025)
EPO: Explicit Policy Optimization for Strategic Reasoning in LLMs via Reinforcement Learning
by: Liu, Xiaoqian, et al.
Published: (2025)
by: Liu, Xiaoqian, et al.
Published: (2025)
Can Large Language Models Adequately Perform Symbolic Reasoning Over Time Series?
by: Liu, Zewen, et al.
Published: (2025)
by: Liu, Zewen, et al.
Published: (2025)
Test of Time: A Benchmark for Evaluating LLMs on Temporal Reasoning
by: Fatemi, Bahare, et al.
Published: (2024)
by: Fatemi, Bahare, et al.
Published: (2024)
Question Answering Over Spatio-Temporal Knowledge Graph
by: Dai, Xinbang, et al.
Published: (2024)
by: Dai, Xinbang, et al.
Published: (2024)
From Building Blocks to Planning: Multi-Step Spatial Reasoning in LLMs with Reinforcement Learning
by: Tahmasbi, Amir, et al.
Published: (2025)
by: Tahmasbi, Amir, et al.
Published: (2025)
Training LLMs for EHR-Based Reasoning Tasks via Reinforcement Learning
by: Lin, Jiacheng, et al.
Published: (2025)
by: Lin, Jiacheng, et al.
Published: (2025)
TimeSage-MT: A Multi-Turn Benchmark for Evaluating Agentic Time Series Reasoning
by: Kong, Yaxuan, et al.
Published: (2026)
by: Kong, Yaxuan, et al.
Published: (2026)
Reliable Use of Lemmas via Eligibility Reasoning and Section$-$Aware Reinforcement Learning
by: Xu, Zhikun, et al.
Published: (2026)
by: Xu, Zhikun, et al.
Published: (2026)
Search-R1: Training LLMs to Reason and Leverage Search Engines with Reinforcement Learning
by: Jin, Bowen, et al.
Published: (2025)
by: Jin, Bowen, et al.
Published: (2025)
KDRL: Post-Training Reasoning LLMs via Unified Knowledge Distillation and Reinforcement Learning
by: Xu, Hongling, et al.
Published: (2025)
by: Xu, Hongling, et al.
Published: (2025)
Cell-o1: Training LLMs to Solve Single-Cell Reasoning Puzzles with Reinforcement Learning
by: Fang, Yin, et al.
Published: (2025)
by: Fang, Yin, et al.
Published: (2025)
LLM-PS: Empowering Large Language Models for Time Series Forecasting with Temporal Patterns and Semantics
by: Tang, Jialiang, et al.
Published: (2025)
by: Tang, Jialiang, et al.
Published: (2025)
Similar Items
-
TimeDistill: Efficient Long-Term Time Series Forecasting with MLP via Cross-Architecture Distillation
by: Ni, Juntong, et al.
Published: (2025) -
How Long Reasoning Chains Influence LLMs' Judgment of Answer Factuality
by: Tu, Minzhu, et al.
Published: (2026) -
STDec: Spatio-Temporal Stability Guided Decoding for dLLMs
by: Chen, Yuzhe, et al.
Published: (2026) -
TeaMs-RL: Teaching LLMs to Generate Better Instruction Datasets via Reinforcement Learning
by: Gu, Shangding, et al.
Published: (2024) -
KG-Hopper: Empowering Compact Open LLMs with Knowledge Graph Reasoning via Reinforcement Learning
by: Wang, Shuai, et al.
Published: (2026)