Saved in:
| Main Authors: | Liu, Yihao, Li, Shuocheng, Cao, Lang, Xie, Yuhang, Zhou, Mengyu, Dong, Haoyu, Ma, Xiaojun, Han, Shi, Zhang, Dongmei |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.01096 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Not All Tokens Matter: Towards Efficient LLM Reasoning via Token Significance in Reinforcement Learning
by: Liu, Hanbing, et al.
Published: (2025)
by: Liu, Hanbing, et al.
Published: (2025)
TablePilot: Recommending Human-Preferred Tabular Data Analysis with Large Language Models
by: Yi, Deyin, et al.
Published: (2025)
by: Yi, Deyin, et al.
Published: (2025)
Formula-R1: Incentivizing LLM Reasoning over Complex Tables with Numerical Computation via Formula-Driven Reinforcement Learning
by: Cao, Lang, et al.
Published: (2025)
by: Cao, Lang, et al.
Published: (2025)
Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
by: Li, Shuocheng, et al.
Published: (2025)
by: Li, Shuocheng, et al.
Published: (2025)
TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models
by: He, Xinyi, et al.
Published: (2025)
by: He, Xinyi, et al.
Published: (2025)
SheetBrain: A Neuro-Symbolic Agent for Accurate Reasoning over Complex and Large Spreadsheets
by: Wang, Ziwei, et al.
Published: (2025)
by: Wang, Ziwei, et al.
Published: (2025)
Table-LLM-Specialist: Language Model Specialists for Tables using Iterative Generator-Validator Fine-tuning
by: Xing, Junjie, et al.
Published: (2024)
by: Xing, Junjie, et al.
Published: (2024)
Vision Language Models for Spreadsheet Understanding: Challenges and Opportunities
by: Xia, Shiyu, et al.
Published: (2024)
by: Xia, Shiyu, et al.
Published: (2024)
Table Meets LLM: Can Large Language Models Understand Structured Table Data? A Benchmark and Empirical Study
by: Sui, Yuan, et al.
Published: (2023)
by: Sui, Yuan, et al.
Published: (2023)
PromptIntern: Saving Inference Costs by Internalizing Recurrent Prompt during Large Language Model Fine-tuning
by: Zou, Jiaru, et al.
Published: (2024)
by: Zou, Jiaru, et al.
Published: (2024)
MMTU: A Massive Multi-Task Table Understanding and Reasoning Benchmark
by: Xing, Junjie, et al.
Published: (2025)
by: Xing, Junjie, et al.
Published: (2025)
TAP4LLM: Table Provider on Sampling, Augmenting, and Packing Semi-structured Data for Large Language Model Reasoning
by: Sui, Yuan, et al.
Published: (2023)
by: Sui, Yuan, et al.
Published: (2023)
Boosting the Generalization and Reasoning of Vision Language Models with Curriculum Reinforcement Learning
by: Deng, Huilin, et al.
Published: (2025)
by: Deng, Huilin, et al.
Published: (2025)
SpreadsheetLLM: Encoding Spreadsheets for Large Language Models
by: Dong, Haoyu, et al.
Published: (2024)
by: Dong, Haoyu, et al.
Published: (2024)
TwT: Thinking without Tokens by Habitual Reasoning Distillation with Multi-Teachers' Guidance
by: Xu, Jingxian, et al.
Published: (2025)
by: Xu, Jingxian, et al.
Published: (2025)
From Task to Tutorial: An Automated GUI Framework for Excel Tutorial Document and Video Creation
by: Xie, Yuhang, et al.
Published: (2025)
by: Xie, Yuhang, et al.
Published: (2025)
CapRL: Stimulating Dense Image Caption Capabilities via Reinforcement Learning
by: Xing, Long, et al.
Published: (2025)
by: Xing, Long, et al.
Published: (2025)
ProRL: Prolonged Reinforcement Learning Expands Reasoning Boundaries in Large Language Models
by: Liu, Mingjie, et al.
Published: (2025)
by: Liu, Mingjie, et al.
Published: (2025)
SupChain-Bench: Benchmarking Large Language Models for Real-World Supply Chain Management
by: Guan, Shengyue, et al.
Published: (2026)
by: Guan, Shengyue, et al.
Published: (2026)
QuRL: Efficient Reinforcement Learning with Quantized Rollout
by: Li, Yuhang, et al.
Published: (2026)
by: Li, Yuhang, et al.
Published: (2026)
ResRL: Boosting LLM Reasoning via Negative Sample Projection Residual Reinforcement Learning
by: Lin, Zihan, et al.
Published: (2026)
by: Lin, Zihan, et al.
Published: (2026)
IPD: Boosting Sequential Policy with Imaginary Planning Distillation in Offline Reinforcement Learning
by: Qin, Yihao, et al.
Published: (2026)
by: Qin, Yihao, et al.
Published: (2026)
SpeakRL: Synergizing Reasoning, Speaking, and Acting in Language Models with Reinforcement Learning
by: Acikgoz, Emre Can, et al.
Published: (2025)
by: Acikgoz, Emre Can, et al.
Published: (2025)
Evolution-based Region Adversarial Prompt Learning for Robustness Enhancement in Vision-Language Models
by: Jia, Xiaojun, et al.
Published: (2025)
by: Jia, Xiaojun, et al.
Published: (2025)
Spatial-SSRL: Enhancing Spatial Understanding via Self-Supervised Reinforcement Learning
by: Liu, Yuhong, et al.
Published: (2025)
by: Liu, Yuhong, et al.
Published: (2025)
GraphReason: Enhancing Reasoning Capabilities of Large Language Models through A Graph-Based Verification Approach
by: Cao, Lang
Published: (2023)
by: Cao, Lang
Published: (2023)
BoostStep: Boosting mathematical capability of Large Language Models via improved single-step reasoning
by: Zhang, Beichen, et al.
Published: (2025)
by: Zhang, Beichen, et al.
Published: (2025)
Attention as a Compass: Efficient Exploration for Process-Supervised RL in Reasoning Models
by: Liu, Runze, et al.
Published: (2025)
by: Liu, Runze, et al.
Published: (2025)
RoiRL: Efficient, Self-Supervised Reasoning with Offline Iterative Reinforcement Learning
by: Arzhantsev, Aleksei, et al.
Published: (2025)
by: Arzhantsev, Aleksei, et al.
Published: (2025)
KET-QA: A Dataset for Knowledge Enhanced Table Question Answering
by: Hu, Mengkang, et al.
Published: (2024)
by: Hu, Mengkang, et al.
Published: (2024)
NavCoT: Boosting LLM-Based Vision-and-Language Navigation via Learning Disentangled Reasoning
by: Lin, Bingqian, et al.
Published: (2024)
by: Lin, Bingqian, et al.
Published: (2024)
RL$^3$: Boosting Meta Reinforcement Learning via RL inside RL$^2$
by: Bhatia, Abhinav, et al.
Published: (2023)
by: Bhatia, Abhinav, et al.
Published: (2023)
UloRL:An Ultra-Long Output Reinforcement Learning Approach for Advancing Large Language Models' Reasoning Abilities
by: Du, Dong, et al.
Published: (2025)
by: Du, Dong, et al.
Published: (2025)
Logic-RL: Unleashing LLM Reasoning with Rule-Based Reinforcement Learning
by: Xie, Tian, et al.
Published: (2025)
by: Xie, Tian, et al.
Published: (2025)
Unlock the Correlation between Supervised Fine-Tuning and Reinforcement Learning in Training Code Large Language Models
by: Chen, Jie, et al.
Published: (2024)
by: Chen, Jie, et al.
Published: (2024)
KnowRL: Boosting LLM Reasoning via Reinforcement Learning with Minimal-Sufficient Knowledge Guidance
by: Yu, Linhao, et al.
Published: (2026)
by: Yu, Linhao, et al.
Published: (2026)
MIRG-RL: Multi-Image Reasoning and Grounding with Reinforcement Learning
by: Zheng, Lihao, et al.
Published: (2025)
by: Zheng, Lihao, et al.
Published: (2025)
RL-BioAug: Label-Efficient Reinforcement Learning for Self-Supervised EEG Representation Learning
by: Lee, Cheol-Hui, et al.
Published: (2026)
by: Lee, Cheol-Hui, et al.
Published: (2026)
Dark matter production accompanied by gravitational wave signals during cosmological phase transitions
by: Xu, Shuocheng, et al.
Published: (2023)
by: Xu, Shuocheng, et al.
Published: (2023)
Reinforced MLLM: A Survey on RL-Based Reasoning in Multimodal Large Language Models
by: Zhou, Guanghao, et al.
Published: (2025)
by: Zhou, Guanghao, et al.
Published: (2025)
Similar Items
-
Not All Tokens Matter: Towards Efficient LLM Reasoning via Token Significance in Reinforcement Learning
by: Liu, Hanbing, et al.
Published: (2025) -
TablePilot: Recommending Human-Preferred Tabular Data Analysis with Large Language Models
by: Yi, Deyin, et al.
Published: (2025) -
Formula-R1: Incentivizing LLM Reasoning over Complex Tables with Numerical Computation via Formula-Driven Reinforcement Learning
by: Cao, Lang, et al.
Published: (2025) -
Jupiter: Enhancing LLM Data Analysis Capabilities via Notebook and Inference-Time Value-Guided Search
by: Li, Shuocheng, et al.
Published: (2025) -
TableLoRA: Low-rank Adaptation on Table Structure Understanding for Large Language Models
by: He, Xinyi, et al.
Published: (2025)