| Main Authors: | Wang, Haonan, Du, Chao, Kawaguchi, Kenji, Pang, Tianyu |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Online Access: | https://arxiv.org/abs/2512.02874 |
Similar Items
Fostering Video Reasoning via Next-Event Prediction
by: Wang, Haonan, et al.
Published: (2025)
When Precision Meets Position: BFloat16 Breaks Down RoPE in Long-Context Training
by: Wang, Haonan, et al.
Published: (2024)
Reasoning Resides in Layers: Restoring Temporal Reasoning in Video-Language Models with Layer-Selective Merging
by: Fu, Zihang, et al.
Published: (2026)
Variational Reasoning for Language Models
by: Zhou, Xiangxin, et al.
Published: (2025)
Reinforcing General Reasoning without Verifiers
by: Zhou, Xiangxin, et al.
Published: (2025)
OpenSIR: Open-Ended Self-Improving Reasoner
by: Kwan, Wai-Chung, et al.
Published: (2025)
LongSpec: Long-Context Lossless Speculative Decoding with Efficient Drafting and Verification
by: Yang, Penghui, et al.
Published: (2025)
An Answer is just the Start: Related Insight Generation for Open-Ended Document-Grounded QA
by: Sharma, Saransh, et al.
Published: (2026)
Reverse-Engineered Reasoning for Open-Ended Generation
by: Wang, Haozhe, et al.
Published: (2025)
Marco-o1: Towards Open Reasoning Models for Open-Ended Solutions
by: Zhao, Yu, et al.
Published: (2024)
Context Matters: Pushing the Boundaries of Open-Ended Answer Generation with Graph-Structured Knowledge Context
by: Banerjee, Somnath, et al.
Published: (2024)
Scaling Open-Ended Reasoning to Predict the Future
by: Chandak, Nikhil, et al.
Published: (2025)
Scaling Reasoning Tokens via RL and Parallel Thinking: Evidence From Competitive Programming
by: Zhang, Qianfan, et al.
Published: (2026)
IRLBench: A Multi-modal, Culturally Grounded, Parallel Irish-English Benchmark for Open-Ended LLM Reasoning Evaluation
by: Tran, Khanh-Tung, et al.
Published: (2025)
SeedPrints: Fingerprints Can Even Tell Which Seed Your Large Language Model Was Trained From
by: Tong, Yao, et al.
Published: (2025)
Chain of Preference Optimization: Improving Chain-of-Thought Reasoning in LLMs
by: Zhang, Xuan, et al.
Published: (2024)
From Harm to Help: Turning Reasoning In-Context Demos into Assets for Reasoning LMs
by: Wang, Haonan, et al.
Published: (2025)
GuessingGame: Measuring the Informativeness of Open-Ended Questions in Large Language Models
by: Hutson, Dylan, et al.
Published: (2025)
OpenEP: Open-Ended Future Event Prediction
by: Guan, Yong, et al.
Published: (2024)
Jointly Generating and Attributing Answers using Logits of Document-Identifier Tokens
by: Albarede, Lucas, et al.
Published: (2025)
Self-Rewarding Rubric-Based Reinforcement Learning for Open-Ended Reasoning
by: Ye, Zhiling, et al.
Published: (2025)
From Answers to Rationales: Self-Aligning Multimodal Reasoning with Answer-Oriented Chain-of-Thought
by: Tan, Wentao, et al.
Published: (2025)
Optimizing Anytime Reasoning via Budget Relative Policy Optimization
by: Qi, Penghui, et al.
Published: (2025)
PuzzleWorld: A Benchmark for Multimodal, Open-Ended Reasoning in Puzzlehunts
by: Li, Hengzhi, et al.
Published: (2025)
R2-Write: Reflection and Revision for Open-Ended Writing with Deep Reasoning
by: Liu, Wanlong, et al.
Published: (2026)
AHP-Powered LLM Reasoning for Multi-Criteria Evaluation of Open-Ended Responses
by: Lu, Xiaotian, et al.
Published: (2024)
Learning Instruction-Following Policies through Open-Ended Instruction Relabeling with Large Language Models
by: Zhang, Zhicheng, et al.
Published: (2025)
Recursive Think-Answer Process for LLMs and VLMs
by: Lee, Byung-Kwan, et al.
Published: (2026)
Bagpiper: Solving Open-Ended Audio Tasks via Rich Captions
by: Tian, Jinchuan, et al.
Published: (2026)
Improving Open-Ended Text Generation via Adaptive Decoding
by: Zhu, Wenhong, et al.
Published: (2024)
Your Large Language Model is Secretly a Fairness Proponent and You Should Prompt it Like One
by: Li, Tianlin, et al.
Published: (2024)
xVerify: Efficient Answer Verifier for Reasoning Model Evaluations
by: Chen, Ding, et al.
Published: (2025)
Logit Arithmetic Elicits Long Reasoning Capabilities Without Training
by: Zhang, Yunxiang, et al.
Published: (2025)
PrefixMemory-Tuning: Modernizing Prefix-Tuning by Decoupling the Prefix from Attention
by: Wang, Haonan, et al.
Published: (2025)
How Do Answer Tokens Read Reasoning Traces? Self-Reading Patterns in Thinking LLMs for Quantitative Reasoning
by: Chen, Haoyang, et al.
Published: (2026)
Open-Ended Wargames with Large Language Models
by: Hogan, Daniel P., et al.
Published: (2024)
Aligning Large Language Models with Human Opinions through Persona Selection and Value--Belief--Norm Reasoning
by: Long, Do Xuan, et al.
Published: (2023)
AlphaOne: Reasoning Models Thinking Slow and Fast at Test Time
by: Zhang, Junyu, et al.
Published: (2025)
LogitsCoder: Towards Efficient Chain-of-Thought Path Search via Logits Preference Decoding for Code Generation
by: Chen, Jizheng, et al.
Published: (2026)
Agent Smith: A Single Image Can Jailbreak One Million Multimodal LLM Agents Exponentially Fast
by: Gu, Xiangming, et al.
Published: (2024)