Bibliographic Details
Main Authors: Cheng, Zhoujun; Fan, Richard; Hao, Shibo; Killian, Taylor W.; Li, Haonan; Sun, Suqi; Ren, Hector; Moreno, Alexander; Zhang, Daqian; Zhong, Tianjun; Xiong, Yuxin; Hu, Yuanzhe; Xie, Yutao; Han, Xudong; Wang, Yuqi; Pimpalkhute, Varad; Zhuang, Yonghao; Singh, Aaryamonvikram; Liang, Xuezhi; Xie, Anze; She, Jianshu; Fan, Desai; Gao, Chengqian; Ma, Liqun; Yurochkin, Mikhail; Maggs, John; Ma, Xuezhe; He, Guowei; Hu, Zhiting; Liu, Zhengzhong; Xing, Eric P.
Format: Preprint
Published: 2025
Subjects: Machine Learning
Online Access: https://arxiv.org/abs/2509.07604

Similar Items

  • IsoCompute Playbook: Optimally Scaling Sampling Compute for LLM RL
    by: Cheng, Zhoujun, et al.
    Published: (2026)
  • K2-V2: A 360-Open, Reasoning-Enhanced LLM
    by: K2 Team, et al.
    Published: (2025)
  • Concise Reasoning in the Lens of Lagrangian Optimization
    by: Gao, Chengqian, et al.
    Published: (2025)
  • Efficient Agentic Reasoning Through Self-Regulated Simulative Planning
    by: Deng, Mingkai, et al.
    Published: (2026)
  • Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective
    by: Cheng, Zhoujun, et al.
    Published: (2025)
