| Main Authors: | Wang, Chao, Yang, Tao, Tian, Hongtao, Shi, Yunsheng, Ma, Qiyao, Liu, Xiaotao, Yao, Ting, Ding, Wenbo |
|---|---|
| Format: | Preprint |
| Published: | 2025 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2509.22115 |
Similar Items
From Faithfulness to Correctness: Generative Reward Models that Think Critically
by: Ma, Qiyao, et al.
Published: (2025)
Discriminative Policy Optimization for Token-Level Reward Models
by: Chen, Hongzhan, et al.
Published: (2025)
CAPO: Towards Enhancing LLM Reasoning through Generative Credit Assignment
by: Xie, Guofu, et al.
Published: (2025)
WeChat-YATT: A Scalable, Simple, Efficient, and Production Ready Training Library
by: Wu, Junyu, et al.
Published: (2025)
G-Core: A Simple, Scalable and Balanced RLHF Trainer
by: Wu, Junyu, et al.
Published: (2025)
Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning
by: Shrivastava, Vaishnavi, et al.
Published: (2025)
Learning More With Less: Sample Efficient Model-Based RL for Loco-Manipulation
by: Hoffman, Benjamin, et al.
Published: (2025)
Less is More: Resource-Efficient Low-Rank Adaptation
by: Tian, Chunlin, et al.
Published: (2025)
More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents
by: Gao, Pengfei, et al.
Published: (2025)
Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective Generation
by: Xie, Guofu, et al.
Published: (2025)
Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning
by: Zhang, Zhi, et al.
Published: (2026)
SAC Flow: Sample-Efficient Reinforcement Learning of Flow-Based Policies via Velocity-Reparameterized Sequential Modeling
by: Zhang, Yixian, et al.
Published: (2025)
Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction
by: Liao, Haicheng, et al.
Published: (2024)
Constrained Dynamics Simulation: More With Less
by: Sathya, Ajay Suresha
Published: (2024)
Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
by: Lou, Chao, et al.
Published: (2024)
More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
by: Tian, Xinyu, et al.
Published: (2025)
Less is More: Efficient Weight Farcasting with 1-Layer Neural Network
by: Shou, Xiao, et al.
Published: (2025)
Less is More: Towards Simple Graph Contrastive Learning
by: Zhao, Yanan, et al.
Published: (2025)
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation
by: Wu, Shaojin, et al.
Published: (2025)
Learning More from Less: Unlocking Internal Representations for Benchmark Compression
by: Zhang, Yueqi, et al.
Published: (2026)
Human-assisted Robotic Policy Refinement via Action Preference Optimization
by: Xia, Wenke, et al.
Published: (2025)
Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning
by: Schulte, David, et al.
Published: (2024)
Beyond Fixed Length: Bucket Pre-training is All You Need
by: Yang, Qing, et al.
Published: (2024)
Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention
by: Yang, Zhen, et al.
Published: (2025)
Sampling More, Getting Less: Calibration is the Diversity Bottleneck in LLMs
by: Banayeeanzade, Amin, et al.
Published: (2026)
Learn More with Less: Uncertainty Consistency Guided Query Selection for RLVR
by: Yi, Hao, et al.
Published: (2026)
From <Answer> to <Think>: Multidimensional Supervision of Reasoning Process for LLM Optimization
by: Wang, Beining, et al.
Published: (2025)
XRec: Large Language Models for Explainable Recommendation
by: Ma, Qiyao, et al.
Published: (2024)
ULMRec: User-centric Large Language Model for Sequential Recommendation
by: Shao, Minglai, et al.
Published: (2024)
Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)
Less is More, More or Less... Finding the Optimal Threshold for Lexicalization in Chunking
by: Balázs Indig
Published: (2017)
Policy Newton Algorithm in Reproducing Kernel Hilbert Space
by: Zhang, Yixian, et al.
Published: (2025)
Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies
by: Fan, Junchao, et al.
Published: (2024)
When Less Is More.
by: Lettis, Lucy
Published: (1998)
Doing More with Less.
by: Wagenveld, Linda M.
Published: (1987)
Learning More from Less: Exploiting Counterfactuals for Data-Efficient Chart Understanding
by: Bao, Jianzhu, et al.
Published: (2026)
Less is More: Decoder-Free Masked Modeling for Efficient Skeleton Representation Learning
by: Do, Jeonghyeok, et al.
Published: (2026)
Less is More: Multimodal Region Representation via Pairwise Inter-view Learning
by: Namgung, Min, et al.
Published: (2025)
Less is More: Adaptive Program Repair with Bug Localization and Preference Learning
by: Dai, Zhenlong, et al.
Published: (2025)
Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension
by: Gong, Wenbo, et al.
Published: (2025)