:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Chao, Yang, Tao, Tian, Hongtao, Shi, Yunsheng, Ma, Qiyao, Liu, Xiaotao, Yao, Ting, Ding, Wenbo
Format:	Preprint
Published:	2025
Subjects:	Machine Learning Artificial Intelligence
Online Access:	https://arxiv.org/abs/2509.22115
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

From Faithfulness to Correctness: Generative Reward Models that Think Critically
by: Ma, Qiyao, et al.
Published: (2025)

Discriminative Policy Optimization for Token-Level Reward Models
by: Chen, Hongzhan, et al.
Published: (2025)

CAPO: Towards Enhancing LLM Reasoning through Generative Credit Assignment
by: Xie, Guofu, et al.
Published: (2025)

WeChat-YATT: A Scalable, Simple, Efficient, and Production Ready Training Library
by: Wu, Junyu, et al.
Published: (2025)

G-Core: A Simple, Scalable and Balanced RLHF Trainer
by: Wu, Junyu, et al.
Published: (2025)

Sample More to Think Less: Group Filtered Policy Optimization for Concise Reasoning
by: Shrivastava, Vaishnavi, et al.
Published: (2025)

Learning More With Less: Sample Efficient Model-Based RL for Loco-Manipulation
by: Hoffman, Benjamin, et al.
Published: (2025)

Less is More: Resource-Efficient Low-Rank Adaptation
by: Tian, Chunlin, et al.
Published: (2025)

More with Less: An Empirical Study of Turn-Control Strategies for Efficient Coding Agents
by: Gao, Pengfei, et al.
Published: (2025)

Bone Soups: A Seek-and-Soup Model Merging Approach for Controllable Multi-Objective Generation
by: Xie, Guofu, et al.
Published: (2025)

Train Less, Learn More: Adaptive Efficient Rollout Optimization for Group-Based Reinforcement Learning
by: Zhang, Zhi, et al.
Published: (2026)

SAC Flow: Sample-Efficient Reinforcement Learning of Flow-Based Policies via Velocity-Reparameterized Sequential Modeling
by: Zhang, Yixian, et al.
Published: (2025)

Less is More: Efficient Brain-Inspired Learning for Autonomous Driving Trajectory Prediction
by: Liao, Haicheng, et al.
Published: (2024)

Constrained Dynamics Simulation: More With Less
by: Sathya, Ajay Suresha
Published: (2024)

Sparser is Faster and Less is More: Efficient Sparse Attention for Long-Range Transformers
by: Lou, Chao, et al.
Published: (2024)

More Thought, Less Accuracy? On the Dual Nature of Reasoning in Vision-Language Models
by: Tian, Xinyu, et al.
Published: (2025)

Less is More: Efficient Weight Farcasting with 1-Layer Neural Network
by: Shou, Xiao, et al.
Published: (2025)

Less is More: Towards Simple Graph Contrastive Learning
by: Zhao, Yanan, et al.
Published: (2025)

Less-to-More Generalization: Unlocking More Controllability by In-Context Generation
by: Wu, Shaojin, et al.
Published: (2025)

Learning More from Less: Unlocking Internal Representations for Benchmark Compression
by: Zhang, Yueqi, et al.
Published: (2026)

Human-assisted Robotic Policy Refinement via Action Preference Optimization
by: Xia, Wenke, et al.
Published: (2025)

Less is More: Parameter-Efficient Selection of Intermediate Tasks for Transfer Learning
by: Schulte, David, et al.
Published: (2024)

Beyond Fixed Length: Bucket Pre-training is All You Need
by: Yang, Qing, et al.
Published: (2024)

Less is More: Improving LLM Reasoning with Minimal Test-Time Intervention
by: Yang, Zhen, et al.
Published: (2025)

Sampling More, Getting Less: Calibration is the Diversity Bottleneck in LLMs
by: Banayeeanzade, Amin, et al.
Published: (2026)

Learn More with Less: Uncertainty Consistency Guided Query Selection for RLVR
by: Yi, Hao, et al.
Published: (2026)

From <Answer> to <Think>: Multidimensional Supervision of Reasoning Process for LLM Optimization
by: Wang, Beining, et al.
Published: (2025)

XRec: Large Language Models for Explainable Recommendation
by: Ma, Qiyao, et al.
Published: (2024)

ULMRec: User-centric Large Language Model for Sequential Recommendation
by: Shao, Minglai, et al.
Published: (2024)

Offline Multi-Agent Reinforcement Learning via In-Sample Sequential Policy Optimization
by: Liu, Zongkai, et al.
Published: (2024)

Less is More, More or Less... Finding the Optimal Threshold for Lexicalization in Chunking
by: Balázs Indig
Published: (2017)

Policy Newton Algorithm in Reproducing Kernel Hilbert Space
by: Zhang, Yixian, et al.
Published: (2025)

Less is More: A Stealthy and Efficient Adversarial Attack Method for DRL-based Autonomous Driving Policies
by: Fan, Junchao, et al.
Published: (2024)

When Less Is More.
by: Lettis, Lucy
Published: (1998)

Doing More with Less.
by: Wagenveld, Linda M.
Published: (1987)

Learning More from Less: Exploiting Counterfactuals for Data-Efficient Chart Understanding
by: Bao, Jianzhu, et al.
Published: (2026)

Less is More: Decoder-Free Masked Modeling for Efficient Skeleton Representation Learning
by: Do, Jeonghyeok, et al.
Published: (2026)

Less is More: Multimodal Region Representation via Pairwise Inter-view Learning
by: Namgung, Min, et al.
Published: (2025)

Less is More: Adaptive Program Repair with Bug Localization and Preference Learning
by: Dai, Zhenlong, et al.
Published: (2025)

Towards Efficient Optimizer Design for LLM via Structured Fisher Approximation with a Low-Rank Extension
by: Gong, Wenbo, et al.
Published: (2025)