Saved in:
| Main Authors: | Wang, Xinglin, Shi, Jiayi, Feng, Shaoxiong, Yuan, Peiwen, Li, Yiwei, Zhang, Yueqi, Tan, Chuyi, Zhang, Ji, Pan, Boyuan, Hu, Yao, Li, Kan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.21684 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
by: Wang, Xinglin, et al.
Published: (2025)
by: Wang, Xinglin, et al.
Published: (2025)
LLM-Powered Benchmark Factory: Reliable, Generic, and Efficient
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
by: Wang, Xinglin, et al.
Published: (2024)
by: Wang, Xinglin, et al.
Published: (2024)
UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules
by: Zhang, Yueqi, et al.
Published: (2025)
by: Zhang, Yueqi, et al.
Published: (2025)
From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
by: Li, Yiwei, et al.
Published: (2025)
by: Li, Yiwei, et al.
Published: (2025)
Diagnosing and Mitigating System Bias in Self-Rewarding RL
by: Tan, Chuyi, et al.
Published: (2025)
by: Tan, Chuyi, et al.
Published: (2025)
PatternKV: Flattening KV Representation Expands Quantization Headroom
by: Zhang, Ji, et al.
Published: (2025)
by: Zhang, Ji, et al.
Published: (2025)
Speculative Decoding for Multi-Sample Inference
by: Li, Yiwei, et al.
Published: (2025)
by: Li, Yiwei, et al.
Published: (2025)
InsBank: Evolving Instruction Subset for Ongoing Alignment
by: Shi, Jiayi, et al.
Published: (2025)
by: Shi, Jiayi, et al.
Published: (2025)
Focused Large Language Models are Stable Many-Shot Learners
by: Yuan, Peiwen, et al.
Published: (2024)
by: Yuan, Peiwen, et al.
Published: (2024)
On Time, Within Budget: Constraint-Driven Online Resource Allocation for Agentic Workflows
by: Wang, Xinglin, et al.
Published: (2026)
by: Wang, Xinglin, et al.
Published: (2026)
Learning More from Less: Unlocking Internal Representations for Benchmark Compression
by: Zhang, Yueqi, et al.
Published: (2026)
by: Zhang, Yueqi, et al.
Published: (2026)
Instruction Embedding: Latent Representations of Instructions Towards Task Identification
by: Li, Yiwei, et al.
Published: (2024)
by: Li, Yiwei, et al.
Published: (2024)
BatchEval: Towards Human-like Text Evaluation
by: Yuan, Peiwen, et al.
Published: (2023)
by: Yuan, Peiwen, et al.
Published: (2023)
CogLM: Tracking Cognitive Development of Large Language Models
by: Wang, Xinglin, et al.
Published: (2024)
by: Wang, Xinglin, et al.
Published: (2024)
Poor-Supervised Evaluation for SuperLLM via Mutual Consistency
by: Yuan, Peiwen, et al.
Published: (2024)
by: Yuan, Peiwen, et al.
Published: (2024)
Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation
by: Wang, Xinglin, et al.
Published: (2024)
by: Wang, Xinglin, et al.
Published: (2024)
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning
by: Li, Yiwei, et al.
Published: (2024)
by: Li, Yiwei, et al.
Published: (2024)
Generative Dense Retrieval: Memory Can Be a Burden
by: Yuan, Peiwen, et al.
Published: (2024)
by: Yuan, Peiwen, et al.
Published: (2024)
Test-Time Scaling of Reasoning Models for Machine Translation
by: Li, Zihao, et al.
Published: (2025)
by: Li, Zihao, et al.
Published: (2025)
Each Prompt Matters: Scaling Reinforcement Learning Without Wasting Rollouts on Hundred-Billion-Scale MoE
by: Zeng, Anxiang, et al.
Published: (2025)
by: Zeng, Anxiang, et al.
Published: (2025)
Closed‐Loop Recyclable Vitrimer Plastics from PET Waste: A Design for Circularity
by: Mary K. Danielson, et al.
Published: (2025)
by: Mary K. Danielson, et al.
Published: (2025)
Alcoholysis of Silicone Rubber Waste: A Method for Efficiently Recycling Silicone Rubber Waste and Preparing Multifunctional Rubber Additive
by: Chongtao Zhang, et al.
Published: (2025)
by: Chongtao Zhang, et al.
Published: (2025)
Stitch and Tell: A Structured Multimodal Data Augmentation Method for Spatial Understanding
by: Yin, Hang, et al.
Published: (2025)
by: Yin, Hang, et al.
Published: (2025)
Music Education With GenAI : Exploring the Mediating Roles of Enjoyment Between Smart Service Interactional Experience and Behavioural Intention
by: Peiwen Li
Published: (2025)
by: Peiwen Li
Published: (2025)
Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
by: Li, Yiwei, et al.
Published: (2024)
by: Li, Yiwei, et al.
Published: (2024)
Front Cover: Closed‐Loop Recyclable Vitrimer Plastics from PET Waste: A Design for Circularity (ChemSusChem 18/2025)
by: Mary K. Danielson, et al.
Published: (2025)
by: Mary K. Danielson, et al.
Published: (2025)
Efficient Plastic Waste Recycling Using Polymer
by: Vidushi Nain, et al.
Published: (2024)
by: Vidushi Nain, et al.
Published: (2024)
Rotated Multi-Scale Interaction Network for Referring Remote Sensing Image Segmentation
by: Liu, Sihan, et al.
Published: (2023)
by: Liu, Sihan, et al.
Published: (2023)
LArctan-SKAN: Simple and Efficient Single-Parameterized Kolmogorov-Arnold Networks using Learnable Trigonometric Function
by: Chen, Zhijie, et al.
Published: (2024)
by: Chen, Zhijie, et al.
Published: (2024)
Architectural Scaling Surpass Basis Complexity? Efficient KANs with Single-Parameter Design
by: Chen, Zhijie, et al.
Published: (2024)
by: Chen, Zhijie, et al.
Published: (2024)
Organic Waste Recycling
by: Polprasert, Chongrak
Published: (2017)
by: Polprasert, Chongrak
Published: (2017)
Fractional quantum anomalous Hall and anyon density-wave halo in a minimal interacting lattice model of twisted bilayer MoTe$_2$
by: Tuo, Chuyi, et al.
Published: (2025)
by: Tuo, Chuyi, et al.
Published: (2025)
Video-T1: Test-Time Scaling for Video Generation
by: Liu, Fangfu, et al.
Published: (2025)
by: Liu, Fangfu, et al.
Published: (2025)
Synergistic Effects of Electricity and Light for Efficient Iron‐Catalyzed Recycling of Polystyrene Waste
by: Maxime Hourtoule, et al.
Published: (2026)
by: Maxime Hourtoule, et al.
Published: (2026)
CuSearch: Curriculum Rollout Sampling via Search Depth for Agentic RAG
by: Shen, Jianghan, et al.
Published: (2026)
by: Shen, Jianghan, et al.
Published: (2026)
Similar Items
-
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
by: Wang, Xinglin, et al.
Published: (2025) -
LLM-Powered Benchmark Factory: Reliable, Generic, and Efficient
by: Yuan, Peiwen, et al.
Published: (2025) -
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
by: Yuan, Peiwen, et al.
Published: (2025) -
Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
by: Wang, Xinglin, et al.
Published: (2024) -
UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization
by: Yuan, Peiwen, et al.
Published: (2025)