Saved in:
| Main Authors: | Zhang, Yueqi, Hu, Jin, Feng, Shaoxiong, Yuan, Peiwen, Wang, Xinglin, Li, Yiwei, Shi, Jiayi, Tan, Chuyi, Zhang, Ji, Pan, Boyuan, Hu, Yao, Li, Kan |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.00710 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
LLM-Powered Benchmark Factory: Reliable, Generic, and Efficient
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
PatternKV: Flattening KV Representation Expands Quantization Headroom
by: Zhang, Ji, et al.
Published: (2025)
by: Zhang, Ji, et al.
Published: (2025)
UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Every Rollout Counts: Optimal Resource Allocation for Efficient Test-Time Scaling
by: Wang, Xinglin, et al.
Published: (2025)
by: Wang, Xinglin, et al.
Published: (2025)
Mind the Quote: Enabling Quotation-Aware Dialogue in LLMs via Plug-and-Play Modules
by: Zhang, Yueqi, et al.
Published: (2025)
by: Zhang, Yueqi, et al.
Published: (2025)
From Sub-Ability Diagnosis to Human-Aligned Generation: Bridging the Gap for Text Length Control via MARKERGEN
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Revisiting Self-Consistency from Dynamic Distributional Alignment Perspective on Answer Aggregation
by: Li, Yiwei, et al.
Published: (2025)
by: Li, Yiwei, et al.
Published: (2025)
Diagnosing and Mitigating System Bias in Self-Rewarding RL
by: Tan, Chuyi, et al.
Published: (2025)
by: Tan, Chuyi, et al.
Published: (2025)
Do Not Waste Your Rollouts: Recycling Search Experience for Efficient Test-Time Scaling
by: Wang, Xinglin, et al.
Published: (2026)
by: Wang, Xinglin, et al.
Published: (2026)
Speculative Decoding for Multi-Sample Inference
by: Li, Yiwei, et al.
Published: (2025)
by: Li, Yiwei, et al.
Published: (2025)
Make Every Penny Count: Difficulty-Adaptive Self-Consistency for Cost-Efficient Reasoning
by: Wang, Xinglin, et al.
Published: (2024)
by: Wang, Xinglin, et al.
Published: (2024)
InsBank: Evolving Instruction Subset for Ongoing Alignment
by: Shi, Jiayi, et al.
Published: (2025)
by: Shi, Jiayi, et al.
Published: (2025)
On Time, Within Budget: Constraint-Driven Online Resource Allocation for Agentic Workflows
by: Wang, Xinglin, et al.
Published: (2026)
by: Wang, Xinglin, et al.
Published: (2026)
Focused Large Language Models are Stable Many-Shot Learners
by: Yuan, Peiwen, et al.
Published: (2024)
by: Yuan, Peiwen, et al.
Published: (2024)
Instruction Embedding: Latent Representations of Instructions Towards Task Identification
by: Li, Yiwei, et al.
Published: (2024)
by: Li, Yiwei, et al.
Published: (2024)
CogLM: Tracking Cognitive Development of Large Language Models
by: Wang, Xinglin, et al.
Published: (2024)
by: Wang, Xinglin, et al.
Published: (2024)
Poor-Supervised Evaluation for SuperLLM via Mutual Consistency
by: Yuan, Peiwen, et al.
Published: (2024)
by: Yuan, Peiwen, et al.
Published: (2024)
Integrate the Essence and Eliminate the Dross: Fine-Grained Self-Consistency for Free-Form Language Generation
by: Wang, Xinglin, et al.
Published: (2024)
by: Wang, Xinglin, et al.
Published: (2024)
BatchEval: Towards Human-like Text Evaluation
by: Yuan, Peiwen, et al.
Published: (2023)
by: Yuan, Peiwen, et al.
Published: (2023)
Escape Sky-high Cost: Early-stopping Self-Consistency for Multi-step Reasoning
by: Li, Yiwei, et al.
Published: (2024)
by: Li, Yiwei, et al.
Published: (2024)
Generative Dense Retrieval: Memory Can Be a Burden
by: Yuan, Peiwen, et al.
Published: (2024)
by: Yuan, Peiwen, et al.
Published: (2024)
Less is More: Revisiting the Gaussian Mechanism for Differential Privacy
by: Ji, Tianxi, et al.
Published: (2023)
by: Ji, Tianxi, et al.
Published: (2023)
Stitch and Tell: A Structured Multimodal Data Augmentation Method for Spatial Understanding
by: Yin, Hang, et al.
Published: (2025)
by: Yin, Hang, et al.
Published: (2025)
When Less is More: The LLM Scaling Paradox in Context Compression
by: Guo, Ruishan, et al.
Published: (2026)
by: Guo, Ruishan, et al.
Published: (2026)
Less-to-More Generalization: Unlocking More Controllability by In-Context Generation
by: Wu, Shaojin, et al.
Published: (2025)
by: Wu, Shaojin, et al.
Published: (2025)
Less Is More -- Until It Breaks: Security Pitfalls of Vision Token Compression in Large Vision-Language Models
by: Zhang, Xiaomei, et al.
Published: (2026)
by: Zhang, Xiaomei, et al.
Published: (2026)
Less is More: On the Importance of Data Quality for Unit Test Generation
by: Zhang, Junwei, et al.
Published: (2025)
by: Zhang, Junwei, et al.
Published: (2025)
Less is More: Pseudo-Label Filtering for Continual Test-Time Adaptation
by: Tan, Jiayao, et al.
Published: (2024)
by: Tan, Jiayao, et al.
Published: (2024)
Look Less, Reason More: Rollout-Guided Adaptive Pixel-Space Reasoning
by: Li, Xuchen, et al.
Published: (2025)
by: Li, Xuchen, et al.
Published: (2025)
Say More with Less: Understanding Prompt Learning Behaviors through Gist Compression
by: Li, Xinze, et al.
Published: (2024)
by: Li, Xinze, et al.
Published: (2024)
Select Less, Reason More: Prioritizing Evidence Purity for Video Reasoning
by: Li, Xuchen, et al.
Published: (2025)
by: Li, Xuchen, et al.
Published: (2025)
Less is More: DocString Compression in Code Generation
by: Yang, Guang, et al.
Published: (2024)
by: Yang, Guang, et al.
Published: (2024)
Less Is More: Elevating RAG via Performance-Driven Context Compression
by: Cui, Ziqiang, et al.
Published: (2025)
by: Cui, Ziqiang, et al.
Published: (2025)
Learning to Compress: Unlocking the Potential of Large Language Models for Text Representation
by: Zhang, Yeqin, et al.
Published: (2025)
by: Zhang, Yeqin, et al.
Published: (2025)
Sell More, Play Less: Benchmarking LLM Realistic Selling Skill
by: Su, Xuanbo, et al.
Published: (2026)
by: Su, Xuanbo, et al.
Published: (2026)
CoS++: Towards More General and Explicit Implementations for Sampling High-Order Feynman Diagrammatic Series
by: Shi, Boyuan
Published: (2025)
by: Shi, Boyuan
Published: (2025)
LIMI: Less is More for Agency
by: Xiao, Yang, et al.
Published: (2025)
by: Xiao, Yang, et al.
Published: (2025)
Dynamic Stochastic Decoding Strategy for Open-Domain Dialogue Generation
by: Li, Yiwei, et al.
Published: (2024)
by: Li, Yiwei, et al.
Published: (2024)
Similar Items
-
LLM-Powered Benchmark Factory: Reliable, Generic, and Efficient
by: Yuan, Peiwen, et al.
Published: (2025) -
Silencer: From Discovery to Mitigation of Self-Bias in LLM-as-Benchmark-Generator
by: Yuan, Peiwen, et al.
Published: (2025) -
Beyond One-Size-Fits-All: Tailored Benchmarks for Efficient Evaluation
by: Yuan, Peiwen, et al.
Published: (2025) -
PatternKV: Flattening KV Representation Expands Quantization Headroom
by: Zhang, Ji, et al.
Published: (2025) -
UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization
by: Yuan, Peiwen, et al.
Published: (2025)