Saved in:
| Main Authors: | Li, Sunzhu, Zhao, Jiale, Wei, Miteto, Ren, Huimin, Zhou, Yang, Yang, Jingwen, Liu, Shunyu, Zhang, Kaike, Chen, Wei |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.08430 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
by: Zhou, Yang, et al.
Published: (2025)
by: Zhou, Yang, et al.
Published: (2025)
ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization
by: Li, Sunzhu, et al.
Published: (2025)
by: Li, Sunzhu, et al.
Published: (2025)
RubricBench: Aligning Model-Generated Rubrics with Human Standards
by: Zhang, Qiyuan, et al.
Published: (2026)
by: Zhang, Qiyuan, et al.
Published: (2026)
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
by: Liu, Tianci, et al.
Published: (2025)
by: Liu, Tianci, et al.
Published: (2025)
Decoding the Ear: A Framework for Objectifying Expressiveness from Human Preference Through Efficient Alignment
by: Lin, Zhiyu, et al.
Published: (2025)
by: Lin, Zhiyu, et al.
Published: (2025)
Open Rubric System: Scaling Reinforcement Learning with Pairwise Adaptive Rubric
by: Jia, Ruipeng, et al.
Published: (2026)
by: Jia, Ruipeng, et al.
Published: (2026)
Rubrics.
by: Callison, Daniel
Published: (2000)
by: Callison, Daniel
Published: (2000)
When and What to Ask: AskBench and Rubric-Guided RLVR for LLM Clarification
by: Zhao, Jiale, et al.
Published: (2026)
by: Zhao, Jiale, et al.
Published: (2026)
iRULER: Intelligible Rubric-Based User-Defined LLM Evaluation for Revision
by: Bai, Jingwen, et al.
Published: (2026)
by: Bai, Jingwen, et al.
Published: (2026)
AutoRubric: Rubric-Based Generative Rewards for Faithful Multimodal Reasoning
by: Jia, Mengzhao, et al.
Published: (2025)
by: Jia, Mengzhao, et al.
Published: (2025)
PresentBench: A Fine-Grained Rubric-Based Benchmark for Slide Generation
by: Chen, Xin-Sheng, et al.
Published: (2026)
by: Chen, Xin-Sheng, et al.
Published: (2026)
Auto-Rubric: Learning From Implicit Weights to Explicit Rubrics for Reward Modeling
by: Xie, Lipeng, et al.
Published: (2025)
by: Xie, Lipeng, et al.
Published: (2025)
EvoRubric: Self-Evolving Rubric-Driven RL for Open-Ended Generation
by: Guan, Xin, et al.
Published: (2026)
by: Guan, Xin, et al.
Published: (2026)
Learning Query-Specific Rubrics from Human Preferences for DeepResearch Report Generation
by: Lv, Changze, et al.
Published: (2026)
by: Lv, Changze, et al.
Published: (2026)
AMARIS: A Memory-Augmented Rubric Improvement System for Rubric-Based Reinforcement Learning
by: Wu, Peilin, et al.
Published: (2026)
by: Wu, Peilin, et al.
Published: (2026)
Reinforcement Learning with Rubric Anchors
by: Huang, Zenan, et al.
Published: (2025)
by: Huang, Zenan, et al.
Published: (2025)
RubricRAG: Towards Interpretable and Reliable LLM Evaluation via Domain Knowledge Retrieval for Rubric Generation
by: Dhole, Kaustubh D., et al.
Published: (2026)
by: Dhole, Kaustubh D., et al.
Published: (2026)
A Comprehensive Rubric for Annotating Pathological Speech
by: Corrales-Astorgano, Mario, et al.
Published: (2024)
by: Corrales-Astorgano, Mario, et al.
Published: (2024)
Automated Rubrics for Reliable Evaluation of Medical Dialogue Systems
by: Chen, Yinzhu, et al.
Published: (2026)
by: Chen, Yinzhu, et al.
Published: (2026)
RubricEval: A Rubric-Level Meta-Evaluation Benchmark for LLM Judges in Instruction Following
by: Pan, Tianjun, et al.
Published: (2026)
by: Pan, Tianjun, et al.
Published: (2026)
Confusion-Aware Rubric Optimization for LLM-based Automated Grading
by: Chu, Yucheng, et al.
Published: (2026)
by: Chu, Yucheng, et al.
Published: (2026)
Beyond Verifiable Rewards: Rubric-Based GRM for Reinforced Fine-Tuning SWE Agents
by: Huang, Jiawei, et al.
Published: (2026)
by: Huang, Jiawei, et al.
Published: (2026)
EvoLM: Self-Evolving Language Models through Co-Evolved Discriminative Rubrics
by: Li, Shuyue Stella, et al.
Published: (2026)
by: Li, Shuyue Stella, et al.
Published: (2026)
Not Every Rubric Teaches Equally: Policy-Aware Rubric Rewards for RLVR
by: Tyagi, Utkarsh, et al.
Published: (2026)
by: Tyagi, Utkarsh, et al.
Published: (2026)
ResearchRubrics: A Benchmark of Prompts and Rubrics For Evaluating Deep Research Agents
by: Sharma, Manasi, et al.
Published: (2025)
by: Sharma, Manasi, et al.
Published: (2025)
Preference-Aware Rubric Learning for Personalized Evaluation
by: Qiu, Yilun, et al.
Published: (2026)
by: Qiu, Yilun, et al.
Published: (2026)
RubricEM: Meta-RL with Rubric-guided Policy Decomposition beyond Verifiable Rewards
by: Li, Gaotang, et al.
Published: (2026)
by: Li, Gaotang, et al.
Published: (2026)
CDRRM: Contrast-Driven Rubric Generation for Reliable and Interpretable Reward Modeling
by: Liu, Dengcan, et al.
Published: (2026)
by: Liu, Dengcan, et al.
Published: (2026)
Deep Research as Rubric for Reinforcement Learning
by: Mei, Wangyi, et al.
Published: (2026)
by: Mei, Wangyi, et al.
Published: (2026)
LP-Eval: Rubric and Dataset for Measuring the Quality of Legal Proposition Generation
by: Xu, Shanshan, et al.
Published: (2026)
by: Xu, Shanshan, et al.
Published: (2026)
Rubric-Guided Process Reward for Stepwise Model Routing
by: Ye, Shenghao, et al.
Published: (2026)
by: Ye, Shenghao, et al.
Published: (2026)
Visual Preference Optimization with Rubric Rewards
by: Yu, Ya-Qi, et al.
Published: (2026)
by: Yu, Ya-Qi, et al.
Published: (2026)
Reinforcement Learning with Robust Rubric Rewards
by: Yu, Ya-Qi, et al.
Published: (2026)
by: Yu, Ya-Qi, et al.
Published: (2026)
Optimsyn: Influence-Guided Rubrics Optimization for Synthetic Data Generation
by: Fan, Zhiting, et al.
Published: (2026)
by: Fan, Zhiting, et al.
Published: (2026)
AdaRubric: Task-Adaptive Rubrics for Reliable LLM Agent Evaluation and Reward Learning
by: Ding, Liang
Published: (2026)
by: Ding, Liang
Published: (2026)
ARES: Automated Rubric Synthesis for Scalable LLM Reinforcement Learning
by: Li, Xiaoyuan, et al.
Published: (2026)
by: Li, Xiaoyuan, et al.
Published: (2026)
SedarEval: Automated Evaluation using Self-Adaptive Rubrics
by: Fan, Zhiyuan, et al.
Published: (2025)
by: Fan, Zhiyuan, et al.
Published: (2025)
Automated Refinement of Essay Scoring Rubrics for Language Models via Reflect-and-Revise
by: Harada, Keno, et al.
Published: (2025)
by: Harada, Keno, et al.
Published: (2025)
Robust Reward Modeling via Causal Rubrics
by: Srivastava, Pragya, et al.
Published: (2025)
by: Srivastava, Pragya, et al.
Published: (2025)
Rubric-based On-policy Distillation
by: Fang, Junfeng, et al.
Published: (2026)
by: Fang, Junfeng, et al.
Published: (2026)
Similar Items
-
Breaking the Exploration Bottleneck: Rubric-Scaffolded Reinforcement Learning for General LLM Reasoning
by: Zhou, Yang, et al.
Published: (2025) -
ThinkPilot: Steering Reasoning Models via Automated Think-prefixes Optimization
by: Li, Sunzhu, et al.
Published: (2025) -
RubricBench: Aligning Model-Generated Rubrics with Human Standards
by: Zhang, Qiyuan, et al.
Published: (2026) -
OpenRubrics: Towards Scalable Synthetic Rubric Generation for Reward Modeling and LLM Alignment
by: Liu, Tianci, et al.
Published: (2025) -
Decoding the Ear: A Framework for Objectifying Expressiveness from Human Preference Through Efficient Alignment
by: Lin, Zhiyu, et al.
Published: (2025)