Saved in:
| Main Authors: | Liu, Yinqi, Zhu, Yueqi, Zhang, Yongkang, Liu, Feiran, Shen, Yutong, Sun, Yufei, Wang, Xin, Liang, Renzhao, Wang, Yidong, Wang, Cunxiang |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.09504 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models
by: Feng, Andrew Zhuoer, et al.
Published: (2026)
by: Feng, Andrew Zhuoer, et al.
Published: (2026)
DVD: A Robust Method for Detecting Variant Contamination in Large Language Model Evaluation
by: Liang, Renzhao, et al.
Published: (2026)
by: Liang, Renzhao, et al.
Published: (2026)
Deep Literature Survey Automation with an Iterative Workflow
by: Zhang, Hongbo, et al.
Published: (2025)
by: Zhang, Hongbo, et al.
Published: (2025)
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation
by: Liu, Yongkang, et al.
Published: (2024)
by: Liu, Yongkang, et al.
Published: (2024)
TraceSIR: A Multi-Agent Framework for Structured Analysis and Reporting of Agentic Execution Traces
by: Yang, Shu-Xun, et al.
Published: (2026)
by: Yang, Shu-Xun, et al.
Published: (2026)
SHIELD: Evaluation and Defense Strategies for Copyright Compliance in LLM Text Generation
by: Liu, Xiaoze, et al.
Published: (2024)
by: Liu, Xiaoze, et al.
Published: (2024)
A Survey on Evaluation of Large Language Models
by: Chang, Yupeng, et al.
Published: (2023)
by: Chang, Yupeng, et al.
Published: (2023)
HPSS: Heuristic Prompting Strategy Search for LLM Evaluators
by: Wen, Bosi, et al.
Published: (2025)
by: Wen, Bosi, et al.
Published: (2025)
Data Management For Training Large Language Models: A Survey
by: Wang, Zige, et al.
Published: (2023)
by: Wang, Zige, et al.
Published: (2023)
Temporal Self-Rewarding Language Models: Decoupling Chosen-Rejected via Past-Future
by: Wang, Yidong, et al.
Published: (2025)
by: Wang, Yidong, et al.
Published: (2025)
Knowledge Conflicts for LLMs: A Survey
by: Xu, Rongwu, et al.
Published: (2024)
by: Xu, Rongwu, et al.
Published: (2024)
Towards a Unified View of Preference Learning for Large Language Models: A Survey
by: Gao, Bofei, et al.
Published: (2024)
by: Gao, Bofei, et al.
Published: (2024)
UniCBE: An Uniformity-driven Comparing Based Evaluation Framework with Unified Multi-Objective Optimization
by: Yuan, Peiwen, et al.
Published: (2025)
by: Yuan, Peiwen, et al.
Published: (2025)
Nash CoT: Multi-Path Inference with Preference Equilibrium
by: Zhang, Ziqi, et al.
Published: (2024)
by: Zhang, Ziqi, et al.
Published: (2024)
Unlocking Recursive Thinking of LLMs: Alignment via Refinement
by: Zhang, Haoke, et al.
Published: (2025)
by: Zhang, Haoke, et al.
Published: (2025)
LeMix: Unified Scheduling for LLM Training and Inference on Multi-GPU Systems
by: Li, Yufei, et al.
Published: (2025)
by: Li, Yufei, et al.
Published: (2025)
ReflectRM: Boosting Generative Reward Models via Self-Reflection within a Unified Judgment Framework
by: Qin, Kai, et al.
Published: (2026)
by: Qin, Kai, et al.
Published: (2026)
TMD-TTS: A Unified Tibetan Multi-Dialect Text-to-Speech Framework for Ü-Tsang, Amdo and Kham Speech Dataset Generation
by: Liu, Yutong, et al.
Published: (2025)
by: Liu, Yutong, et al.
Published: (2025)
Reasoning on Multiple Needles In A Haystack
by: Wang, Yidong
Published: (2025)
by: Wang, Yidong
Published: (2025)
UniMS-RAG: A Unified Multi-source Retrieval-Augmented Generation for Personalized Dialogue Systems
by: Wang, Hongru, et al.
Published: (2024)
by: Wang, Hongru, et al.
Published: (2024)
MoLAN: A Unified Modality-Aware Noise Dynamic Editing Framework for Multimodal Sentiment Analysis
by: Xu, Xingle, et al.
Published: (2025)
by: Xu, Xingle, et al.
Published: (2025)
RAVEL: Reasoning Agents for Validating and Evaluating LLM Text Synthesis
by: Feng, Andrew Zhuoer, et al.
Published: (2026)
by: Feng, Andrew Zhuoer, et al.
Published: (2026)
AgentDropout: Dynamic Agent Elimination for Token-Efficient and High-Performance LLM-Based Multi-Agent Collaboration
by: Wang, Zhexuan, et al.
Published: (2025)
by: Wang, Zhexuan, et al.
Published: (2025)
IndustryCode: A Benchmark for Industry Code Generation
by: Zeng, Puyu, et al.
Published: (2026)
by: Zeng, Puyu, et al.
Published: (2026)
Minstrel: Structural Prompt Generation with Multi-Agents Coordination for Non-AI Experts
by: Wang, Ming, et al.
Published: (2024)
by: Wang, Ming, et al.
Published: (2024)
RAGLAB: A Modular and Research-Oriented Unified Framework for Retrieval-Augmented Generation
by: Zhang, Xuanwang, et al.
Published: (2024)
by: Zhang, Xuanwang, et al.
Published: (2024)
Loki's Dance of Illusions: A Comprehensive Survey of Hallucination in Large Language Models
by: Li, Chaozhuo, et al.
Published: (2025)
by: Li, Chaozhuo, et al.
Published: (2025)
Long$^2$RAG: Evaluating Long-Context & Long-Form Retrieval-Augmented Generation with Key Point Recall
by: Qi, Zehan, et al.
Published: (2024)
by: Qi, Zehan, et al.
Published: (2024)
F\textsuperscript{2}LP-AP: Fast \& Flexible Label Propagation with Adaptive Propagation Kernel
by: Shen, Yutong, et al.
Published: (2026)
by: Shen, Yutong, et al.
Published: (2026)
DreamReward: Text-to-3D Generation with Human Preference
by: Ye, Junliang, et al.
Published: (2024)
by: Ye, Junliang, et al.
Published: (2024)
Evaluate What You Can't Evaluate: Unassessable Quality for Generated Response
by: Liu, Yongkang, et al.
Published: (2023)
by: Liu, Yongkang, et al.
Published: (2023)
Why Attention Patterns Exist: A Unifying Temporal Perspective Analysis
by: Yang, Qingyue, et al.
Published: (2026)
by: Yang, Qingyue, et al.
Published: (2026)
ChatZero:Zero-shot Cross-Lingual Dialogue Generation via Pseudo-Target Language
by: Liu, Yongkang, et al.
Published: (2024)
by: Liu, Yongkang, et al.
Published: (2024)
Bridging Text and Molecule: A Survey on Multimodal Frameworks for Molecule
by: Xiao, Yi, et al.
Published: (2024)
by: Xiao, Yi, et al.
Published: (2024)
RAGChecker: A Fine-grained Framework for Diagnosing Retrieval-Augmented Generation
by: Ru, Dongyu, et al.
Published: (2024)
by: Ru, Dongyu, et al.
Published: (2024)
TrustUQA: A Trustful Framework for Unified Structured Data Question Answering
by: Zhang, Wen, et al.
Published: (2024)
by: Zhang, Wen, et al.
Published: (2024)
Reasoning Through Execution: Unifying Process and Outcome Rewards for Code Generation
by: Yu, Zhuohao, et al.
Published: (2024)
by: Yu, Zhuohao, et al.
Published: (2024)
TrustJudge: Inconsistencies of LLM-as-a-Judge and How to Alleviate Them
by: Wang, Yidong, et al.
Published: (2025)
by: Wang, Yidong, et al.
Published: (2025)
Towards a Unified View of Large Language Model Post-Training
by: Lv, Xingtai, et al.
Published: (2025)
by: Lv, Xingtai, et al.
Published: (2025)
Survey on Knowledge Distillation for Large Language Models: Methods, Evaluation, and Application
by: Yang, Chuanpeng, et al.
Published: (2024)
by: Yang, Chuanpeng, et al.
Published: (2024)
Similar Items
-
RLAR: An Agentic Reward System for Multi-task Reinforcement Learning on Large Language Models
by: Feng, Andrew Zhuoer, et al.
Published: (2026) -
DVD: A Robust Method for Detecting Variant Contamination in Large Language Model Evaluation
by: Liang, Renzhao, et al.
Published: (2026) -
Deep Literature Survey Automation with an Iterative Workflow
by: Zhang, Hongbo, et al.
Published: (2025) -
A Unified Data Augmentation Framework for Low-Resource Multi-Domain Dialogue Generation
by: Liu, Yongkang, et al.
Published: (2024) -
TraceSIR: A Multi-Agent Framework for Structured Analysis and Reporting of Agentic Execution Traces
by: Yang, Shu-Xun, et al.
Published: (2026)