Saved in:
| Main Authors: | Xu, Mingyuan, Tan, Xinzi, Wu, Jiawei, Zhou, Doudou |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2601.21817 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Ranking Large Language Models without Ground Truth
by: Dhurandhar, Amit, et al.
Published: (2024)
by: Dhurandhar, Amit, et al.
Published: (2024)
From Hawkes Processes to Attention: Time-Modulated Mechanisms for Event Sequences
by: Tan, Xinzi, et al.
Published: (2026)
by: Tan, Xinzi, et al.
Published: (2026)
Evaluating Model Explanations without Ground Truth
by: Rawal, Kaivalya, et al.
Published: (2025)
by: Rawal, Kaivalya, et al.
Published: (2025)
Calibration without Ground Truth
by: Kong, Yuqing, et al.
Published: (2026)
by: Kong, Yuqing, et al.
Published: (2026)
Self-Compatibility: Evaluating Causal Discovery without Ground Truth
by: Faller, Philipp M., et al.
Published: (2023)
by: Faller, Philipp M., et al.
Published: (2023)
Learning Sequential Decisions from Multiple Sources via Group-Robust Markov Decision Processes
by: Xu, Mingyuan, et al.
Published: (2026)
by: Xu, Mingyuan, et al.
Published: (2026)
REAL: Regression-Aware Reinforcement Learning for LLM-as-a-Judge
by: Zhang, Yasi, et al.
Published: (2026)
by: Zhang, Yasi, et al.
Published: (2026)
Consensus Knowledge Graph Learning via Multi-view Sparse Low Rank Block Model
by: Cai, Tianxi, et al.
Published: (2022)
by: Cai, Tianxi, et al.
Published: (2022)
SAGE: Scalable Ground Truth Evaluations for Large Sparse Autoencoders
by: Venhoff, Constantin, et al.
Published: (2024)
by: Venhoff, Constantin, et al.
Published: (2024)
A Trainable Centrality Framework for Modern Data
by: Vu, Minh Duc, et al.
Published: (2025)
by: Vu, Minh Duc, et al.
Published: (2025)
Time-Aware Attention for Enhanced Electronic Health Records Modeling
by: Yu, Junhan, et al.
Published: (2025)
by: Yu, Junhan, et al.
Published: (2025)
Fairness Evaluation for Uplift Modeling in the Absence of Ground Truth
by: Kadioglu, Serdar, et al.
Published: (2024)
by: Kadioglu, Serdar, et al.
Published: (2024)
Learning-based Sketches for Frequency Estimation in Data Streams without Ground Truth
by: Yuan, Xinyu, et al.
Published: (2024)
by: Yuan, Xinyu, et al.
Published: (2024)
GT-Space: Enhancing Heterogeneous Collaborative Perception with Ground Truth Feature Space
by: Wang, Wentao, et al.
Published: (2026)
by: Wang, Wentao, et al.
Published: (2026)
Diff-eRank: A Novel Rank-Based Metric for Evaluating Large Language Models
by: Wei, Lai, et al.
Published: (2024)
by: Wei, Lai, et al.
Published: (2024)
Diversity-Aware Policy Optimization for Large Language Model Reasoning
by: Yao, Jian, et al.
Published: (2025)
by: Yao, Jian, et al.
Published: (2025)
HyperNet Fields: Efficiently Training Hypernetworks without Ground Truth by Learning Weight Trajectories
by: Hedlin, Eric, et al.
Published: (2024)
by: Hedlin, Eric, et al.
Published: (2024)
ORCE: Order-Aware Alignment of Verbalized Confidence in Large Language Models
by: Li, Chen, et al.
Published: (2026)
by: Li, Chen, et al.
Published: (2026)
Evaluation of Missing Data Imputation for Time Series Without Ground Truth
by: Farjallah, Rania, et al.
Published: (2025)
by: Farjallah, Rania, et al.
Published: (2025)
On the Minimax Regret in Online Ranking with Top-k Feedback
by: Zhang, Mingyuan, et al.
Published: (2023)
by: Zhang, Mingyuan, et al.
Published: (2023)
Confidence Calibration under Ambiguous Ground Truth
by: Tao, Linwei, et al.
Published: (2026)
by: Tao, Linwei, et al.
Published: (2026)
Evaluating Judges as Evaluators: The JETTS Benchmark of LLM-as-Judges as Test-Time Scaling Evaluators
by: Zhou, Yilun, et al.
Published: (2025)
by: Zhou, Yilun, et al.
Published: (2025)
Large Language Model Compression with Global Rank and Sparsity Optimization
by: Zhou, Changhai, et al.
Published: (2025)
by: Zhou, Changhai, et al.
Published: (2025)
CodeJudge: Evaluating Code Generation with Large Language Models
by: Tong, Weixi, et al.
Published: (2024)
by: Tong, Weixi, et al.
Published: (2024)
AFLoRA: Adaptive Federated Fine-Tuning of Large Language Models with Resource-Aware Low-Rank Adaption
by: Zhou, Yajie, et al.
Published: (2025)
by: Zhou, Yajie, et al.
Published: (2025)
Hallucination to Truth: A Review of Fact-Checking and Factuality Evaluation in Large Language Models
by: Rahman, Subhey Sadi, et al.
Published: (2025)
by: Rahman, Subhey Sadi, et al.
Published: (2025)
TruthX: Alleviating Hallucinations by Editing Large Language Models in Truthful Space
by: Zhang, Shaolei, et al.
Published: (2024)
by: Zhang, Shaolei, et al.
Published: (2024)
A Novel Score-CAM based Denoiser for Spectrographic Signature Extraction without Ground Truth
by: Elias, Noel
Published: (2024)
by: Elias, Noel
Published: (2024)
Gaussian Mixture Vector Quantization with Aggregated Categorical Posterior
by: Yan, Mingyuan, et al.
Published: (2024)
by: Yan, Mingyuan, et al.
Published: (2024)
VQSynery: Robust Drug Synergy Prediction With Vector Quantization Mechanism
by: Wu, Jiawei, et al.
Published: (2024)
by: Wu, Jiawei, et al.
Published: (2024)
Sparsity-Aware Low-Rank Representation for Efficient Fine-Tuning of Large Language Models
by: Zhang, Longteng, et al.
Published: (2026)
by: Zhang, Longteng, et al.
Published: (2026)
QoS-QoE Translation with Large Language Model
by: Yu, Yingjie, et al.
Published: (2026)
by: Yu, Yingjie, et al.
Published: (2026)
TalkLoRA: Communication-Aware Mixture of Low-Rank Adaptation for Large Language Models
by: Mu, Lin, et al.
Published: (2026)
by: Mu, Lin, et al.
Published: (2026)
From Ground Truth to Measurement: A Statistical Framework for Human Labeling
by: Chew, Robert, et al.
Published: (2026)
by: Chew, Robert, et al.
Published: (2026)
DLM-One: Diffusion Language Models for One-Step Sequence Generation
by: Chen, Tianqi, et al.
Published: (2025)
by: Chen, Tianqi, et al.
Published: (2025)
Free Draft-and-Verification: Toward Lossless Parallel Decoding for Diffusion Large Language Models
by: Wu, Shutong, et al.
Published: (2025)
by: Wu, Shutong, et al.
Published: (2025)
A Hybrid Model for Traffic Incident Detection based on Generative Adversarial Networks and Transformer Model
by: Lu, Xinying, et al.
Published: (2024)
by: Lu, Xinying, et al.
Published: (2024)
FairJudge: Abstention-Aware Multimodal Judges for Fairness and Alignment Evaluation in Text-to-Image Models
by: Sahili, Zahraa Al, et al.
Published: (2025)
by: Sahili, Zahraa Al, et al.
Published: (2025)
Conformalized Credal Regions for Classification with Ambiguous Ground Truth
by: Caprio, Michele, et al.
Published: (2024)
by: Caprio, Michele, et al.
Published: (2024)
From Rubrics to Reliable Scores: Evidence-Grounded Text Evaluation with LLM Judges
by: Hong, Yihan, et al.
Published: (2026)
by: Hong, Yihan, et al.
Published: (2026)
Similar Items
-
Ranking Large Language Models without Ground Truth
by: Dhurandhar, Amit, et al.
Published: (2024) -
From Hawkes Processes to Attention: Time-Modulated Mechanisms for Event Sequences
by: Tan, Xinzi, et al.
Published: (2026) -
Evaluating Model Explanations without Ground Truth
by: Rawal, Kaivalya, et al.
Published: (2025) -
Calibration without Ground Truth
by: Kong, Yuqing, et al.
Published: (2026) -
Self-Compatibility: Evaluating Causal Discovery without Ground Truth
by: Faller, Philipp M., et al.
Published: (2023)