| Main Authors: | Wang, Qian; Zhao, Xuandong; Zhang, Zirui; Lou, Zhanzhi; Chen, Nuo; Song, Dawn; He, Bingsheng |
|---|---|
| Format: | Preprint |
| Published: | 2026 |
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.01528 |
Similar Items
Assessing Judging Bias in Large Reasoning Models: An Empirical Study
by: Wang, Qian, et al.
Published: (2025)
Towards Evaluting Fake Reasoning Bias in Language Models
by: Wang, Qian, et al.
Published: (2025)
Self-Sovereign Agent
by: Qu, Wenjie, et al.
Published: (2026)
Learning to Reason without External Rewards
by: Zhao, Xuandong, et al.
Published: (2025)
Scalable Best-of-N Selection for Large Language Models via Self-Certainty
by: Kang, Zhewei, et al.
Published: (2025)
Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs
by: Cai, Will, et al.
Published: (2025)
MemFail: Stress-Testing Failure Modes of LLM Memory Systems
by: Garg, Ishir, et al.
Published: (2026)
Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
by: Lou, Zhanzhi, et al.
Published: (2026)
"They've Stolen My GPL-Licensed Model!": Toward Standardized and Transparent Model Licensing
by: Duan, Moming, et al.
Published: (2024)
Integrating Reason-Based Moral Decision-Making in the Reinforcement Learning Architecture
by: Dargasz, Lisa
Published: (2025)
An Undetectable Watermark for Generative Image Models
by: Gunn, Sam, et al.
Published: (2024)
Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference
by: Chen, Nuo, et al.
Published: (2025)
Training Fair Models in Federated Learning without Data Privacy Infringement
by: Che, Xin, et al.
Published: (2021)
The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1
by: Zhou, Kaiwen, et al.
Published: (2025)
From ChatGPT to DeepSeek: Can LLMs Simulate Humanity?
by: Wang, Qian, et al.
Published: (2025)
Improving LLM Safety Alignment with Dual-Objective Optimization
by: Zhao, Xuandong, et al.
Published: (2025)
Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation
by: Zhao, Shiji, et al.
Published: (2023)
The Landscape of Memorization in LLMs: Mechanisms, Measurement, and Mitigation
by: Xiong, Alexander, et al.
Published: (2025)
DCAST: Diverse Class-Aware Self-Training Mitigates Selection Bias for Fairer Learning
by: Tepeli, Yasin I., et al.
Published: (2024)
Machine Learning-Driven Student Performance Prediction for Enhancing Tiered Instruction
by: Chen, Yawen, et al.
Published: (2025)
Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
by: Xu, Yuancheng, et al.
Published: (2023)
Beyond Brainstorming: What Drives High-Quality Scientific Ideas? Lessons from Multi-Agent Collaboration
by: Chen, Nuo, et al.
Published: (2025)
Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare
by: Shang, Tianqi, et al.
Published: (2024)
Towards Responsible AI in Banking: Addressing Bias for Fair Decision-Making
by: Castelnovo, Alessandro
Published: (2024)
Exploring LLM Cryptocurrency Trading Through Fact-Subjectivity Aware Reasoning
by: Wang, Qian, et al.
Published: (2024)
Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
by: Bao, Keqin, et al.
Published: (2025)
BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses
by: Xu, Xin, et al.
Published: (2025)
Robust Meta-Model for Predicting the Need for Blood Transfusion in Non-traumatic ICU Patients
by: Rafiei, Alireza, et al.
Published: (2024)
LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
by: Nie, Yuzhou, et al.
Published: (2024)
Backdoor for Debias: Mitigating Model Bias with Backdoor Attack-based Artificial Bias
by: Wu, Shangxi, et al.
Published: (2023)
LLM DNA: Tracing Model Evolution via Functional Representations
by: Wu, Zhaomin, et al.
Published: (2025)
Bias Begins with Data: The FairGround Corpus for Robust and Reproducible Research on Algorithmic Fairness
by: Simson, Jan, et al.
Published: (2025)
Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning
by: Tang, Yinzhou, et al.
Published: (2025)
Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer
by: Tamboli, Dipesh, et al.
Published: (2024)
Addressing Discretization-Induced Bias in Demographic Prediction
by: Dong, Evan, et al.
Published: (2024)
BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
by: Xue, Jiaqi, et al.
Published: (2024)
The Relative Value of Prediction in Algorithmic Decision Making
by: Perdomo, Juan Carlos
Published: (2023)
Mitigating Gender Bias in Depression Detection via Counterfactual Inference
by: Hu, Mingxuan, et al.
Published: (2025)
Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making
by: Alamdari, Parand A., et al.
Published: (2023)
Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
by: Wang, Jitao, et al.
Published: (2025)