:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Wang, Qian, Zhao, Xuandong, Zhang, Zirui, Lou, Zhanzhi, Chen, Nuo, Song, Dawn, He, Bingsheng
Format:	Preprint
Published:	2026
Subjects:	Computers and Society Machine Learning
Online Access:	https://arxiv.org/abs/2602.01528
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Assessing Judging Bias in Large Reasoning Models: An Empirical Study
by: Wang, Qian, et al.
Published: (2025)

Towards Evaluting Fake Reasoning Bias in Language Models
by: Wang, Qian, et al.
Published: (2025)

Self-Sovereign Agent
by: Qu, Wenjie, et al.
Published: (2026)

Learning to Reason without External Rewards
by: Zhao, Xuandong, et al.
Published: (2025)

Scalable Best-of-N Selection for Large Language Models via Self-Certainty
by: Kang, Zhewei, et al.
Published: (2025)

Are You Getting What You Pay For? Auditing Model Substitution in LLM APIs
by: Cai, Will, et al.
Published: (2025)

MemFail: Stress-Testing Failure Modes of LLM Memory Systems
by: Garg, Ishir, et al.
Published: (2026)

Learning to Learn-at-Test-Time: Language Agents with Learnable Adaptation Policies
by: Lou, Zhanzhi, et al.
Published: (2026)

"They've Stolen My GPL-Licensed Model!": Toward Standardized and Transparent Model Licensing
by: Duan, Moming, et al.
Published: (2024)

Integrating Reason-Based Moral Decision-Making in the Reinforcement Learning Architecture
by: Dargasz, Lisa
Published: (2025)

An Undetectable Watermark for Generative Image Models
by: Gunn, Sam, et al.
Published: (2024)

Position: The Current AI Conference Model is Unsustainable! Diagnosing the Crisis of Centralized AI Conference
by: Chen, Nuo, et al.
Published: (2025)

Training Fair Models in Federated Learning without Data Privacy Infringement
by: Che, Xin, et al.
Published: (2021)

The Hidden Risks of Large Reasoning Models: A Safety Assessment of R1
by: Zhou, Kaiwen, et al.
Published: (2025)

From ChatGPT to DeepSeek: Can LLMs Simulate Humanity?
by: Wang, Qian, et al.
Published: (2025)

Improving LLM Safety Alignment with Dual-Objective Optimization
by: Zhao, Xuandong, et al.
Published: (2025)

Improving Adversarial Robust Fairness via Anti-Bias Soft Label Distillation
by: Zhao, Shiji, et al.
Published: (2023)

The Landscape of Memorization in LLMs: Mechanisms, Measurement, and Mitigation
by: Xiong, Alexander, et al.
Published: (2025)

DCAST: Diverse Class-Aware Self-Training Mitigates Selection Bias for Fairer Learning
by: Tepeli, Yasin I., et al.
Published: (2024)

Machine Learning-Driven Student Performance Prediction for Enhancing Tiered Instruction
by: Chen, Yawen, et al.
Published: (2025)

Adapting Static Fairness to Sequential Decision-Making: Bias Mitigation Strategies towards Equal Long-term Benefit Rate
by: Xu, Yuancheng, et al.
Published: (2023)

Beyond Brainstorming: What Drives High-Quality Scientific Ideas? Lessons from Multi-Agent Collaboration
by: Chen, Nuo, et al.
Published: (2025)

Integrating Social Determinants of Health into Knowledge Graphs: Evaluating Prediction Bias and Fairness in Healthcare
by: Shang, Tianqi, et al.
Published: (2024)

Towards Responsible AI in Banking: Addressing Bias for Fair Decision-Making
by: Castelnovo, Alessandro
Published: (2024)

Exploring LLM Cryptocurrency Trading Through Fact-Subjectivity Aware Reasoning
by: Wang, Qian, et al.
Published: (2024)

Teaching LLM to Reason: Reinforcement Learning from Algorithmic Problems without Code
by: Bao, Keqin, et al.
Published: (2025)

BiasFreeBench: a Benchmark for Mitigating Bias in Large Language Model Responses
by: Xu, Xin, et al.
Published: (2025)

Robust Meta-Model for Predicting the Need for Blood Transfusion in Non-traumatic ICU Patients
by: Rafiei, Alireza, et al.
Published: (2024)

LeakAgent: RL-based Red-teaming Agent for LLM Privacy Leakage
by: Nie, Yuzhou, et al.
Published: (2024)

Backdoor for Debias: Mitigating Model Bias with Backdoor Attack-based Artificial Bias
by: Wu, Shangxi, et al.
Published: (2023)

LLM DNA: Tracing Model Evolution via Functional Representations
by: Wu, Zhaomin, et al.
Published: (2025)

Bias Begins with Data: The FairGround Corpus for Robust and Reproducible Research on Algorithmic Fairness
by: Simson, Jan, et al.
Published: (2025)

Predicting Human Mobility during Extreme Events via LLM-Enhanced Cross-City Learning
by: Tang, Yinzhou, et al.
Published: (2025)

Reinforced Sequential Decision-Making for Sepsis Treatment: The POSNEGDM Framework with Mortality Classifier and Transformer
by: Tamboli, Dipesh, et al.
Published: (2024)

Addressing Discretization-Induced Bias in Demographic Prediction
by: Dong, Evan, et al.
Published: (2024)

BadFair: Backdoored Fairness Attacks with Group-conditioned Triggers
by: Xue, Jiaqi, et al.
Published: (2024)

The Relative Value of Prediction in Algorithmic Decision Making
by: Perdomo, Juan Carlos
Published: (2023)

Mitigating Gender Bias in Depression Detection via Counterfactual Inference
by: Hu, Mingxuan, et al.
Published: (2025)

Remembering to Be Fair: Non-Markovian Fairness in Sequential Decision Making
by: Alamdari, Parand A., et al.
Published: (2023)

Counterfactually Fair Reinforcement Learning via Sequential Data Preprocessing
by: Wang, Jitao, et al.
Published: (2025)