Saved in:
| Main Authors: | Li, Yanlin, Liu, Hao, Liu, Huimin, Wang, Kun, Wei, Yinwei, Hu, Yupeng |
|---|---|
| Format: | Preprint |
| Published: |
2025
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2506.14161 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Implicit Bias in LLMs: A Survey
by: Lin, Xinru, et al.
Published: (2025)
by: Lin, Xinru, et al.
Published: (2025)
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
by: Xiao, Yang, et al.
Published: (2025)
by: Xiao, Yang, et al.
Published: (2025)
Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
by: Chen, Chao, et al.
Published: (2025)
by: Chen, Chao, et al.
Published: (2025)
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions
by: Borah, Angana, et al.
Published: (2024)
by: Borah, Angana, et al.
Published: (2024)
Beyond Words: Evaluating and Bridging Epistemic Divergence in User-Agent Interaction via Theory of Mind
by: Ruan, Minyuan, et al.
Published: (2026)
by: Ruan, Minyuan, et al.
Published: (2026)
Assessing LLMs in Art Contexts: Critique Generation and Theory of Mind Evaluation
by: Arita, Takaya, et al.
Published: (2025)
by: Arita, Takaya, et al.
Published: (2025)
LLMs and their Limited Theory of Mind: Evaluating Mental State Annotations in Situated Dialogue
by: Kowalyshyn, Katharine, et al.
Published: (2025)
by: Kowalyshyn, Katharine, et al.
Published: (2025)
Context-Agent: Dynamic Discourse Trees for Non-Linear Dialogue
by: Hu, Junan, et al.
Published: (2026)
by: Hu, Junan, et al.
Published: (2026)
DIF: A Framework for Benchmarking and Verifying Implicit Bias in LLMs
by: Yin, Lake, et al.
Published: (2025)
by: Yin, Lake, et al.
Published: (2025)
Towards Safety Evaluations of Theory of Mind in Large Language Models
by: Aoshima, Tatsuhiro, et al.
Published: (2025)
by: Aoshima, Tatsuhiro, et al.
Published: (2025)
Evaluating and Enhancing LLMs Agent based on Theory of Mind in Guandan: A Multi-Player Cooperative Game under Imperfect Information
by: Yim, Yauwai, et al.
Published: (2024)
by: Yim, Yauwai, et al.
Published: (2024)
QGEval: Benchmarking Multi-dimensional Evaluation for Question Generation
by: Fu, Weiping, et al.
Published: (2024)
by: Fu, Weiping, et al.
Published: (2024)
Can LLMs Outshine Conventional Recommenders? A Comparative Evaluation
by: Liu, Qijiong, et al.
Published: (2025)
by: Liu, Qijiong, et al.
Published: (2025)
Mind the Language Gap: Automated and Augmented Evaluation of Bias in LLMs for High- and Low-Resource Languages
by: Buscemi, Alessio, et al.
Published: (2025)
by: Buscemi, Alessio, et al.
Published: (2025)
CreDes: Causal Reasoning Enhancement and Dual-End Searching for Solving Long-Range Reasoning Problems using LLMs
by: Wang, Kangsheng, et al.
Published: (2024)
by: Wang, Kangsheng, et al.
Published: (2024)
Bias Runs Deep: Implicit Reasoning Biases in Persona-Assigned LLMs
by: Gupta, Shashank, et al.
Published: (2023)
by: Gupta, Shashank, et al.
Published: (2023)
AesBiasBench: Evaluating Bias and Alignment in Multimodal Language Models for Personalized Image Aesthetic Assessment
by: Li, Kun, et al.
Published: (2025)
by: Li, Kun, et al.
Published: (2025)
MIST: Jailbreaking Black-box Large Language Models via Iterative Semantic Tuning
by: Zheng, Muyang, et al.
Published: (2025)
by: Zheng, Muyang, et al.
Published: (2025)
LLMs Are Biased Towards Output Formats! Systematically Evaluating and Mitigating Output Format Bias of LLMs
by: Long, Do Xuan, et al.
Published: (2024)
by: Long, Do Xuan, et al.
Published: (2024)
Do LLMs Exhibit Human-Like Reasoning? Evaluating Theory of Mind in LLMs for Open-Ended Responses
by: Amirizaniani, Maryam, et al.
Published: (2024)
by: Amirizaniani, Maryam, et al.
Published: (2024)
Towards Transfer Unlearning: Empirical Evidence of Cross-Domain Bias Mitigation
by: Lu, Huimin, et al.
Published: (2024)
by: Lu, Huimin, et al.
Published: (2024)
States Hidden in Hidden States: LLMs Emerge Discrete State Representations Implicitly
by: Chen, Junhao, et al.
Published: (2024)
by: Chen, Junhao, et al.
Published: (2024)
TactfulToM: Do LLMs Have the Theory of Mind Ability to Understand White Lies?
by: Liu, Yiwei, et al.
Published: (2025)
by: Liu, Yiwei, et al.
Published: (2025)
MALIBU Benchmark: Multi-Agent LLM Implicit Bias Uncovered
by: Mirza, Imran, et al.
Published: (2025)
by: Mirza, Imran, et al.
Published: (2025)
MindDial: Belief Dynamics Tracking with Theory-of-Mind Modeling for Situated Neural Dialogue Generation
by: Qiu, Shuwen, et al.
Published: (2023)
by: Qiu, Shuwen, et al.
Published: (2023)
Evaluate Bias without Manual Test Sets: A Concept Representation Perspective for LLMs
by: Gao, Lang, et al.
Published: (2025)
by: Gao, Lang, et al.
Published: (2025)
AstroMind: A High-Fidelity Benchmark for Spacecraft Behavior Reasoning Based on Large Language Models
by: Liu, Hao, et al.
Published: (2026)
by: Liu, Hao, et al.
Published: (2026)
UserHarness: Harnessing User Minds for Stronger Agent Theory-of-Mind
by: Qian, Cheng, et al.
Published: (2026)
by: Qian, Cheng, et al.
Published: (2026)
Towards Multimodal Sentiment Analysis Debiasing via Bias Purification
by: Yang, Dingkang, et al.
Published: (2024)
by: Yang, Dingkang, et al.
Published: (2024)
SoMi-ToM: Evaluating Multi-Perspective Theory of Mind in Embodied Social Interactions
by: Fan, Xianzhe, et al.
Published: (2025)
by: Fan, Xianzhe, et al.
Published: (2025)
Theory of Mind and Self-Attributions of Mentality are Dissociable in LLMs
by: Kim, Junsol, et al.
Published: (2026)
by: Kim, Junsol, et al.
Published: (2026)
Multi-ToM: Evaluating Multilingual Theory of Mind Capabilities in Large Language Models
by: Sadhu, Jayanta, et al.
Published: (2024)
by: Sadhu, Jayanta, et al.
Published: (2024)
Evaluating the Bias in LLMs for Surveying Opinion and Decision Making in Healthcare
by: Khaokaew, Yonchanok, et al.
Published: (2025)
by: Khaokaew, Yonchanok, et al.
Published: (2025)
Evaluating Implicit Bias in Large Language Models by Attacking From a Psychometric Perspective
by: Wen, Yuchen, et al.
Published: (2024)
by: Wen, Yuchen, et al.
Published: (2024)
MidPO: Dual Preference Optimization for Safety and Helpfulness in Large Language Models via a Mixture of Experts Framework
by: Qi, Yupeng, et al.
Published: (2025)
by: Qi, Yupeng, et al.
Published: (2025)
CSSBench: Evaluating the Safety of Lightweight LLMs against Chinese-Specific Adversarial Patterns
by: Zhou, Zhenhong, et al.
Published: (2026)
by: Zhou, Zhenhong, et al.
Published: (2026)
Promoting Equality in Large Language Models: Identifying and Mitigating the Implicit Bias based on Bayesian Theory
by: Deng, Yongxin, et al.
Published: (2024)
by: Deng, Yongxin, et al.
Published: (2024)
Mind the Ambiguity: Aleatoric Uncertainty Quantification in LLMs for Safe Medical Question Answering
by: Liu, Yaokun, et al.
Published: (2026)
by: Liu, Yaokun, et al.
Published: (2026)
McBE: A Multi-task Chinese Bias Evaluation Benchmark for Large Language Models
by: Lan, Tian, et al.
Published: (2025)
by: Lan, Tian, et al.
Published: (2025)
Evaluating Large Language Models in Theory of Mind Tasks
by: Kosinski, Michal
Published: (2023)
by: Kosinski, Michal
Published: (2023)
Similar Items
-
Implicit Bias in LLMs: A Survey
by: Lin, Xinru, et al.
Published: (2025) -
Towards Dynamic Theory of Mind: Evaluating LLM Adaptation to Temporal Evolution of Human States
by: Xiao, Yang, et al.
Published: (2025) -
Reasoning in the Dark: Interleaved Vision-Text Reasoning in Latent Space
by: Chen, Chao, et al.
Published: (2025) -
Towards Implicit Bias Detection and Mitigation in Multi-Agent LLM Interactions
by: Borah, Angana, et al.
Published: (2024) -
Beyond Words: Evaluating and Bridging Epistemic Divergence in User-Agent Interaction via Theory of Mind
by: Ruan, Minyuan, et al.
Published: (2026)