Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Yuan, Youliang, Jiao, Wenxiang, Xie, Yuejin, Shen, Chihao, Tian, Menghan, Wang, Wenxuan, Huang, Jen-tse, He, Pinjia
Format:	Preprint
Published:	2025
Subjects:	Computation and Language Artificial Intelligence
Online Access:	https://arxiv.org/abs/2505.17455
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866912658709544960
author	Yuan, Youliang Jiao, Wenxiang Xie, Yuejin Shen, Chihao Tian, Menghan Wang, Wenxuan Huang, Jen-tse He, Pinjia
author_facet	Yuan, Youliang Jiao, Wenxiang Xie, Yuejin Shen, Chihao Tian, Menghan Wang, Wenxuan Huang, Jen-tse He, Pinjia
contents	Human safety awareness gaps often prevent the timely recognition of everyday risks. In solving this problem, a proactive safety artificial intelligence (AI) system would work better than a reactive one. Instead of just reacting to users' questions, it would actively watch people's behavior and their environment to detect potential dangers in advance. Our Proactive Safety Bench (PaSBench) evaluates this capability through 416 multimodal scenarios (128 image sequences, 288 text logs) spanning 5 safety-critical domains. Evaluation of 36 advanced models reveals fundamental limitations: Top performers like Gemini-2.5-pro achieve 71% image and 64% text accuracy, but miss 45-55% risks in repeated trials. Through failure analysis, we identify unstable proactive reasoning rather than knowledge deficits as the primary limitation. This work establishes (1) a proactive safety benchmark, (2) systematic evidence of model limitations, and (3) critical directions for developing reliable protective AI. We believe our dataset and findings can promote the development of safer AI assistants that actively prevent harm rather than merely respond to requests. Our dataset can be found at https://huggingface.co/datasets/Youliang/PaSBench.
format	Preprint
id	arxiv_https___arxiv_org_abs_2505_17455
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	Towards Evaluating Proactive Risk Awareness of Multimodal Language Models Yuan, Youliang Jiao, Wenxiang Xie, Yuejin Shen, Chihao Tian, Menghan Wang, Wenxuan Huang, Jen-tse He, Pinjia Computation and Language Artificial Intelligence Human safety awareness gaps often prevent the timely recognition of everyday risks. In solving this problem, a proactive safety artificial intelligence (AI) system would work better than a reactive one. Instead of just reacting to users' questions, it would actively watch people's behavior and their environment to detect potential dangers in advance. Our Proactive Safety Bench (PaSBench) evaluates this capability through 416 multimodal scenarios (128 image sequences, 288 text logs) spanning 5 safety-critical domains. Evaluation of 36 advanced models reveals fundamental limitations: Top performers like Gemini-2.5-pro achieve 71% image and 64% text accuracy, but miss 45-55% risks in repeated trials. Through failure analysis, we identify unstable proactive reasoning rather than knowledge deficits as the primary limitation. This work establishes (1) a proactive safety benchmark, (2) systematic evidence of model limitations, and (3) critical directions for developing reliable protective AI. We believe our dataset and findings can promote the development of safer AI assistants that actively prevent harm rather than merely respond to requests. Our dataset can be found at https://huggingface.co/datasets/Youliang/PaSBench.
title	Towards Evaluating Proactive Risk Awareness of Multimodal Language Models
topic	Computation and Language Artificial Intelligence
url	https://arxiv.org/abs/2505.17455

Similar Items