Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Cheng, Ruoxi, Ma, Haoxuan, Ma, Teng, Zhang, Hongyi
Format:	Preprint
Published:	2025
Subjects:	Artificial Intelligence
Online Access:	https://arxiv.org/abs/2511.11301
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866911516811329536
author	Cheng, Ruoxi Ma, Haoxuan Ma, Teng Zhang, Hongyi
author_facet	Cheng, Ruoxi Ma, Haoxuan Ma, Teng Zhang, Hongyi
contents	Large Vision-Language Models (LVLMs) exhibit powerful reasoning capabilities but suffer sophisticated jailbreak vulnerabilities. Fundamentally, aligning LVLMs is not just a safety challenge but a problem of economic efficiency. Current alignment methods struggle with the trade-off between safety, utility, and operational costs. Critically, a focus solely on final outputs (process-blindness) wastes significant computational budget on unsafe deliberation. This flaw allows harmful reasoning to be disguised with benign justifications, thereby circumventing simple additive safety scores. To address this, we propose EcoAlign, an inference-time framework that reframes alignment as an economically rational search by treating the LVLM as a boundedly rational agent. EcoAlign incrementally expands a thought graph and scores actions using a forward-looking function (analogous to net present value) that dynamically weighs expected safety, utility, and cost against the remaining budget. To prevent deception, path safety is enforced via the weakest-link principle. Extensive experiments across 3 closed-source and 2 open-source models on 6 datasets show that EcoAlign matches or surpasses state-of-the-art safety and utility at a lower computational cost, thereby offering a principled, economical pathway to robust LVLM alignment.
format	Preprint
id	arxiv_https___arxiv_org_abs_2511_11301
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment Cheng, Ruoxi Ma, Haoxuan Ma, Teng Zhang, Hongyi Artificial Intelligence Large Vision-Language Models (LVLMs) exhibit powerful reasoning capabilities but suffer sophisticated jailbreak vulnerabilities. Fundamentally, aligning LVLMs is not just a safety challenge but a problem of economic efficiency. Current alignment methods struggle with the trade-off between safety, utility, and operational costs. Critically, a focus solely on final outputs (process-blindness) wastes significant computational budget on unsafe deliberation. This flaw allows harmful reasoning to be disguised with benign justifications, thereby circumventing simple additive safety scores. To address this, we propose EcoAlign, an inference-time framework that reframes alignment as an economically rational search by treating the LVLM as a boundedly rational agent. EcoAlign incrementally expands a thought graph and scores actions using a forward-looking function (analogous to net present value) that dynamically weighs expected safety, utility, and cost against the remaining budget. To prevent deception, path safety is enforced via the weakest-link principle. Extensive experiments across 3 closed-source and 2 open-source models on 6 datasets show that EcoAlign matches or surpasses state-of-the-art safety and utility at a lower computational cost, thereby offering a principled, economical pathway to robust LVLM alignment.
title	EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment
topic	Artificial Intelligence
url	https://arxiv.org/abs/2511.11301

Similar Items