Saved in:
Bibliographic Details
Main Authors: Cheng, Ruoxi, Ma, Haoxuan, Ma, Teng, Zhang, Hongyi
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2511.11301
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866911516811329536
author Cheng, Ruoxi
Ma, Haoxuan
Ma, Teng
Zhang, Hongyi
author_facet Cheng, Ruoxi
Ma, Haoxuan
Ma, Teng
Zhang, Hongyi
contents Large Vision-Language Models (LVLMs) exhibit powerful reasoning capabilities but suffer sophisticated jailbreak vulnerabilities. Fundamentally, aligning LVLMs is not just a safety challenge but a problem of economic efficiency. Current alignment methods struggle with the trade-off between safety, utility, and operational costs. Critically, a focus solely on final outputs (process-blindness) wastes significant computational budget on unsafe deliberation. This flaw allows harmful reasoning to be disguised with benign justifications, thereby circumventing simple additive safety scores. To address this, we propose EcoAlign, an inference-time framework that reframes alignment as an economically rational search by treating the LVLM as a boundedly rational agent. EcoAlign incrementally expands a thought graph and scores actions using a forward-looking function (analogous to net present value) that dynamically weighs expected safety, utility, and cost against the remaining budget. To prevent deception, path safety is enforced via the weakest-link principle. Extensive experiments across 3 closed-source and 2 open-source models on 6 datasets show that EcoAlign matches or surpasses state-of-the-art safety and utility at a lower computational cost, thereby offering a principled, economical pathway to robust LVLM alignment.
format Preprint
id arxiv_https___arxiv_org_abs_2511_11301
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment
Cheng, Ruoxi
Ma, Haoxuan
Ma, Teng
Zhang, Hongyi
Artificial Intelligence
Large Vision-Language Models (LVLMs) exhibit powerful reasoning capabilities but suffer sophisticated jailbreak vulnerabilities. Fundamentally, aligning LVLMs is not just a safety challenge but a problem of economic efficiency. Current alignment methods struggle with the trade-off between safety, utility, and operational costs. Critically, a focus solely on final outputs (process-blindness) wastes significant computational budget on unsafe deliberation. This flaw allows harmful reasoning to be disguised with benign justifications, thereby circumventing simple additive safety scores. To address this, we propose EcoAlign, an inference-time framework that reframes alignment as an economically rational search by treating the LVLM as a boundedly rational agent. EcoAlign incrementally expands a thought graph and scores actions using a forward-looking function (analogous to net present value) that dynamically weighs expected safety, utility, and cost against the remaining budget. To prevent deception, path safety is enforced via the weakest-link principle. Extensive experiments across 3 closed-source and 2 open-source models on 6 datasets show that EcoAlign matches or surpasses state-of-the-art safety and utility at a lower computational cost, thereby offering a principled, economical pathway to robust LVLM alignment.
title EcoAlign: An Economically Rational Framework for Efficient LVLM Alignment
topic Artificial Intelligence
url https://arxiv.org/abs/2511.11301