Table of Contents: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Howell, Anthony, Wu, Nancy, Bagchi, Sharmistha, Kim, Yushim, Sun, Chayn
Format:	Preprint
Published:	2025
Subjects:	Computers and Society Computer Vision and Pattern Recognition
Online Access:	https://arxiv.org/abs/2509.15132
Tags:	Add Tag No Tags, Be the first to tag this record!

Table of Contents:

This paper shows how a multimodal large language model (MLLM) can expand urban measurement capacity and support tracking of place-based policy interventions. Using a structured, reason-then-estimate pipeline on street-view imagery, GPT-4o infers neighborhood poverty and tree canopy, which we embed in a quasi-experimental design evaluating the legacy of 1930s redlining. GPT-4o recovers the expected adverse socio-environmental legacy effects of redlining, with estimates statistically indistinguishable from authoritative sources, and it outperforms a conventional pixel-based segmentation baseline-consistent with the idea that holistic scene reasoning extracts higher-order information beyond object counts alone. These results position MLLMs as policy-grade instruments for neighborhood measurement and motivate broader validation across policy-evaluation settings.

Similar Items