Saved in:
Bibliographic Details
Main Authors: Howell, Anthony, Wu, Nancy, Bagchi, Sharmistha, Kim, Yushim, Sun, Chayn
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2509.15132
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • This paper shows how a multimodal large language model (MLLM) can expand urban measurement capacity and support tracking of place-based policy interventions. Using a structured, reason-then-estimate pipeline on street-view imagery, GPT-4o infers neighborhood poverty and tree canopy, which we embed in a quasi-experimental design evaluating the legacy of 1930s redlining. GPT-4o recovers the expected adverse socio-environmental legacy effects of redlining, with estimates statistically indistinguishable from authoritative sources, and it outperforms a conventional pixel-based segmentation baseline-consistent with the idea that holistic scene reasoning extracts higher-order information beyond object counts alone. These results position MLLMs as policy-grade instruments for neighborhood measurement and motivate broader validation across policy-evaluation settings.