Saved in:
Bibliographic Details
Main Authors: Sandoval, Sandra C., Acquaye, Christabel, Cobbina, Kwesi, Teli, Mohammad Nayeem, Daumé III, Hal
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2502.04564
Tags: Add Tag
No Tags, Be the first to tag this record!
Table of Contents:
  • We examine the representation of African American English (AAE) in large language models (LLMs), exploring (a) the perceptions Black Americans have of how effective these technologies are at producing authentic AAE, and (b) in what contexts Black Americans find this desirable. Through both a survey of Black Americans ($n=$ 104) and annotation of LLM-produced AAE by Black Americans ($n=$ 228), we find that Black Americans favor choice and autonomy in determining when AAE is appropriate in LLM output. They tend to prefer that LLMs default to communicating in Mainstream U.S. English in formal settings, with greater interest in AAE production in less formal settings. When LLMs were appropriately prompted and provided in context examples, our participants found their outputs to have a level of AAE authenticity on par with transcripts of Black American speech. Select code and data for our project can be found here: https://github.com/smelliecat/AAEMime.git