Salvato in:
Dettagli Bibliografici
Autori principali: Boguraev, Sasha, Lipkin, Ben, Weissweiler, Leonie, Mahowald, Kyle
Natura: Preprint
Pubblicazione: 2024
Soggetti:
Accesso online:https://arxiv.org/abs/2409.17005
Tags: Aggiungi Tag
Nessun Tag, puoi essere il primo ad aggiungerne!!
_version_ 1866917824478314496
author Boguraev, Sasha
Lipkin, Ben
Weissweiler, Leonie
Mahowald, Kyle
author_facet Boguraev, Sasha
Lipkin, Ben
Weissweiler, Leonie
Mahowald, Kyle
contents Math is constructed by people for people: just as natural language corpora reflect not just propositions but the communicative goals of language users, the math data that models are trained on reflects not just idealized mathematical entities but rich communicative intentions. While there are important advantages to treating math in a purely symbolic manner, we here hypothesize that there are benefits to treating math as situated linguistic communication and that language models are well suited for this goal, in ways that are not fully appreciated. We illustrate these points with two case studies. First, we ran an experiment in which we found that language models interpret the equals sign in a humanlike way -- generating systematically different word problems for the same underlying equation arranged in different ways. Second, we found that language models prefer proofs to be ordered in naturalistic ways, even though other orders would be logically equivalent. We advocate for AI systems that learn from and represent the communicative intentions latent in human-generated math.
format Preprint
id arxiv_https___arxiv_org_abs_2409_17005
institution arXiv
publishDate 2024
record_format arxiv
spellingShingle Models Can and Should Embrace the Communicative Nature of Human-Generated Math
Boguraev, Sasha
Lipkin, Ben
Weissweiler, Leonie
Mahowald, Kyle
Artificial Intelligence
Computation and Language
Math is constructed by people for people: just as natural language corpora reflect not just propositions but the communicative goals of language users, the math data that models are trained on reflects not just idealized mathematical entities but rich communicative intentions. While there are important advantages to treating math in a purely symbolic manner, we here hypothesize that there are benefits to treating math as situated linguistic communication and that language models are well suited for this goal, in ways that are not fully appreciated. We illustrate these points with two case studies. First, we ran an experiment in which we found that language models interpret the equals sign in a humanlike way -- generating systematically different word problems for the same underlying equation arranged in different ways. Second, we found that language models prefer proofs to be ordered in naturalistic ways, even though other orders would be logically equivalent. We advocate for AI systems that learn from and represent the communicative intentions latent in human-generated math.
title Models Can and Should Embrace the Communicative Nature of Human-Generated Math
topic Artificial Intelligence
Computation and Language
url https://arxiv.org/abs/2409.17005