MARC21: :: Library Catalog

Salvato in:

Dettagli Bibliografici
Autori principali:	Sun, Luning, Yuan, Yuzhuo, Yao, Yuan, Li, Yanyan, Zhang, Hao, Xie, Xing, Wang, Xiting, Luo, Fang, Stillwell, David
Natura:	Preprint
Pubblicazione:	2024
Soggetti:	Artificial Intelligence
Accesso online:	https://arxiv.org/abs/2412.03151
Tags:	Aggiungi Tag Nessun Tag, puoi essere il primo ad aggiungerne!!

_version_	1866910958267400192
author	Sun, Luning Yuan, Yuzhuo Yao, Yuan Li, Yanyan Zhang, Hao Xie, Xing Wang, Xiting Luo, Fang Stillwell, David
author_facet	Sun, Luning Yuan, Yuzhuo Yao, Yuan Li, Yanyan Zhang, Hao Xie, Xing Wang, Xiting Luo, Fang Stillwell, David
contents	Artificial intelligence has, so far, largely automated routine tasks, but what does it mean for the future of work if Large Language Models (LLMs) show creativity comparable to humans? To measure the creativity of LLMs holistically, the current study uses 13 creative tasks spanning three domains. We benchmark the LLMs against individual humans, and also take a novel approach by comparing them to the collective creativity of groups of humans. We find that the best LLMs (Claude and GPT-4) rank in the 52nd percentile against humans, and overall LLMs excel in divergent thinking and problem solving but lag in creative writing. When questioned 10 times, an LLM's collective creativity is equivalent to 8-10 humans. When more responses are requested, two additional responses of LLMs equal one extra human. Ultimately, LLMs, when optimally applied, may compete with a small group of humans in the future of work.
format	Preprint
id	arxiv_https___arxiv_org_abs_2412_03151
institution	arXiv
publishDate	2024
record_format	arxiv
spellingShingle	Large Language Models show both individual and collective creativity comparable to humans Sun, Luning Yuan, Yuzhuo Yao, Yuan Li, Yanyan Zhang, Hao Xie, Xing Wang, Xiting Luo, Fang Stillwell, David Artificial Intelligence Artificial intelligence has, so far, largely automated routine tasks, but what does it mean for the future of work if Large Language Models (LLMs) show creativity comparable to humans? To measure the creativity of LLMs holistically, the current study uses 13 creative tasks spanning three domains. We benchmark the LLMs against individual humans, and also take a novel approach by comparing them to the collective creativity of groups of humans. We find that the best LLMs (Claude and GPT-4) rank in the 52nd percentile against humans, and overall LLMs excel in divergent thinking and problem solving but lag in creative writing. When questioned 10 times, an LLM's collective creativity is equivalent to 8-10 humans. When more responses are requested, two additional responses of LLMs equal one extra human. Ultimately, LLMs, when optimally applied, may compete with a small group of humans in the future of work.
title	Large Language Models show both individual and collective creativity comparable to humans
topic	Artificial Intelligence
url	https://arxiv.org/abs/2412.03151

Documenti analoghi