Staff View: Large language models surpass human experts in predicting neuroscience results :: Library Catalog

Bibliographic Details
Main Authors:	Luo, Xiaoliang; Rechardt, Akilles; Sun, Guangzhi; Nejad, Kevin K.; Yáñez, Felipe; Yilmaz, Bati; Lee, Kangjoo; Cohen, Alexandra O.; Borghesani, Valentina; Pashkov, Anton; Marinazzo, Daniele; Nicholas, Jonathan; Salatiello, Alessandro; Sucholutsky, Ilia; Minervini, Pasquale; Razavi, Sepehr; Rocca, Roberta; Yusifov, Elkhan; Okalova, Tereza; Gu, Nianlong; Ferianc, Martin; Khona, Mikail; Patil, Kaustubh R.; Lee, Pui-Shee; Mata, Rui; Myers, Nicholas E.; Bizley, Jennifer K; Musslick, Sebastian; Bilgin, Isil Poyraz; Niso, Guiomar; Ales, Justin M.; Gaebler, Michael; Murty, N Apurva Ratan; Loued-Khenissi, Leyla; Behler, Anna; Hall, Chloe M.; Dafflon, Jessica; Bao, Sherry Dongqi; Love, Bradley C.
Format:	Preprint
Published:	2024
Subjects:	Neurons and Cognition; Artificial Intelligence
Online Access:	https://arxiv.org/abs/2403.03230

_version_	1866917850605682688
author	Luo, Xiaoliang; Rechardt, Akilles; Sun, Guangzhi; Nejad, Kevin K.; Yáñez, Felipe; Yilmaz, Bati; Lee, Kangjoo; Cohen, Alexandra O.; Borghesani, Valentina; Pashkov, Anton; Marinazzo, Daniele; Nicholas, Jonathan; Salatiello, Alessandro; Sucholutsky, Ilia; Minervini, Pasquale; Razavi, Sepehr; Rocca, Roberta; Yusifov, Elkhan; Okalova, Tereza; Gu, Nianlong; Ferianc, Martin; Khona, Mikail; Patil, Kaustubh R.; Lee, Pui-Shee; Mata, Rui; Myers, Nicholas E.; Bizley, Jennifer K; Musslick, Sebastian; Bilgin, Isil Poyraz; Niso, Guiomar; Ales, Justin M.; Gaebler, Michael; Murty, N Apurva Ratan; Loued-Khenissi, Leyla; Behler, Anna; Hall, Chloe M.; Dafflon, Jessica; Bao, Sherry Dongqi; Love, Bradley C.
author_facet	Luo, Xiaoliang; Rechardt, Akilles; Sun, Guangzhi; Nejad, Kevin K.; Yáñez, Felipe; Yilmaz, Bati; Lee, Kangjoo; Cohen, Alexandra O.; Borghesani, Valentina; Pashkov, Anton; Marinazzo, Daniele; Nicholas, Jonathan; Salatiello, Alessandro; Sucholutsky, Ilia; Minervini, Pasquale; Razavi, Sepehr; Rocca, Roberta; Yusifov, Elkhan; Okalova, Tereza; Gu, Nianlong; Ferianc, Martin; Khona, Mikail; Patil, Kaustubh R.; Lee, Pui-Shee; Mata, Rui; Myers, Nicholas E.; Bizley, Jennifer K; Musslick, Sebastian; Bilgin, Isil Poyraz; Niso, Guiomar; Ales, Justin M.; Gaebler, Michael; Murty, N Apurva Ratan; Loued-Khenissi, Leyla; Behler, Anna; Hall, Chloe M.; Dafflon, Jessica; Bao, Sherry Dongqi; Love, Bradley C.
contents	Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs were confident in their predictions, they were more likely to be correct, which presages a future where humans and LLMs team together to make discoveries. Our approach is not neuroscience-specific and is transferable to other knowledge-intensive endeavors.
format	Preprint
id	arxiv_https___arxiv_org_abs_2403_03230
institution	arXiv
publishDate	2024
record_format	arxiv
title	Large language models surpass human experts in predicting neuroscience results
topic	Neurons and Cognition; Artificial Intelligence
url	https://arxiv.org/abs/2403.03230

Similar Items