Saved in:
| Main Authors: | , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , , |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2403.03230 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
| _version_ | 1866917850605682688 |
|---|---|
| author | Luo, Xiaoliang Rechardt, Akilles Sun, Guangzhi Nejad, Kevin K. Yáñez, Felipe Yilmaz, Bati Lee, Kangjoo Cohen, Alexandra O. Borghesani, Valentina Pashkov, Anton Marinazzo, Daniele Nicholas, Jonathan Salatiello, Alessandro Sucholutsky, Ilia Minervini, Pasquale Razavi, Sepehr Rocca, Roberta Yusifov, Elkhan Okalova, Tereza Gu, Nianlong Ferianc, Martin Khona, Mikail Patil, Kaustubh R. Lee, Pui-Shee Mata, Rui Myers, Nicholas E. Bizley, Jennifer K Musslick, Sebastian Bilgin, Isil Poyraz Niso, Guiomar Ales, Justin M. Gaebler, Michael Murty, N Apurva Ratan Loued-Khenissi, Leyla Behler, Anna Hall, Chloe M. Dafflon, Jessica Bao, Sherry Dongqi Love, Bradley C. |
| author_facet | Luo, Xiaoliang Rechardt, Akilles Sun, Guangzhi Nejad, Kevin K. Yáñez, Felipe Yilmaz, Bati Lee, Kangjoo Cohen, Alexandra O. Borghesani, Valentina Pashkov, Anton Marinazzo, Daniele Nicholas, Jonathan Salatiello, Alessandro Sucholutsky, Ilia Minervini, Pasquale Razavi, Sepehr Rocca, Roberta Yusifov, Elkhan Okalova, Tereza Gu, Nianlong Ferianc, Martin Khona, Mikail Patil, Kaustubh R. Lee, Pui-Shee Mata, Rui Myers, Nicholas E. Bizley, Jennifer K Musslick, Sebastian Bilgin, Isil Poyraz Niso, Guiomar Ales, Justin M. Gaebler, Michael Murty, N Apurva Ratan Loued-Khenissi, Leyla Behler, Anna Hall, Chloe M. Dafflon, Jessica Bao, Sherry Dongqi Love, Bradley C. |
| contents | Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs were confident in their predictions, they were more likely to be correct, which presages a future where humans and LLMs team together to make discoveries. Our approach is not neuroscience-specific and is transferable to other knowledge-intensive endeavors. |
| format | Preprint |
| id |
arxiv_https___arxiv_org_abs_2403_03230 |
| institution | arXiv |
| publishDate | 2024 |
| record_format | arxiv |
| spellingShingle | Large language models surpass human experts in predicting neuroscience results Luo, Xiaoliang Rechardt, Akilles Sun, Guangzhi Nejad, Kevin K. Yáñez, Felipe Yilmaz, Bati Lee, Kangjoo Cohen, Alexandra O. Borghesani, Valentina Pashkov, Anton Marinazzo, Daniele Nicholas, Jonathan Salatiello, Alessandro Sucholutsky, Ilia Minervini, Pasquale Razavi, Sepehr Rocca, Roberta Yusifov, Elkhan Okalova, Tereza Gu, Nianlong Ferianc, Martin Khona, Mikail Patil, Kaustubh R. Lee, Pui-Shee Mata, Rui Myers, Nicholas E. Bizley, Jennifer K Musslick, Sebastian Bilgin, Isil Poyraz Niso, Guiomar Ales, Justin M. Gaebler, Michael Murty, N Apurva Ratan Loued-Khenissi, Leyla Behler, Anna Hall, Chloe M. Dafflon, Jessica Bao, Sherry Dongqi Love, Bradley C. Neurons and Cognition Artificial Intelligence Scientific discoveries often hinge on synthesizing decades of research, a task that potentially outstrips human information processing capacities. Large language models (LLMs) offer a solution. LLMs trained on the vast scientific literature could potentially integrate noisy yet interrelated findings to forecast novel results better than human experts. To evaluate this possibility, we created BrainBench, a forward-looking benchmark for predicting neuroscience results. We find that LLMs surpass experts in predicting experimental outcomes. BrainGPT, an LLM we tuned on the neuroscience literature, performed better yet. Like human experts, when LLMs were confident in their predictions, they were more likely to be correct, which presages a future where humans and LLMs team together to make discoveries. Our approach is not neuroscience-specific and is transferable to other knowledge-intensive endeavors. |
| title | Large language models surpass human experts in predicting neuroscience results |
| topic | Neurons and Cognition Artificial Intelligence |
| url | https://arxiv.org/abs/2403.03230 |