Saved in:
| Main Author: | |
|---|---|
| Format: | Recurso digital |
| Language: | |
| Published: |
Zenodo
2026
|
| Online Access: | https://doi.org/10.5281/zenodo.20054277 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Table of Contents:
- <p class="MsoNormal"><em><span>Maybe not.</span></em></p> <p class="MsoNormal"><span>Large language models (LLMs) can produce fluent and plausible peer-review reports, but fluency is not scholarly judgment. This paper presents a case study of three formal peer-review reports received during an actual submission to a Korea Citation Index (KCI) candidate journal. One review claimed that key methodological and experimental details were missing, although the manuscript explicitly contained information about validation data, hardware, software frameworks, model configuration, parameter count, algorithmic procedure, and limitations. After the author reported these grounding failures to the editor, the problematic review was not relied upon in the final decision, and the manuscript was subsequently accepted and published. The case is compared with two more grounded human reviews and an exploratory source-grounded LLM-generated review. The source-grounded LLM review was more aligned with the manuscript, but still produced minor factual errors and overstatements. The case suggests that grounding improves LLM-assisted review, but does not eliminate hallucination or accountability gaps.</span></p>