Saved in:
Bibliographic Details
Main Authors: Huotala, Aleksi, Kuutila, Miikka, Turtio, Olli-Pekka, Sipilä, Simo, Mäntylä, Mika
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2510.06708
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866915941243158528
author Huotala, Aleksi
Kuutila, Miikka
Turtio, Olli-Pekka
Sipilä, Simo
Mäntylä, Mika
author_facet Huotala, Aleksi
Kuutila, Miikka
Turtio, Olli-Pekka
Sipilä, Simo
Mäntylä, Mika
contents Conducting systematic reviews is laborious. In the screening or study selection phase, the number of papers can be overwhelming. Recent research has demonstrated that large language models (LLMs) can perform title-abstract screening and support humans in the task. To this end, we developed AISysRev, an LLM-based screening tool implemented as a containerized web application. The tool accepts CSV files containing paper titles and abstracts. Users specify inclusion and exclusion criteria. Multiple different LLMs can be used, such as Gemini, Claude, Mistral or ChatGPT via OpenRouter. We also support locally hosted models and any model compatible with the OpenAI SDK. AISysRev implements both zero-shot and few-shot prompting, and also allows for manual screening through interfaces that display LLM results as guidance for human reviewers. LLM calls are parallelized, meaning screening speed is typically between 100 to 300 papers per minute, depending on the model and the host. To demonstrate the tool's use in practice, we conducted a qualitative trial study with 137 papers using the tool. Our findings indicate that papers can be classified into four categories: Easy Includes, Easy Excludes, Boundary Includes, and Boundary Excludes. The Boundary cases, where LLMs are prone to errors, highlight the need for human intervention. While LLMs do not replace human judgment in systematic reviews, they can reduce the burden of assessing large volumes of scientific literature. Video: https://www.youtube.com/watch?v=HeblemlgnAQ Tool: https://github.com/EvoTestOps/AISysRev
format Preprint
id arxiv_https___arxiv_org_abs_2510_06708
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle AISysRev -- LLM-based Tool for Title-abstract Screening
Huotala, Aleksi
Kuutila, Miikka
Turtio, Olli-Pekka
Sipilä, Simo
Mäntylä, Mika
Software Engineering
Artificial Intelligence
Conducting systematic reviews is laborious. In the screening or study selection phase, the number of papers can be overwhelming. Recent research has demonstrated that large language models (LLMs) can perform title-abstract screening and support humans in the task. To this end, we developed AISysRev, an LLM-based screening tool implemented as a containerized web application. The tool accepts CSV files containing paper titles and abstracts. Users specify inclusion and exclusion criteria. Multiple different LLMs can be used, such as Gemini, Claude, Mistral or ChatGPT via OpenRouter. We also support locally hosted models and any model compatible with the OpenAI SDK. AISysRev implements both zero-shot and few-shot prompting, and also allows for manual screening through interfaces that display LLM results as guidance for human reviewers. LLM calls are parallelized, meaning screening speed is typically between 100 to 300 papers per minute, depending on the model and the host. To demonstrate the tool's use in practice, we conducted a qualitative trial study with 137 papers using the tool. Our findings indicate that papers can be classified into four categories: Easy Includes, Easy Excludes, Boundary Includes, and Boundary Excludes. The Boundary cases, where LLMs are prone to errors, highlight the need for human intervention. While LLMs do not replace human judgment in systematic reviews, they can reduce the burden of assessing large volumes of scientific literature. Video: https://www.youtube.com/watch?v=HeblemlgnAQ Tool: https://github.com/EvoTestOps/AISysRev
title AISysRev -- LLM-based Tool for Title-abstract Screening
topic Software Engineering
Artificial Intelligence
url https://arxiv.org/abs/2510.06708