Staff View: :: Library Catalog

Saved in:

Bibliographic Details
Main Authors:	Witsken, Gavin, Crk, Igor, Gultepe, Eren
Format:	Preprint
Published:	2025
Subjects:	Computers and Society Artificial Intelligence
Online Access:	https://arxiv.org/abs/2503.18995
Tags:	Add Tag No Tags, Be the first to tag this record!

_version_	1866916661813051392
author	Witsken, Gavin Crk, Igor Gultepe, Eren
author_facet	Witsken, Gavin Crk, Igor Gultepe, Eren
contents	We randomly deploy questions constructed with and without use of the LLM tool and gauge the ability of the students to correctly answer, as well as their ability to correctly perceive the difference between human-authored and LLM-authored questions. In determining whether the questions written with the aid of ChatGPT were consistent with the instructor's questions and source text, we computed representative vectors of both the human and ChatGPT questions using SBERT and compared cosine similarity to the course textbook. A non-significant Mann-Whitney U test (z = 1.018, p = .309) suggests that students were unable to perceive whether questions were written with or without the aid of ChatGPT. However, student scores on LLM-authored questions were almost 9% lower (z = 2.702, p < .01). This result may indicate that either the AI questions were more difficult or that the students were more familiar with the instructor's style of questions. Overall, the study suggests that while there is potential for using LLM tools to aid in the construction of assessments, care must be taken to ensure that the questions are fair, well-composed, and relevant to the course material.
format	Preprint
id	arxiv_https___arxiv_org_abs_2503_18995
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	LLMs in the Classroom: Outcomes and Perceptions of Questions Written with the Aid of AI Witsken, Gavin Crk, Igor Gultepe, Eren Computers and Society Artificial Intelligence We randomly deploy questions constructed with and without use of the LLM tool and gauge the ability of the students to correctly answer, as well as their ability to correctly perceive the difference between human-authored and LLM-authored questions. In determining whether the questions written with the aid of ChatGPT were consistent with the instructor's questions and source text, we computed representative vectors of both the human and ChatGPT questions using SBERT and compared cosine similarity to the course textbook. A non-significant Mann-Whitney U test (z = 1.018, p = .309) suggests that students were unable to perceive whether questions were written with or without the aid of ChatGPT. However, student scores on LLM-authored questions were almost 9% lower (z = 2.702, p < .01). This result may indicate that either the AI questions were more difficult or that the students were more familiar with the instructor's style of questions. Overall, the study suggests that while there is potential for using LLM tools to aid in the construction of assessments, care must be taken to ensure that the questions are fair, well-composed, and relevant to the course material.
title	LLMs in the Classroom: Outcomes and Perceptions of Questions Written with the Aid of AI
topic	Computers and Society Artificial Intelligence
url	https://arxiv.org/abs/2503.18995

Similar Items