Internformat: :: Library Catalog

Gespeichert in:

Bibliographische Detailangaben
Hauptverfasser:	Park, Kwangsuk, Yang, Jiwoong
Format:	Preprint
Veröffentlicht:	2025
Schlagworte:	Computers and Society Artificial Intelligence
Online-Zugang:	https://arxiv.org/abs/2507.05321
Tags:	Tag hinzufügen Keine Tags, Fügen Sie den ersten Tag hinzu!

_version_	1866915376329129984
author	Park, Kwangsuk Yang, Jiwoong
author_facet	Park, Kwangsuk Yang, Jiwoong
contents	Recent advances in AI-assisted education have encouraged the integration of vision-language models (VLMs) into academic assessment, particularly for tasks that require both quantitative and qualitative evaluation. However, existing VLM based approaches struggle with complex educational artifacts, such as programming tasks with executable components and measurable outputs, that require structured reasoning and alignment with clearly defined evaluation criteria. We introduce AGACCI, a multi-agent system that distributes specialized evaluation roles across collaborative agents to improve accuracy, interpretability, and consistency in code-oriented assessment. To evaluate the framework, we collected 360 graduate-level code-based assignments from 60 participants, each annotated by domain experts with binary rubric scores and qualitative feedback. Experimental results demonstrate that AGACCI outperforms a single GPT-based baseline in terms of rubric and feedback accuracy, relevance, consistency, and coherence, while preserving the instructional intent and evaluative depth of expert assessments. Although performance varies across task types, AGACCI highlights the potential of multi-agent systems for scalable and context-aware educational evaluation.
format	Preprint
id	arxiv_https___arxiv_org_abs_2507_05321
institution	arXiv
publishDate	2025
record_format	arxiv
spellingShingle	AGACCI : Affiliated Grading Agents for Criteria-Centric Interface in Educational Coding Contexts Park, Kwangsuk Yang, Jiwoong Computers and Society Artificial Intelligence Recent advances in AI-assisted education have encouraged the integration of vision-language models (VLMs) into academic assessment, particularly for tasks that require both quantitative and qualitative evaluation. However, existing VLM based approaches struggle with complex educational artifacts, such as programming tasks with executable components and measurable outputs, that require structured reasoning and alignment with clearly defined evaluation criteria. We introduce AGACCI, a multi-agent system that distributes specialized evaluation roles across collaborative agents to improve accuracy, interpretability, and consistency in code-oriented assessment. To evaluate the framework, we collected 360 graduate-level code-based assignments from 60 participants, each annotated by domain experts with binary rubric scores and qualitative feedback. Experimental results demonstrate that AGACCI outperforms a single GPT-based baseline in terms of rubric and feedback accuracy, relevance, consistency, and coherence, while preserving the instructional intent and evaluative depth of expert assessments. Although performance varies across task types, AGACCI highlights the potential of multi-agent systems for scalable and context-aware educational evaluation.
title	AGACCI : Affiliated Grading Agents for Criteria-Centric Interface in Educational Coding Contexts
topic	Computers and Society Artificial Intelligence
url	https://arxiv.org/abs/2507.05321

Ähnliche Einträge