Saved in:
Bibliographic Details
Main Authors: Zhang, Jialu, Sun, Qingyang, Wang, Qianyi, Zhang, Weiyi, Xiao, Zunjie, Zhang, Xiaoqing, Ren, Jianfeng, Liu, Jiang
Format: Preprint
Published: 2025
Subjects:
Online Access:https://arxiv.org/abs/2507.19556
Tags: Add Tag
No Tags, Be the first to tag this record!
_version_ 1866912502258860032
author Zhang, Jialu
Sun, Qingyang
Wang, Qianyi
Zhang, Weiyi
Xiao, Zunjie
Zhang, Xiaoqing
Ren, Jianfeng
Liu, Jiang
author_facet Zhang, Jialu
Sun, Qingyang
Wang, Qianyi
Zhang, Weiyi
Xiao, Zunjie
Zhang, Xiaoqing
Ren, Jianfeng
Liu, Jiang
contents The undergraduate thesis (UGTE) plays an indispensable role in assessing a student's cumulative academic development throughout their college years. Although large language models (LLMs) have advanced education intelligence, they typically focus on holistic assessment with only one single evaluation score, but ignore the intricate nuances across multifaceted criteria, limiting their ability to reflect structural criteria, pedagogical objectives, and diverse academic competencies. Meanwhile, pedagogical theories have long informed manual UGTE evaluation through multi-dimensional assessment of cognitive development, disciplinary thinking, and academic performance, yet remain underutilized in automated settings. Motivated by the research gap, we pioneer PEMUTA, a pedagogically-enriched framework that effectively activates domain-specific knowledge from LLMs for multi-granular UGTE assessment. Guided by Vygotsky's theory and Bloom's Taxonomy, PEMUTA incorporates a hierarchical prompting scheme that evaluates UGTEs across six fine-grained dimensions: Structure, Logic, Originality, Writing, Proficiency, and Rigor (SLOWPR), followed by holistic synthesis. Two in-context learning techniques, \ie, few-shot prompting and role-play prompting, are also incorporated to further enhance alignment with expert judgments without fine-tuning. We curate a dataset of authentic UGTEs with expert-provided SLOWPR-aligned annotations to support multi-granular UGTE assessment. Extensive experiments demonstrate that PEMUTA achieves strong alignment with expert evaluations, and exhibits strong potential for fine-grained, pedagogically-informed UGTE evaluations.
format Preprint
id arxiv_https___arxiv_org_abs_2507_19556
institution arXiv
publishDate 2025
record_format arxiv
spellingShingle PEMUTA: Pedagogically-Enriched Multi-Granular Undergraduate Thesis Assessment
Zhang, Jialu
Sun, Qingyang
Wang, Qianyi
Zhang, Weiyi
Xiao, Zunjie
Zhang, Xiaoqing
Ren, Jianfeng
Liu, Jiang
Computers and Society
Artificial Intelligence
The undergraduate thesis (UGTE) plays an indispensable role in assessing a student's cumulative academic development throughout their college years. Although large language models (LLMs) have advanced education intelligence, they typically focus on holistic assessment with only one single evaluation score, but ignore the intricate nuances across multifaceted criteria, limiting their ability to reflect structural criteria, pedagogical objectives, and diverse academic competencies. Meanwhile, pedagogical theories have long informed manual UGTE evaluation through multi-dimensional assessment of cognitive development, disciplinary thinking, and academic performance, yet remain underutilized in automated settings. Motivated by the research gap, we pioneer PEMUTA, a pedagogically-enriched framework that effectively activates domain-specific knowledge from LLMs for multi-granular UGTE assessment. Guided by Vygotsky's theory and Bloom's Taxonomy, PEMUTA incorporates a hierarchical prompting scheme that evaluates UGTEs across six fine-grained dimensions: Structure, Logic, Originality, Writing, Proficiency, and Rigor (SLOWPR), followed by holistic synthesis. Two in-context learning techniques, \ie, few-shot prompting and role-play prompting, are also incorporated to further enhance alignment with expert judgments without fine-tuning. We curate a dataset of authentic UGTEs with expert-provided SLOWPR-aligned annotations to support multi-granular UGTE assessment. Extensive experiments demonstrate that PEMUTA achieves strong alignment with expert evaluations, and exhibits strong potential for fine-grained, pedagogically-informed UGTE evaluations.
title PEMUTA: Pedagogically-Enriched Multi-Granular Undergraduate Thesis Assessment
topic Computers and Society
Artificial Intelligence
url https://arxiv.org/abs/2507.19556