:: Library Catalog

Cover Image

Saved in:

Bibliographic Details
Main Authors:	Huynh, Ryan, Guerin, Frank, Callwood, Alison
Format:	Preprint
Published:	2026
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2602.02360
Tags:	Add Tag No Tags, Be the first to tag this record!

Similar Items

Finding Challenging Metaphors that Confuse Pretrained Language Models
by: Li, Yucheng, et al.
Published: (2024)

An Open Source Data Contamination Report for Large Language Models
by: Li, Yucheng, et al.
Published: (2023)

LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction
by: Li, Yucheng, et al.
Published: (2023)

KAIJU: An Executive Kernel for Intent-Gated Execution of LLM Agents
by: Guerin, Cormac, et al.
Published: (2026)

Machine Learning-driven Multiscale MD Workflows: The Mini-MuMMI Experience
by: Pottier, Loïc, et al.
Published: (2025)

Evaluating Large Language Models for Generalization and Robustness via Data Compression
by: Li, Yucheng, et al.
Published: (2024)

Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments
by: Tu, Sichang, et al.
Published: (2024)

AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer
by: Leybzon, Danny D., et al.
Published: (2025)

GPTEval: A Survey on Assessments of ChatGPT and GPT-4
by: Mao, Rui, et al.
Published: (2023)

Automating the Information Extraction from Semi-Structured Interview Transcripts
by: Parfenova, Angelina
Published: (2024)

Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring
by: Cai, Yida, et al.
Published: (2025)

Benchmarking Generative AI for Scoring Medical Student Interviews in Objective Structured Clinical Examinations (OSCEs)
by: Geathers, Jadon, et al.
Published: (2025)

Long Context Automated Essay Scoring with Language Models
by: Ormerod, Christopher, et al.
Published: (2025)

Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
by: Ormerod, Christopher
Published: (2025)

The Impact of Syntactic and Semantic Proximity on Machine Translation with Back-Translation
by: Guerin, Nicolas, et al.
Published: (2024)

From Prompting to Preference Optimization: A Comparative Study of LLM-based Automated Essay Scoring
by: Nguyen, Minh Hoang, et al.
Published: (2026)

PolInterviews -- A Dataset of German Politician Public Broadcast Interviews
by: Birkenmaier, Lukas, et al.
Published: (2025)

BioMNER: A Dataset for Biomedical Method Entity Recognition
by: Tang, Chen, et al.
Published: (2024)

Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection
by: Qwaider, Chatrine, et al.
Published: (2025)

Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards
by: Do, Heejin, et al.
Published: (2024)

Operationalizing Automated Essay Scoring: A Human-Aware Approach
by: Plasencia-Calaña, Yenisel
Published: (2025)

InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation
by: Li, Yu, et al.
Published: (2026)

IELTS Writing Revision Platform with Automated Essay Scoring and Adaptive Feedback
by: Ramancauskas, Titas, et al.
Published: (2025)

Automated Essay Scoring and Language Certification: Assessing Generalizability, Agreement and Validity for French
by: Wilkens, Rodrigo, et al.
Published: (2026)

AI Conversational Interviewing: Transforming Surveys with LLMs as Adaptive Interviewers
by: Wuttke, Alexander, et al.
Published: (2024)

Automated Refinement of Essay Scoring Rubrics for Language Models via Reflect-and-Revise
by: Harada, Keno, et al.
Published: (2025)

LAILA: A Large Trait-Based Dataset for Arabic Automated Essay Scoring
by: Bashendy, May, et al.
Published: (2025)

Automated Scoring of Clinical Patient Notes using Advanced NLP and Pseudo Labeling
by: Xu, Jingyu, et al.
Published: (2024)

Automated Genre-Aware Article Scoring and Feedback Using Large Language Models
by: Wang, Chihang, et al.
Published: (2024)

Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
by: Pala, Tej Deep, et al.
Published: (2024)

Adversarial Topic-aware Prompt-tuning for Cross-topic Automated Essay Scoring
by: Zhang, Chunyun, et al.
Published: (2025)

Beyond the Score: Uncertainty-Calibrated LLMs for Automated Essay Assessment
by: Karim, Ahmed, et al.
Published: (2025)

Empirical Analysis of the Effect of Context in the Task of Automated Essay Scoring in Transformer-Based Models
by: Chakravarty, Abhirup
Published: (2025)

Interview AI-ssistant: Designing for Real-Time Human-AI Collaboration in Interview Preparation and Execution
by: Liu, Zhe
Published: (2025)

Interpretability from the Ground Up: Stakeholder-Centric Design of Automated Scoring in Educational Assessments
by: Kim, Yunsung, et al.
Published: (2025)

Unveiling the Tapestry of Automated Essay Scoring: A Comprehensive Investigation of Accuracy, Fairness, and Generalizability
by: Yang, Kaixun, et al.
Published: (2024)

SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
by: Aynetdinov, Ansar, et al.
Published: (2024)

Hire-Smart: Automating Candidate Screening and Interview Analysis
by: Yadav, Sandesh, et al.
Published: (2026)

Exploration of Summarization by Generative Language Models for Automated Scoring of Long Essays
by: Hua, Haowei, et al.
Published: (2025)

AI-generated Essays: Characteristics and Implications on Automated Scoring and Academic Integrity
by: Zhong, Yang, et al.
Published: (2024)