Saved in:
| Main Authors: | Huynh, Ryan, Guerin, Frank, Callwood, Alison |
|---|---|
| Format: | Preprint |
| Published: |
2026
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2602.02360 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Finding Challenging Metaphors that Confuse Pretrained Language Models
by: Li, Yucheng, et al.
Published: (2024)
by: Li, Yucheng, et al.
Published: (2024)
An Open Source Data Contamination Report for Large Language Models
by: Li, Yucheng, et al.
Published: (2023)
by: Li, Yucheng, et al.
Published: (2023)
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction
by: Li, Yucheng, et al.
Published: (2023)
by: Li, Yucheng, et al.
Published: (2023)
KAIJU: An Executive Kernel for Intent-Gated Execution of LLM Agents
by: Guerin, Cormac, et al.
Published: (2026)
by: Guerin, Cormac, et al.
Published: (2026)
Machine Learning-driven Multiscale MD Workflows: The Mini-MuMMI Experience
by: Pottier, Loïc, et al.
Published: (2025)
by: Pottier, Loïc, et al.
Published: (2025)
Evaluating Large Language Models for Generalization and Robustness via Data Compression
by: Li, Yucheng, et al.
Published: (2024)
by: Li, Yucheng, et al.
Published: (2024)
Automating PTSD Diagnostics in Clinical Interviews: Leveraging Large Language Models for Trauma Assessments
by: Tu, Sichang, et al.
Published: (2024)
by: Tu, Sichang, et al.
Published: (2024)
AI Telephone Surveying: Automating Quantitative Data Collection with an AI Interviewer
by: Leybzon, Danny D., et al.
Published: (2025)
by: Leybzon, Danny D., et al.
Published: (2025)
GPTEval: A Survey on Assessments of ChatGPT and GPT-4
by: Mao, Rui, et al.
Published: (2023)
by: Mao, Rui, et al.
Published: (2023)
Automating the Information Extraction from Semi-Structured Interview Transcripts
by: Parfenova, Angelina
Published: (2024)
by: Parfenova, Angelina
Published: (2024)
Rank-Then-Score: Enhancing Large Language Models for Automated Essay Scoring
by: Cai, Yida, et al.
Published: (2025)
by: Cai, Yida, et al.
Published: (2025)
Benchmarking Generative AI for Scoring Medical Student Interviews in Objective Structured Clinical Examinations (OSCEs)
by: Geathers, Jadon, et al.
Published: (2025)
by: Geathers, Jadon, et al.
Published: (2025)
Long Context Automated Essay Scoring with Language Models
by: Ormerod, Christopher, et al.
Published: (2025)
by: Ormerod, Christopher, et al.
Published: (2025)
Automated Essay Scoring Incorporating Annotations from Automated Feedback Systems
by: Ormerod, Christopher
Published: (2025)
by: Ormerod, Christopher
Published: (2025)
The Impact of Syntactic and Semantic Proximity on Machine Translation with Back-Translation
by: Guerin, Nicolas, et al.
Published: (2024)
by: Guerin, Nicolas, et al.
Published: (2024)
From Prompting to Preference Optimization: A Comparative Study of LLM-based Automated Essay Scoring
by: Nguyen, Minh Hoang, et al.
Published: (2026)
by: Nguyen, Minh Hoang, et al.
Published: (2026)
PolInterviews -- A Dataset of German Politician Public Broadcast Interviews
by: Birkenmaier, Lukas, et al.
Published: (2025)
by: Birkenmaier, Lukas, et al.
Published: (2025)
BioMNER: A Dataset for Biomedical Method Entity Recognition
by: Tang, Chen, et al.
Published: (2024)
by: Tang, Chen, et al.
Published: (2024)
Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection
by: Qwaider, Chatrine, et al.
Published: (2025)
by: Qwaider, Chatrine, et al.
Published: (2025)
Autoregressive Multi-trait Essay Scoring via Reinforcement Learning with Scoring-aware Multiple Rewards
by: Do, Heejin, et al.
Published: (2024)
by: Do, Heejin, et al.
Published: (2024)
Operationalizing Automated Essay Scoring: A Human-Aware Approach
by: Plasencia-Calaña, Yenisel
Published: (2025)
by: Plasencia-Calaña, Yenisel
Published: (2025)
InterviewSim: A Scalable Framework for Interview-Grounded Personality Simulation
by: Li, Yu, et al.
Published: (2026)
by: Li, Yu, et al.
Published: (2026)
IELTS Writing Revision Platform with Automated Essay Scoring and Adaptive Feedback
by: Ramancauskas, Titas, et al.
Published: (2025)
by: Ramancauskas, Titas, et al.
Published: (2025)
Automated Essay Scoring and Language Certification: Assessing Generalizability, Agreement and Validity for French
by: Wilkens, Rodrigo, et al.
Published: (2026)
by: Wilkens, Rodrigo, et al.
Published: (2026)
AI Conversational Interviewing: Transforming Surveys with LLMs as Adaptive Interviewers
by: Wuttke, Alexander, et al.
Published: (2024)
by: Wuttke, Alexander, et al.
Published: (2024)
Automated Refinement of Essay Scoring Rubrics for Language Models via Reflect-and-Revise
by: Harada, Keno, et al.
Published: (2025)
by: Harada, Keno, et al.
Published: (2025)
LAILA: A Large Trait-Based Dataset for Arabic Automated Essay Scoring
by: Bashendy, May, et al.
Published: (2025)
by: Bashendy, May, et al.
Published: (2025)
Automated Scoring of Clinical Patient Notes using Advanced NLP and Pseudo Labeling
by: Xu, Jingyu, et al.
Published: (2024)
by: Xu, Jingyu, et al.
Published: (2024)
Automated Genre-Aware Article Scoring and Feedback Using Large Language Models
by: Wang, Chihang, et al.
Published: (2024)
by: Wang, Chihang, et al.
Published: (2024)
Ferret: Faster and Effective Automated Red Teaming with Reward-Based Scoring Technique
by: Pala, Tej Deep, et al.
Published: (2024)
by: Pala, Tej Deep, et al.
Published: (2024)
Adversarial Topic-aware Prompt-tuning for Cross-topic Automated Essay Scoring
by: Zhang, Chunyun, et al.
Published: (2025)
by: Zhang, Chunyun, et al.
Published: (2025)
Beyond the Score: Uncertainty-Calibrated LLMs for Automated Essay Assessment
by: Karim, Ahmed, et al.
Published: (2025)
by: Karim, Ahmed, et al.
Published: (2025)
Empirical Analysis of the Effect of Context in the Task of Automated Essay Scoring in Transformer-Based Models
by: Chakravarty, Abhirup
Published: (2025)
by: Chakravarty, Abhirup
Published: (2025)
Interview AI-ssistant: Designing for Real-Time Human-AI Collaboration in Interview Preparation and Execution
by: Liu, Zhe
Published: (2025)
by: Liu, Zhe
Published: (2025)
Interpretability from the Ground Up: Stakeholder-Centric Design of Automated Scoring in Educational Assessments
by: Kim, Yunsung, et al.
Published: (2025)
by: Kim, Yunsung, et al.
Published: (2025)
Unveiling the Tapestry of Automated Essay Scoring: A Comprehensive Investigation of Accuracy, Fairness, and Generalizability
by: Yang, Kaixun, et al.
Published: (2024)
by: Yang, Kaixun, et al.
Published: (2024)
SemScore: Automated Evaluation of Instruction-Tuned LLMs based on Semantic Textual Similarity
by: Aynetdinov, Ansar, et al.
Published: (2024)
by: Aynetdinov, Ansar, et al.
Published: (2024)
Hire-Smart: Automating Candidate Screening and Interview Analysis
by: Yadav, Sandesh, et al.
Published: (2026)
by: Yadav, Sandesh, et al.
Published: (2026)
Exploration of Summarization by Generative Language Models for Automated Scoring of Long Essays
by: Hua, Haowei, et al.
Published: (2025)
by: Hua, Haowei, et al.
Published: (2025)
AI-generated Essays: Characteristics and Implications on Automated Scoring and Academic Integrity
by: Zhong, Yang, et al.
Published: (2024)
by: Zhong, Yang, et al.
Published: (2024)
Similar Items
-
Finding Challenging Metaphors that Confuse Pretrained Language Models
by: Li, Yucheng, et al.
Published: (2024) -
An Open Source Data Contamination Report for Large Language Models
by: Li, Yucheng, et al.
Published: (2023) -
LatestEval: Addressing Data Contamination in Language Model Evaluation through Dynamic and Time-Sensitive Test Construction
by: Li, Yucheng, et al.
Published: (2023) -
KAIJU: An Executive Kernel for Intent-Gated Execution of LLM Agents
by: Guerin, Cormac, et al.
Published: (2026) -
Machine Learning-driven Multiscale MD Workflows: The Mini-MuMMI Experience
by: Pottier, Loïc, et al.
Published: (2025)