:: Library Catalog

Bibliographic Details
Main Authors:	Alhafni, Bashar; Vajjala, Sowmya; Bannò, Stefano; Maurya, Kaushal Kumar; Kochmar, Ekaterina
Format:	Preprint
Published:	2024
Subjects:	Computation and Language
Online Access:	https://arxiv.org/abs/2409.11917

Similar Items

Opportunities and Challenges of LLMs in Education: An NLP Perspective
by: Vajjala, Sowmya, et al.
Published: (2025)

Pedagogy-driven Evaluation of Generative AI-powered Intelligent Tutoring Systems
by: Maurya, Kaushal Kumar, et al.
Published: (2025)

LLMs cannot spot math errors, even when allowed to peek into the solution
by: Srivatsa, KV Aditya, et al.
Published: (2025)

Can LLMs Reliably Simulate Real Students' Abilities in Mathematics and Reading Comprehension?
by: Srivatsa, KV Aditya, et al.
Published: (2025)

SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
by: Maurya, Kaushal Kumar, et al.
Published: (2024)

Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing
by: Srivatsa, KV Aditya, et al.
Published: (2024)

AITutor-EvalKit: Exploring the Capabilities of AI Tutors
by: Naeem, Numaan, et al.
Published: (2025)

Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
by: Maurya, Kaushal Kumar, et al.
Published: (2024)

IndicGEC: Powerful Models, or a Measurement Mirage?
by: Vajjala, Sowmya
Published: (2025)

The Problem with Safety Classification is not just the Models
by: Vajjala, Sowmya
Published: (2025)

MetricalARGS: A Taxonomy for Studying Metrical Poetry with LLMs
by: Kranti, Chalamalasetti, et al.
Published: (2025)

What Makes Math Word Problems Challenging for LLMs?
by: Srivatsa, KV Aditya, et al.
Published: (2024)

Enhancing Text Editing for Grammatical Error Correction: Arabic as a Case Study
by: Alhafni, Bashar, et al.
Published: (2025)

Simulating LLM-to-LLM Tutoring for Multilingual Math Feedback
by: Tonga, Junior Cedric, et al.
Published: (2025)

What Makes Cryptic Crosswords Challenging for LLMs?
by: Sadallah, Abdelrahman, et al.
Published: (2024)

Dravidian language family through Universal Dependencies lens
by: Rama, Taraka, et al.
Published: (2024)

Text Classification in the LLM Era -- Where do we stand?
by: Vajjala, Sowmya, et al.
Published: (2025)

Does Synthetic Data Help Named Entity Recognition for Low-Resource Languages?
by: Kamath, Gaurav, et al.
Published: (2025)

MATA: Mindful Assessment of the Telugu Abilities of Large Language Models
by: Kranti, Chalamalasetti, et al.
Published: (2025)

Findings of the BEA 2025 Shared Task on Pedagogical Ability Assessment of AI-powered Tutors
by: Kochmar, Ekaterina, et al.
Published: (2025)

Annotation Errors and NER: A Study with OntoNotes 5.0
by: Bernier-Colborne, Gabriel, et al.
Published: (2024)

Arabic Morphosyntactic Tagging and Dependency Parsing with Large Language Models
by: Adel, Mohamed, et al.
Published: (2026)

Personalized Text Generation with Fine-Grained Linguistic Control
by: Alhafni, Bashar, et al.
Published: (2024)

Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation
by: Barakat, Mariam, et al.
Published: (2026)

A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic
by: Gonzalez, Juan Moreno, et al.
Published: (2025)

Are LLMs Good Cryptic Crossword Solvers?
by: Sadallah, Abdelrahman, et al.
Published: (2024)

REFeREE: A REference-FREE Model-Based Metric for Text Simplification
by: Huang, Yichen, et al.
Published: (2024)

Intent Matters: Enhancing AI Tutoring with Fine-Grained Pedagogical Intent Annotation
by: Petukhova, Kseniia, et al.
Published: (2025)

A Fully Automated Pipeline for Conversational Discourse Annotation: Tree Scheme Generation and Labeling with Large Language Models
by: Petukhova, Kseniia, et al.
Published: (2025)

Towards Reward Modeling for AI Tutors in Math Mistake Remediation
by: Petukhova, Kseniia, et al.
Published: (2026)

Scope Ambiguities in Large Language Models
by: Kamath, Gaurav, et al.
Published: (2024)

mEdIT: Multilingual Text Editing via Instruction Tuning
by: Raheja, Vipul, et al.
Published: (2024)

Test Set Quality in Multilingual LLM Evaluation
by: Kranti, Chalamalasetti, et al.
Published: (2025)

Strategies for Arabic Readability Modeling
by: Liberato, Juan Piñeros, et al.
Published: (2024)

Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection
by: Qwaider, Chatrine, et al.
Published: (2025)

ARWI: Arabic Write and Improve
by: Chirkunov, Kirill, et al.
Published: (2025)

The SAMER Arabic Text Simplification Corpus
by: Alhafni, Bashar, et al.
Published: (2024)

Towards Self-Referential Analytic Assessment: A Profile-Based Approach to L2 Writing Evaluation with LLMs
by: Bannò, Stefano, et al.
Published: (2026)

How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes
by: Elkins, Sabina, et al.
Published: (2024)

PetKaz at SemEval-2024 Task 3: Advancing Emotion Classification with an LLM for Emotion-Cause Pair Extraction in Conversations
by: Kazakov, Roman, et al.
Published: (2024)