Saved in:
| Main Authors: | Alhafni, Bashar, Vajjala, Sowmya, Bannò, Stefano, Maurya, Kaushal Kumar, Kochmar, Ekaterina |
|---|---|
| Format: | Preprint |
| Published: |
2024
|
| Subjects: | |
| Online Access: | https://arxiv.org/abs/2409.11917 |
| Tags: |
Add Tag
No Tags, Be the first to tag this record!
|
Similar Items
Opportunities and Challenges of LLMs in Education: An NLP Perspective
by: Vajjala, Sowmya, et al.
Published: (2025)
by: Vajjala, Sowmya, et al.
Published: (2025)
Pedagogy-driven Evaluation of Generative AI-powered Intelligent Tutoring Systems
by: Maurya, Kaushal Kumar, et al.
Published: (2025)
by: Maurya, Kaushal Kumar, et al.
Published: (2025)
LLMs cannot spot math errors, even when allowed to peek into the solution
by: Srivatsa, KV Aditya, et al.
Published: (2025)
by: Srivatsa, KV Aditya, et al.
Published: (2025)
Can LLMs Reliably Simulate Real Students' Abilities in Mathematics and Reading Comprehension?
by: Srivatsa, KV Aditya, et al.
Published: (2025)
by: Srivatsa, KV Aditya, et al.
Published: (2025)
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
by: Maurya, Kaushal Kumar, et al.
Published: (2024)
by: Maurya, Kaushal Kumar, et al.
Published: (2024)
Harnessing the Power of Multiple Minds: Lessons Learned from LLM Routing
by: Srivatsa, KV Aditya, et al.
Published: (2024)
by: Srivatsa, KV Aditya, et al.
Published: (2024)
AITutor-EvalKit: Exploring the Capabilities of AI Tutors
by: Naeem, Numaan, et al.
Published: (2025)
by: Naeem, Numaan, et al.
Published: (2025)
Unifying AI Tutor Evaluation: An Evaluation Taxonomy for Pedagogical Ability Assessment of LLM-Powered AI Tutors
by: Maurya, Kaushal Kumar, et al.
Published: (2024)
by: Maurya, Kaushal Kumar, et al.
Published: (2024)
IndicGEC: Powerful Models, or a Measurement Mirage?
by: Vajjala, Sowmya
Published: (2025)
by: Vajjala, Sowmya
Published: (2025)
The Problem with Safety Classification is not just the Models
by: Vajjala, Sowmya
Published: (2025)
by: Vajjala, Sowmya
Published: (2025)
MetricalARGS: A Taxonomy for Studying Metrical Poetry with LLMs
by: Kranti, Chalamalasetti, et al.
Published: (2025)
by: Kranti, Chalamalasetti, et al.
Published: (2025)
What Makes Math Word Problems Challenging for LLMs?
by: Srivatsa, KV Aditya, et al.
Published: (2024)
by: Srivatsa, KV Aditya, et al.
Published: (2024)
Enhancing Text Editing for Grammatical Error Correction: Arabic as a Case Study
by: Alhafni, Bashar, et al.
Published: (2025)
by: Alhafni, Bashar, et al.
Published: (2025)
Simulating LLM-to-LLM Tutoring for Multilingual Math Feedback
by: Tonga, Junior Cedric, et al.
Published: (2025)
by: Tonga, Junior Cedric, et al.
Published: (2025)
What Makes Cryptic Crosswords Challenging for LLMs?
by: Sadallah, Abdelrahman, et al.
Published: (2024)
by: Sadallah, Abdelrahman, et al.
Published: (2024)
Dravidian language family through Universal Dependencies lens
by: Rama, Taraka, et al.
Published: (2024)
by: Rama, Taraka, et al.
Published: (2024)
Text Classification in the LLM Era -- Where do we stand?
by: Vajjala, Sowmya, et al.
Published: (2025)
by: Vajjala, Sowmya, et al.
Published: (2025)
Does Synthetic Data Help Named Entity Recognition for Low-Resource Languages?
by: Kamath, Gaurav, et al.
Published: (2025)
by: Kamath, Gaurav, et al.
Published: (2025)
MATA: Mindful Assessment of the Telugu Abilities of Large Language Models
by: Kranti, Chalamalasetti, et al.
Published: (2025)
by: Kranti, Chalamalasetti, et al.
Published: (2025)
Findings of the BEA 2025 Shared Task on Pedagogical Ability Assessment of AI-powered Tutors
by: Kochmar, Ekaterina, et al.
Published: (2025)
by: Kochmar, Ekaterina, et al.
Published: (2025)
Annotation Errors and NER: A Study with OntoNotes 5.0
by: Bernier-Colborne, Gabriel, et al.
Published: (2024)
by: Bernier-Colborne, Gabriel, et al.
Published: (2024)
Arabic Morphosyntactic Tagging and Dependency Parsing with Large Language Models
by: Adel, Mohamed, et al.
Published: (2026)
by: Adel, Mohamed, et al.
Published: (2026)
Personalized Text Generation with Fine-Grained Linguistic Control
by: Alhafni, Bashar, et al.
Published: (2024)
by: Alhafni, Bashar, et al.
Published: (2024)
Teaching Through Analogies: A Modular Pipeline for Educational Analogy Generation
by: Barakat, Mariam, et al.
Published: (2026)
by: Barakat, Mariam, et al.
Published: (2026)
A Tale of Two Scripts: Transliteration and Post-Correction for Judeo-Arabic
by: Gonzalez, Juan Moreno, et al.
Published: (2025)
by: Gonzalez, Juan Moreno, et al.
Published: (2025)
Are LLMs Good Cryptic Crossword Solvers?
by: Sadallah, Abdelrahman, et al.
Published: (2024)
by: Sadallah, Abdelrahman, et al.
Published: (2024)
REFeREE: A REference-FREE Model-Based Metric for Text Simplification
by: Huang, Yichen, et al.
Published: (2024)
by: Huang, Yichen, et al.
Published: (2024)
Intent Matters: Enhancing AI Tutoring with Fine-Grained Pedagogical Intent Annotation
by: Petukhova, Kseniia, et al.
Published: (2025)
by: Petukhova, Kseniia, et al.
Published: (2025)
A Fully Automated Pipeline for Conversational Discourse Annotation: Tree Scheme Generation and Labeling with Large Language Models
by: Petukhova, Kseniia, et al.
Published: (2025)
by: Petukhova, Kseniia, et al.
Published: (2025)
Towards Reward Modeling for AI Tutors in Math Mistake Remediation
by: Petukhova, Kseniia, et al.
Published: (2026)
by: Petukhova, Kseniia, et al.
Published: (2026)
Scope Ambiguities in Large Language Models
by: Kamath, Gaurav, et al.
Published: (2024)
by: Kamath, Gaurav, et al.
Published: (2024)
mEdIT: Multilingual Text Editing via Instruction Tuning
by: Raheja, Vipul, et al.
Published: (2024)
by: Raheja, Vipul, et al.
Published: (2024)
Test Set Quality in Multilingual LLM Evaluation
by: Kranti, Chalamalasetti, et al.
Published: (2025)
by: Kranti, Chalamalasetti, et al.
Published: (2025)
Strategies for Arabic Readability Modeling
by: Liberato, Juan Piñeros, et al.
Published: (2024)
by: Liberato, Juan Piñeros, et al.
Published: (2024)
Enhancing Arabic Automated Essay Scoring with Synthetic Data and Error Injection
by: Qwaider, Chatrine, et al.
Published: (2025)
by: Qwaider, Chatrine, et al.
Published: (2025)
ARWI: Arabic Write and Improve
by: Chirkunov, Kirill, et al.
Published: (2025)
by: Chirkunov, Kirill, et al.
Published: (2025)
The SAMER Arabic Text Simplification Corpus
by: Alhafni, Bashar, et al.
Published: (2024)
by: Alhafni, Bashar, et al.
Published: (2024)
Towards Self-Referential Analytic Assessment: A Profile-Based Approach to L2 Writing Evaluation with LLMs
by: Bannò, Stefano, et al.
Published: (2026)
by: Bannò, Stefano, et al.
Published: (2026)
How Teachers Can Use Large Language Models and Bloom's Taxonomy to Create Educational Quizzes
by: Elkins, Sabina, et al.
Published: (2024)
by: Elkins, Sabina, et al.
Published: (2024)
PetKaz at SemEval-2024 Task 3: Advancing Emotion Classification with an LLM for Emotion-Cause Pair Extraction in Conversations
by: Kazakov, Roman, et al.
Published: (2024)
by: Kazakov, Roman, et al.
Published: (2024)
Similar Items
-
Opportunities and Challenges of LLMs in Education: An NLP Perspective
by: Vajjala, Sowmya, et al.
Published: (2025) -
Pedagogy-driven Evaluation of Generative AI-powered Intelligent Tutoring Systems
by: Maurya, Kaushal Kumar, et al.
Published: (2025) -
LLMs cannot spot math errors, even when allowed to peek into the solution
by: Srivatsa, KV Aditya, et al.
Published: (2025) -
Can LLMs Reliably Simulate Real Students' Abilities in Mathematics and Reading Comprehension?
by: Srivatsa, KV Aditya, et al.
Published: (2025) -
SelectLLM: Query-Aware Efficient Selection Algorithm for Large Language Models
by: Maurya, Kaushal Kumar, et al.
Published: (2024)